UKI-SOUTHGRID-RALPP
Contents
UKI-SOUTHGRID-RALPP
Topic: HEPSPEC06
CPU | OS | Kernel | 32/64 | mem | gcc | Total | Per Core | Notes |
Dual Opteron 270 @ 2GHz | SL4.6 | 2.6.9-89.0.9.ELsmp | 64 | 4GB | 3.4.6 | 27.75 | 6.94 | |
Dual Xeon 5130 @ 2 GHz | SL4.6 | 2.6.9-89.0.9.ELsmp | 64 | 8GB | 3.4.6 | 26.49 | 6.62 | |
Dual Xeon E5410 @ 2.33GHz | SL4.6 | 2.6.9-89.0.9.ELsmp | 64 | 16GB | 3.4.6 | 57.98 | 7.25 | |
Dual Xeon E5420 @ 2.50GHz | SL4.6 | 2.6.9-89.0.9.ELsmp | 64 | 16GB | 3.4.6 | 61.13 | 7.64 | |
Dual Xeon L5420 @ 2.50GHz | SL4.6 | 2.6.9-89.0.9.ELsmp | 64 | 16GB | 3.4.6 | 60.82 | 7.60 | |
Dual Opteron 270 @ 2GHz | SL5.2 | 2.6.18-128.7.1.el5 | 64 | 4GB | 4.1.2 | 27.18 | 6.80 | |
Dual Xeon 5130 @ 2 GHz | SL5.2 | 2.6.18-128.7.1.el5 | 64 | 8GB | 4.1.2 | 30.88 | 7.72 | |
Dual Xeon E5410 @ 2.33GHz | SL5.2 | 2.6.18-128.7.1.el5 | 64 | 16GB | 4.1.2 | 65.45 | 8.18 | |
Dual Xeon E5420 @ 2.50GHz | SL5.2 | 2.6.18-92.1.6.el5 | 64 | 16GB | 4.1.2 | 68.00 | 8.50 | |
Dual Xeon L5420 @ 2.50GHz | SL5.2 | 2.6.18-92.1.6.el5 | 64 | 16GB | 4.1.2 | 68.15 | 8.52 | |
Dual Xeon E5520 @ 2.27GHz | SL5.4 | 2.6.18-194.26.1.el5 | 64 | 24GB | 4.1.2 | 91.07 | 11.38 | HT enabled but running 8 processes (1/core) |
Dual Xeon X5650 (6 core) @ 2.67GHz | SL5.4 | 2.6.18-194.32.1.el5 | 64 | 48GB | 4.1.2 | 166.39 | 13.87 | |
Dual Xeon X5650 (6 core) @ 2.67GHz | SL5.7 | 2.6.18-274.18.1.el5 | 64 | 48GB | 4.1.2-51 | 195.58 | 10.87 | HT, running 18 processes |
Dual Xeon X5645 (6 core) @ 2.40GHz | SL5.7 | 2.6.18-274.18.1.el5 | 64 | 48GB | 4.1.2-51 | 167.66 | 9.31 | HT, running 18 processes |
Topic: Middleware_transition
2 lcg-CE's, Both now decommissioned and reinstalled as CreamCEs.
gLite3.2/EMI
ARGUS: gLite-3.2
Will replace with second VM and transition service to that
BDII_site: gLite-3.2
Will replace with second VM and transition service to that
Cluster Publisher: UMD 1.1
CE (CREAM/LCG): 1x gLite 3.2 2 x UMD 1.1
gLite CreamCE will be decommissioned soon and another UMD CreamCE installed to replace it
WN/glexec: gLite 3.2
Believe there are current issues with the UMD release of WN and it is not currently recommended, will do a rolling reinstall/upgrade once it is.
gLexec version will follow WNs
SE: dCache 1.9.5
Starting testing update to 1.9.12
UI: gLite 3.2
No current plans for update, will probably do a rolling reinstall
Comments
Topic: Protected_Site_networking
TBC
Topic: Resiliency_and_Disaster_Planning
- This section intentionally left blank
Topic: SL4_Survey_August_2011
2 lcg-CE's, one upgrade scheduled thie week, the other in the next few weeks.
Topic: Site_information
Memory
1. Real physical memory per job slot:
1 or 2 GB/Core depending on node type, have VO queues that publish 985 MB/Core and SubCluster Queues that publish 500, 1000 and 2000 MB/Core.
2. Real memory limit beyond which a job is killed:
Not currently implemented although if a node starts to run out of swap and we notice in time we may manually kill jobs.
3. Virtual memory limit beyond which a job is killed:
See above
4. Number of cores per WN:
Comments:
we don't currently kill jobs for over memory use and just try to use it to give more info to the batch system, however, due to problem jobs killing Worker Nodes recently we may implement some killing policy, probably at 125% or 150% of the published queue limit (may be lower for higher memory queues)
Network
1. WAN problems experienced in the last year:
None.
2. Problems/issues seen with site networking:
Slight issue with link between the different sections of our cluster which was only 1GB, now increased to 2GB and things seem better.
3. Forward look:
In the future we will be separating the farm further with the Disk and CPU resources being hosted in different Machine Rooms and the link between these will become critical. We are looking at ways to upgrade this to 10Gb/s.
Comments:
Topic: Site_status_and_plans
SL5 WNs
Current status (date): All WNs nodes running SL5 (Was the first site to move across)
Planned upgrade:
Comments:
SRM
Current status (date): dcache 1.9.1-7
Planned upgrade:
Comments:
ARGUS/glexec
Current status (date): Installed and working(22.3.11)
Planned deployment:
Comments:
CREAM CE
Current status (date): Installed and working (22.3.11)
Planned deployment:
Comments:
glite-APEL
Current status (date): Installed and working (22.3.11)
Planned deployment:
Comments: