Difference between revisions of "UKI-SOUTHGRID-RALPP"

From GridPP Wiki
Jump to: navigation, search
 
(No difference)

Latest revision as of 14:26, 25 July 2012

UKI-SOUTHGRID-RALPP

Topic: HEPSPEC06


UKI-SOUTHGRID-RALPP
CPU OS Kernel 32/64 mem gcc Total Per Core Notes
Dual Opteron 270 @ 2GHz SL4.6 2.6.9-89.0.9.ELsmp 64 4GB 3.4.6 27.75 6.94
Dual Xeon 5130 @ 2 GHz SL4.6 2.6.9-89.0.9.ELsmp 64 8GB 3.4.6 26.49 6.62
Dual Xeon E5410 @ 2.33GHz SL4.6 2.6.9-89.0.9.ELsmp 64 16GB 3.4.6 57.98 7.25
Dual Xeon E5420 @ 2.50GHz SL4.6 2.6.9-89.0.9.ELsmp 64 16GB 3.4.6 61.13 7.64
Dual Xeon L5420 @ 2.50GHz SL4.6 2.6.9-89.0.9.ELsmp 64 16GB 3.4.6 60.82 7.60
Dual Opteron 270 @ 2GHz SL5.2 2.6.18-128.7.1.el5 64 4GB 4.1.2 27.18 6.80
Dual Xeon 5130 @ 2 GHz SL5.2 2.6.18-128.7.1.el5 64 8GB 4.1.2 30.88 7.72
Dual Xeon E5410 @ 2.33GHz SL5.2 2.6.18-128.7.1.el5 64 16GB 4.1.2 65.45 8.18
Dual Xeon E5420 @ 2.50GHz SL5.2 2.6.18-92.1.6.el5 64 16GB 4.1.2 68.00 8.50
Dual Xeon L5420 @ 2.50GHz SL5.2 2.6.18-92.1.6.el5 64 16GB 4.1.2 68.15 8.52
Dual Xeon E5520 @ 2.27GHz SL5.4 2.6.18-194.26.1.el5 64 24GB 4.1.2 91.07 11.38 HT enabled but running 8 processes (1/core)
Dual Xeon X5650 (6 core) @ 2.67GHz SL5.4 2.6.18-194.32.1.el5 64 48GB 4.1.2 166.39 13.87
Dual Xeon X5650 (6 core) @ 2.67GHz SL5.7 2.6.18-274.18.1.el5 64 48GB 4.1.2-51 195.58 10.87 HT, running 18 processes
Dual Xeon X5645 (6 core) @ 2.40GHz SL5.7 2.6.18-274.18.1.el5 64 48GB 4.1.2-51 167.66 9.31 HT, running 18 processes



Topic: Middleware_transition


2 lcg-CE's, Both now decommissioned and reinstalled as CreamCEs.

gLite3.2/EMI


ARGUS: gLite-3.2

Will replace with second VM and transition service to that

BDII_site: gLite-3.2

Will replace with second VM and transition service to that

Cluster Publisher: UMD 1.1

CE (CREAM/LCG): 1x gLite 3.2 2 x UMD 1.1

gLite CreamCE will be decommissioned soon and another UMD CreamCE installed to replace it

WN/glexec: gLite 3.2

Believe there are current issues with the UMD release of WN and it is not currently recommended, will do a rolling reinstall/upgrade once it is.
gLexec version will follow WNs

SE: dCache 1.9.5

Starting testing update to 1.9.12

UI: gLite 3.2

No current plans for update, will probably do a rolling reinstall

Comments


Topic: Protected_Site_networking


TBC



Topic: Resiliency_and_Disaster_Planning




      • This section intentionally left blank


Topic: SL4_Survey_August_2011


2 lcg-CE's, one upgrade scheduled thie week, the other in the next few weeks.

Topic: Site_information


Memory

1. Real physical memory per job slot:
1 or 2 GB/Core depending on node type, have VO queues that publish 985 MB/Core and SubCluster Queues that publish 500, 1000 and 2000 MB/Core.

2. Real memory limit beyond which a job is killed:
Not currently implemented although if a node starts to run out of swap and we notice in time we may manually kill jobs.

3. Virtual memory limit beyond which a job is killed:
See above

4. Number of cores per WN:

Comments:

we don't currently kill jobs for over memory use and just try to use it to give more info to the batch system, however, due to problem jobs killing Worker Nodes recently we may implement some killing policy, probably at 125% or 150% of the published queue limit (may be lower for higher memory queues)

Network

1. WAN problems experienced in the last year:

None.

2. Problems/issues seen with site networking:

Slight issue with link between the different sections of our cluster which was only 1GB, now increased to 2GB and things seem better.

3. Forward look:

In the future we will be separating the farm further with the Disk and CPU resources being hosted in different Machine Rooms and the link between these will become critical. We are looking at ways to upgrade this to 10Gb/s.

Comments:


Topic: Site_status_and_plans



SL5 WNs

Current status (date): All WNs nodes running SL5 (Was the first site to move across)

Planned upgrade:

Comments:

SRM

Current status (date): dcache 1.9.1-7

Planned upgrade:

Comments:

ARGUS/glexec

Current status (date): Installed and working(22.3.11)

Planned deployment:

Comments:

CREAM CE

Current status (date): Installed and working (22.3.11)

Planned deployment:

Comments:

glite-APEL

Current status (date): Installed and working (22.3.11)

Planned deployment:

Comments: