Site status and plans

From GridPP Wiki
Revision as of 16:20, 24 July 2012 by Stephen jones (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Page for tracking site status and plans

This page has been created to provide a central GridPP reference page that allows the project to understand the status and plans of each site for pending and future middleware upgrades. It can be updated by the site administrators concerned or the Tier-2 (deputy) coordinators.

Background to batch system memory request for details:

SL5 worker nodes

Put here the percentage of your cluster on SL5 and/or an indication when nodes will be moved to SL5.

SRM upgrades

Record in this area the current version of the SRM and dates of any expected upgrades.


LondonGrid

UKI-LT2-BRUNEL

25 March 2011 One CreamCE in Production.

SL5 WNs

All WNs on SL5. New worker nodes to be deployed in April.


SRM

Storage available to reach 560 TB by end o March.


SCAS/glexec

glexec to be deployed in the first week of April/2011.


CREAM CE

One CreamCE in production.

Planned deployment:

Replacing 3 lcg-CEs with CreamCEs.

One Argus server to be deployed in April.

UKI-LT2-IC-HEP

SL5 WNs

Current status (date): Jan 2012: All WNs are running CentOS 5.7. We are running the glite-WN 3.2.11-0 tarball.


SRM

Current status (date): Jan 2012 dCache 1.9.12-13


Argus/glexec

Current status (date): Jan 2012: abandonned


CREAM CE

Current status (date): Jan 2012: Two working glite cream-ces (ceprod05 and ceprod06) in production. One EMI cream-ce (ceprod07) being debugged.

UKI-LT2-QMUL

SL5

Current status (date): (23 Mar 2011)

* All WNs SL5
* Cream CE: ce04
* glite-apel: apel01

Comments:

SRM

Current status (date): (19 April 2011)

  • StoRM 1.6.2: se03
- Storm 1.6 supports checksums and better permission checking. 
- This is an early adopters release and is now in production at QMUL. 

Planned upgrade: 1.6.3 when available. Current blocker is that it doesn't report space used correctly.

Comments: After some initial teething troubles, StoRM 1.6 seems to be running well - 19 Apr 2011. New storage to be brought online very soon.

SCAS/glexec/ARGUS

Current status (19 April 2011): Not yet deployed.

Planned deployment: We plan to deploy ARGUS and glexec soon, but will need a version compatible with our tarball worker node install.

Comments: We do not currently have the manpower to be beta testers of this.

CREAM CE

Current status (date): (23 Mar 2011) Deployed

Planned deployment: 1 Cream CE deployed. We will convert one of our remaining lcg-CEs to Cream and decomission the remaining lcg-CEs

Comments:

UKI-LT2-RHUL

SRM

Current status: DPM 1.8.3 in production since 15Dec11

Argus/glexec

Current status: Installed Argus/Glexec on all worker node and passing tests 26May11


Comments: CREAM CE

Current status: cream2 in production

UKI-LT2-UCL-CENTRAL

No such site! We plan to make the WNs of the UCL central computing facility, Legion, available through the site UKI-LT2-UCL-HEP.

UKI-LT2-UCL-HEP

SL5 WNs

Upgraded.

Comments: UCL-CENTRAL cluster will become available through UCL-HEP as SL5 nodes

SRM

Current status (date): DPM 1.7.3 (7/4/2011)

Planned upgrade:

Comments:

Argus/glexec

Current status (date): not deployed (7/4/2011)

Planned deployment: will deploy on HEP WNs, and no objection in principle to installing on Legion cluster.

Comments: no instructions yet for tarball installations

CREAM CE

Current status (date): deployed as of early April 2011, but still troubleshooting (7/4/2011)

Comments:

NorthGrid

UKI-NORTHGRID-LANCS-HEP

SL5 WNs

Current status (date): 22/3/2011 All nodes are on Sl5 or CentOs 5.

Planned upgrade: NA

Comments:

SRM

Current status (date): 22/3/2011 All SL5 DPM 1.7.4-7

Planned upgrade: Plan to upgrade to 1.8 in the near future.

Comments:

SCAS/glexec

Current status (date): 22/3/2011 Had some testing experiance, plan to roll it out after discussion with local admins and other work. Tarball install wanted, so will require liasing with other tar-sites.

Planned deployment: ETA May

Comments:

CREAM CE

Current status (date): 22/3/2011 New cluster behind CREAM CE. Running without trouble.

Planned deployment: Plan to deploy another cream ce for our other resources in April.

Comments:

UKI-NORTHGRID-LIV-HEP

SL5 WNs

All nodes are SL5.


Comments: na

SRM

Current status (date): DPM 1.8.2 (13/06/2012)

Planned upgrade: Network to be improved. Then CEs, TORQUE, WNs to EMI.

Comments: na

ARGUS/SCAS/glexec

Current status (date): EMI ARGUS is running on hepgrid9.ph.liv.ac.uk, glexec is installed on all worker nodes.

Planned deployment: Ready to roll out to whole Torque cluster, upon request.

Comments: na

CREAM CE

Current status (date): Deployed

Planned deployment:

Comments:

UKI-NORTHGRID-MAN-HEP

SL5 WNs

Current status (date): SL5 (21/10/09)

Planned upgrade: Upgrade to SL5 on all the nodes completed on 15/10/09

Comments:

SRM

Current status (date): DPM 1.7.2 (21/10/09)

Planned upgrade: Upgrade to DPM 1.7.2 on both SEs completed on the 16/10/09

Comments: currently proceeding to unify the two DPM instances as requested by atlas. Head node and pools all SL4.

ARGUS/glexec

Current status (21 Jun 11): ARGUS server installed and tested

Planned deployment: glexec is installed and currently partially configured on WNs

Comments:

CREAM CE

Current status (3 May 11): Both CEs are CREAM CE.

Planned deployment:

Comments:

UKI-NORTHGRID-SHEF-HEP

SL5 WNs

Current status (date): SL5 (20/04/2011)

Comments:

SRM

Current status (date): DPM 1.8.0 (head node and all disk servers)(20/04/2011)

Planned upgrade:

Comments:

SCAS/glexec

Current status (date):to be installed (20/04/2011)

Planned deployment: in June 2011(20/04/2011)

Comments:

CREAM CE

Current status (date): installed and in production (20/04/2011)

Planned deployment:

Comments:

ScotGrid

UKI-SCOTGRID-DURHAM

  • SL5 WNs and UI
    • Current status (date): 2011-04-26
    • Planned upgrade: Currently at SL55.
    • Comments: Most servers are on SL49. Just added a new lot of recent 2x6 cores nodes, being commissioned.
  • SRM
    • Current status (date): 2011-04-26
    • Planned upgrade: Currently DPM at 1.8.0 on SE, 1.7.4 on disk nodes.
    • Comments: Still running gLite 3.1/SL49 32b/64b.
  • SCAS/glexec
    • Current status (date): 2011-04-26
    • Planned deployment: Could be deployed in future on request.
    • Comments: Not needed as Durham does not run analysis or pilot jobs.
  • CREAM CE
    • Current status (date): 2011-04-26
    • Planned deployment: in progress
    • Comments: This is part of moving to gLite 3.2/SL5.
  • Other
    • Site software is a bit behind and some hardware is also fairly old, so there is a significant ongoing effort to update and upgrade , both at the platform level and the middleware level. There is a similar time consuming effort on the Institute systems.

UKI-SCOTGRID-ECDF

SL5 WNs

Current status (date): Upgraded on 29th Oct.

Planned upgrade:

Comments:Problem with LHCb SAM test (script looks in /etc/redhat-release). Seemingly not affecting actual jobs (confirming) ATLAS pilot jobs issue (work in progress) (SAM tests and SL test passing).

SRM

Current status (date): Running DPM 1.8.0-1 for a long time.

Planned upgrade:

Comments:

SCAS/glexec

Current status (date): Not deployed

Planned deployment: None planned. Systems team do not object to deployment - but will need a stable tarball install that works on SGE.

Comments:

CREAM CE

Current status (date): Deployed

Planned deployment: Deployed as a replacement to LCG-CE.

Comments:

UKI-SCOTGRID-GLASGOW

SL5 WNs

Current status (date): Initial Migration Complete. 1912 cores total, 1848 SL5 on WN3.2.4-0, 48 SL4 on WN3.1.40-0

Planned upgrade: December move of remaining 48 SL4 cores to SL5.

Comments: Migration complete. Some SL4 capacity kept for local ATLAS users to run non ported versions of Athena.

SRM

Current status (date): 2 DPMS migrated to SL5 DPM3.2.1-0

Planned upgrade: Possible upgrade from DPM-srm-server-mysql.x86_64 1.7.2-5 when available

Comments:

SCAS/glexec/ARGUS

Current status: 28/04/2011 ARGUS installation planned for May.

Current status (date): 10/11/2009 SCAS & GLEXEC with CREAM and GLEXEC on WN deployed in UAT .

Planned deployment: SCAS, GLEXEC with CREAM, GLEXEC with WN in Production on request.

Comments: Documenting install and info on wiki.

CREAM CE

Current status (date): 10/11/2009 Deployed in Production currently running 3.1.22

Planned deployment: Completed. Migrated to svr014.gla.scotgrid.ac.uk, svr008.gla.scotgrid.ac.uk and svr026.gla.scotgrid.ac.uk

Comments: In Production and open to all VO's

SouthGrid

UKI-SOUTHGRID-BHAM-HEP

SL5 WNs

Current status (10/02/10): All WNs now running SL5.3

Planned upgrade: Complete.

SRM

Current status (27/10/09): DPM 1.7.2-4 on SL 4.6

Planned upgrade: Complete.

ARGUS/glexec

Current status (22/03/11):

Planned deployment: Deployed for the local cluster, still testing. Working on deploying for the shared cluster, but this requires glexec for a tarball WN release.

CREAM CE

Current status (22/03/11): Complete. Both clusters have a working CreamCE.

UKI-SOUTHGRID-BRIS-HEP

SL5 WNs

Current status (date): (Dec 2009) VM CE in production with SL5 WN passing all OPS SAM tests. More WN soon.

SRM

Current status (date): 1.6.11-3sec Planned upgrade: No plans to upgrade, plan to retire DPM in Dec 2009.

StoRM SE must be upgra^H^H^H^H^H rebuilt (there's no upgrade path!) to 1.4 & enable other VO support on it.

ARGUS/glexec

Current status (date): Not yet installed (22.3.11)

Planned deployment: Waiting to hear how it goes elsewhere first.

Comments:

CREAM CE

Current status (date): Installed and working (22.3.11)

Planned deployment:

Comments:

UKI-SOUTHGRID-CAM-HEP

SL5 WNs

Current status (22/03/2011): SL5 on all WNs

Comments:

SRM

Current status (19/11/2009): Presently at 1.6.11 ob gLite 3.1 (glite-SE_dpm_mysql-3.1.10-0.x86_64)

Planned upgrade: Already tried several times but error returned reporting:

Error: Missing Dependency: libapr-0.so.0()(64bit) is needed by package apr-util
Error: Missing Dependency: libapr-0.so.0()(64bit) is needed by package httpd

Comments: There is already a opened ticket for that: #52552

SCAS/glexec

Current status (19/11/2009): Reviewing the compatibility issue with Condor at site.

Planned deployment:

Comments:

CREAM CE

Current status (22/03/2011): CREAM CE with PBS deployed, not functional yet.

Planned deployment:

Comments:

EFDA-JET

SL5 WNs

Current status (date): We have just upgraded to SL5/glite 3.2 (191109)

Planned upgrade:

Comments:

SRM

Current status (date):

Planned upgrade:

Comments:

ARGUS/glexec

Current status (date): As not a major analysis site this is low priority at the moment (22.3.11)

Planned deployment:

Comments:

CREAM CE

Current status (date): Not installed will do soon (22.3.11)

Planned deployment:

Comments: A lot of work on MAST (UK Fusion project)

glite.APEL Current status (date): In the process of installing today (22.3.11)

Planned upgrade:

Comments:

UKI-SOUTHGRID-OX-HEP

SL5 WNs

Current status (date): All WN's at SL5 (19.10.09)

Planned upgrade:

Comments:

SRM

Current status (date): Running DPM 1.7.4-7 (22.3.11)

Planned upgrade:

Comments:

ARGUS/glexec

Current status (date): All WNs have glexec installed with an ARGUS server back end. . (22.3.11).

Planned deployment:

Comments:

CREAM CE

Current status (date): t2ce06 is a CREAM ce driving the all the WNs in the production cluster. t2ce02 is a CREAM ce driving a smaller subset of WNs and is used as part of the Early Adopter program. (22.1.11) Planned deployment:

Comments:

UKI-SOUTHGRID-RALPP

SL5 WNs

Current status (date): All WNs nodes running SL5 (Was the first site to move across)

Planned upgrade:

Comments:

SRM

Current status (date): dcache 1.9.1-7

Planned upgrade:

Comments:

ARGUS/glexec

Current status (date): Installed and working(22.3.11)

Planned deployment:

Comments:

CREAM CE

Current status (date): Installed and working (22.3.11)

Planned deployment:

Comments:

glite-APEL

Current status (date): Installed and working (22.3.11)

Planned deployment:

Comments:

Tier1

RAL-LCG2-Tier-1

SL5 WNs

Current status (date): 19/11/2009 All LHC accessible WNs are SL5,

Planned upgrade: None

Comments:

SRM

Current status (date):

Planned upgrade:

Comments:

SCAS/glexec

Current status (date): 10/05/2011 SCAS deployed, intend to look at ARGUS but delayed due to staff changes.

Planned deployment:

Comments:

CREAM CE

Current status (date): 10/05/2011 CREAM CEs available for all VOs, planning reinstallation of remaining 2 lcg-CEs

Planned deployment:

Comments: