RAL Tier1 weekly operations Overview 20090713

From GridPP Wiki
Revision as of 14:02, 13 July 2009 by Matt viljoen (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Overview of Milestones and Metrics

Key High Level dates

  • LHC schedule delayed 3 weeks. We now expect first beam in October and first collisions some time in November.
  • There will be no formal change in WLCG planning for data taking until the WLCG workshop on 9th July however in the light of

the above delay we will delay our freeze date to 31st August.

  • Data taking then expected to continue (with a 2 week stop for Christmas) through much of 2010. Alternative scenarios are being discussed.


Key Metrics

Owner Description Target Achieved
Gareth Smith Overall Tier-1 SAM Availability (last week) 93% 0%
Gareth Smith Alice SAM Availability (Jun) 97% 73%
Gareth Smith ATLAS SAM Availability (Jun) 97% 71%
Gareth Smith CMS SAM availability (Jun) 97% 71%
Gareth Smith LHCB SAM availability (Jun) 97% 67%
Andrew Sansum Fraction of Tier-1 Staff in Post (Jun) 93% 103%
Gareth Smith Number of days where called out (last spreadsheet full week) 3 2
Matt Hodges Percentage met of UB allocation of disk (May) 100% 91%
Matt Hodges Job Efficiency (May) 85% 81%
Matt Hodges Farm Occupancy (May) 85% 43%
Matt Viljoen Number of >Severe CASTOR Incidents (Jun) 6 2

Availability was poor in June owing to the move of the Tier-1 to R89.

Key Production Milestones

Expected to be in mytasks this week.

R89 Migration Summary

Is complete except for some final configuration as the non-HEP robot is attached to the HEP robot

High Level Schedule

Final Update Window					Mon 13/07/09	Wed 26/08/09
Tier-1 Stability Period (2)				September
LHC First beam				        	October?
LHC Collisions					        November?

Note that:

  • Final update window is now extended to end of August.
  • WLCG aiming for stability by the end of August

Disaster Management

Swine Flu (H1N1) is being handled in the Tier-1 Disaster Management System (currently level 2)

Swine Flu Response Plan

See: https://wiki.e-science.cclrc.ac.uk/web1/bin/view/EScienceInternal/TierOneSwineFlu

Purchasing and Finance

  • GRIDPP finalising spend plan
  • Commencing current disk and CPU tenders (Dave Corney leading). Target date for disk delivery is end of December. First meeting of HAG has occurred. Expect to have disk paperwork ready next week.

Staffing

  • One experiment support post accepted and expected to start in August. Second experiment support post, ready to make offer.
  • PPS recruitment re-approved.
  • YII post expected in July

PMB Experiment Reports

ATLAS

CMS

LHCB

1) Many Grid sites (Tier-2s so far) banned because they have been aborting pilots. Sites banned in the UK are Brunel and RAL-HEP. Possible bad publishing of information in the bdii, but investigations going on.

...

Outlook: 1 Billion event (minimum-bias) production proceeding - about 845 million events produced so far, at a rate of about 50 million events per day. User analysis as usual.

Hardware Deployment Report

None

Team Reports

Fabric

RAL Tier1 weekly operations Fabric 20090713

Grid Services

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_Grid_20090713

CASTOR

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor_13/07/2009

Database

http://www.gridpp.ac.uk/wiki/Operations_Report_13/07/2009

Production

Production Team Report 2009-07-13