Difference between revisions of "RAL Tier1 weekly operations Overview 20091130"

From GridPP Wiki
Jump to: navigation, search
 
(No difference)

Latest revision as of 15:32, 30 November 2009

Overview of Milestones and Metrics

Key Metrics

Owner Description Target Achieved
Gareth Smith Overall Tier-1 SAM Availability (last week) 97% 100%
Gareth Smith Alice SAM Availability (Oct) 97% 81%
Gareth Smith ATLAS SAM Availability (Oct) 97% 69%
Gareth Smith CMS SAM availability (Oct) 97% 82%
Gareth Smith LHCB SAM availability (Oct) 97% 94%
Andrew Sansum Fraction of (GRIDPP funded) Tier-1 Staff in Post (Oct) 93% 103%
Gareth Smith Number of days where called out (last spreadsheet full week) 3
Matt Hodges Percentage met of UB allocation of disk (Oct) 100% 97%
Matt Hodges Job Efficiency (Oct) 85% 56%
Matt Hodges Farm Occupancy (Oct) 85% 52%
Matt Viljoen Number of >Severe CASTOR Incidents (Oct) 6 4

Key Production Milestones

See myactions:

https://myactions.gridpp.rl.ac.uk/all/where/category_name/Operational/

High Level Schedule

  • LHC commissioning appears to be on track - ramped beam energy to record levels. Expect collisions at

record levels in the next 7 days.

  • GRIDPP Review of RAL Tier-1 14 December
Event Scehdule
LHC Standby December 19th
Restart 4th January
Run ends October 2010

Disaster Management

  • Multiple RAID Array failures. Likely cause identified. Plan to re-certify EMC hardware for move back on 5th January.
  • Disk deployment ongoing testing with Viglen. Finalising plan for replacement.

Purchasing and Finance

  • Disk tender orders placed. Deliver Late January, early February.
  • CPU tender suppliers selected - at standstill

Staffing

At full complement

PMB Experiment Reports

Note we agreed to agree all non-Trivial changes with the PMB through until the end of the 2009 run.

ATLAS

CMS

LHCB

Very low level of running. So, no major problems seen from site point of view. 1. T1 diskserver failure earlier this morning in lhcbDst service class. Awaiting further information. 2. LHCb application problems still under investigation 3. Waiting for more data.... (LHC reached 2*1.18 TeV earlier this morning).

Hardware Deployment Report (Matthew)

To confirm, there is no new disk deployment - all new machines for ATLAS and CMS have been allocated. Presumably you no longer need this weekly report until there is new deployment work to be done?

Team Reports

Fabric

RAL Tier1 weekly operations Fabric 20091130

Grid Services

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_Grid_20091130

CASTOR

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor_30/11/2009

Database

http://www.gridpp.ac.uk/wiki/Operations_Report_30/11/2009

Production

Production Team Report 2009-11-30