RAL Tier1 weekly operations Overview 20091005

From GridPP Wiki

Overview of Milestones and Metrics

Key Metrics

Owner          Description                                                   Target  Achieved
Gareth Smith   Overall Tier-1 SAM Availability (last week)                   97%     86%
Gareth Smith   ALICE SAM Availability (Aug)                                  97%     77%
Gareth Smith   ATLAS SAM Availability (Aug)                                  97%     75%
Gareth Smith   CMS SAM Availability (Aug)                                    97%     77%
Gareth Smith   LHCb SAM Availability (Aug)                                   97%     78%
Andrew Sansum  Fraction of Tier-1 Staff in Post (Aug)                        93%     103%
Gareth Smith   Number of days where called out (last spreadsheet full week)  3       2
Matt Hodges    Percentage met of UB allocation of disk (Aug)                 100%
Matt Hodges    Job Efficiency (Aug)                                          85%     67%
Matt Hodges    Farm Occupancy (Aug)                                          85%     41%
Matt Viljoen   Number of >Severe CASTOR Incidents (Aug)                      6       1

Key Production Milestones

See myactions:

https://myactions.gridpp.rl.ac.uk/all/where/category_name/Operational/

High Level Schedule

Tier-1 Stability Period (2)     October to mid-November
LHC First beam                  mid-November
LHC Standby                     19th December
Restart                         4th January
Run ends                        October 2010

Disaster Management

  • Swine Flu (H1N1) downgraded to level 1. No regular meetings; will re-activate when case frequency increases.
  • Disk deployment (level 2): testing with Viglen is ongoing. It is increasingly likely that we will escalate to level 3 if there is no progress soon.
  • Machine room air-conditioning. Now level 2.
  • Water leak

Purchasing and Finance

  • GridPP finalised high-level spend plan.
  • Disk tender at ITT evaluation stage.
  • CPU PQQ at ITT stage.
  • Tape drives purchased.
  • Finalising spend plan.

Staffing

  • Alastair Dewhurst started today

PMB Experiment Reports

ATLAS

CMS

October analysis tests are starting. It is not clear what the role of the Tier-1 will be. Dave Colling will check.

LHCB

Hardware Deployment Report

1. Disk servers deployed last week:
  • none

2. Still waiting for SL5-64bit kickstart from Fabric Team.

3. Deployment Rota (05/10 - 09/10):
  • FabMon: Martin
  • DeputyFabMon: James T.
  • DepMon: Chris
  • DeputyDepMon: Shaun

4. Deployment for this week:
  • Starting production disk server deployment for the four LHC VOs:
      • 51842 Deploy 29 TB to ALICE.
      • 51840 Deploy 149 TB to LHCb.
      • 51838 Deploy 126 TB to CMS.
      • 51836 Deploy 516 TB to ATLAS.


Team Reports

Fabric

RAL Tier1 weekly operations Fabric 20091005

Grid Services

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_Grid_20091005

CASTOR

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor_05/10/2009

Database

http://www.gridpp.ac.uk/wiki/Operations_Report_05/10/2009

Production

Production Team Report 2009-10-05