Production Team Report 2011-05-23
RAL Tier1 Production Team Report for 23rd May 2011.
AoD This Week
Mon - Tue: Tiju; Wed: Gareth; Thu: John; Fri: Gareth
Last week
- Gareth: AoD (2 days); post-mortem review for LFC outage; HEPSYSMAN preparations; some Nagios work (including for Facilities).
- John: AoD (1 day); re-adding database servers to Ganglia.
- Tiju: AoD (2 days); Nagios (general, Facilities, CVMFS).
Changes to Operating procedures
- None
Declared Outages in GOC DB
- Tue 24 May - Tue 7 Jun: Drain and re-installation of CE07 as a CREAM CE.
Advance Warning
- None
Other Changes
- Fabric:
  - Networking upgrade for tape servers to provide sufficient bandwidth for T10KC tapes.
  - Microcode update on tape robots.
- Database:
  - Switch Castor databases to alternative array in UPS room (3-4 hour outage of Castor).
  - Switch Castor databases back to array in R89 (3-4 hour outage of Castor).
  - Switch non-Castor databases to new array (~1 hour outage of LFC, FTS, 3D).
- Grid Services:
  - None
- Castor:
  - Change ATLAS Castor permissions to prevent users from deleting data.
  - Castor 2.1.10 client upgrade on WNs.
  - Castor update to obtain functionality for T10KC tapes.
  - Updates to new hardware for Castor head nodes.
- Networks:
  - Firmware updates for central networking components (likely to cause some short network breaks).