RAL Tier1 weekly operations castor 28/11/2011
Operations News
- Errata applied on the LHCb SRMs; Quattor is now enabled on them
Operations Problems
- On Mon/Tue night, ATLAS and LHCb instances stopped working for ~1.5h while a database backup failed due to a bad DMF mount point.
Blocking Issues
- none
Planned, Scheduled and Cancelled Interventions
Entries in/planned to go to GOCDB: none
Advanced Planning
- Move Tier1 instances to the new database infrastructure, with a Dataguard backup instance in R26
- Upgrade SRMs to 2.11, which incorporates VOMS support
- Certify 2.1.11 and evaluate the Transfer Manager (the new LSF replacement)
- Quattorization of remaining SRM servers
- Hardware upgrade, Quattorization and upgrade to SL5 of Tier1 CASTOR headnodes
Staffing
- Castor on Call person: Matthew
- Staff absence/out of the office:
  - (Thu-Fri) Shaun at Toulouse
  - (Thu-Fri) Brian A/L