RAL Tier1 weekly operations castor 12/09/2011
From GridPP Wiki
Revision as of 14:01, 12 September 2011 by Matt viljoen (Talk | contribs)
Contents
Operations News
- New using new DB hardware running 10g for Facilities DLF (Stager will be moved soon)
- Draining and decommissioning of old disk servers continue
- Worked with Fabric Team to upgrade firmware on all tape-backed disk servers
Operations Problems
- gdss474 is failing draining using the tool and is being drained manually
- ATLAS workload over the weekend combined with draining resulted in a callout and draining was paused
Blocking Issues
none
Planned, Scheduled and Cancelled Interventions
Entries in/planned to go to GOCDB none
Advanced Planning
- Move Tier1 instances to new Database infrastructure which with a Dataguard backup instance in R26
- Move Facilities DB instance to new Database hardware running 10g
- Upgrade SRMs to 2.11 which incorporates VOMS support
- Start migrating from T10KA to T10KC media later this year
- Certify 2.1.11 and evaluate the new LSF replacement
- Quattorization of remaining SRM servers
- Hardware upgrade, Quattorization and Upgrade to SL5 of Tier1 CASTOR headnodes
Staffing
- Castor on Call person: Chris
- Staff absence/out of the office:
- Chris and Shaun at SDB training course from Mon-Tue
- Shaun and Matthew at CERN from Wed-Fri for CASTOR F2F and GridPP