RAL Tier1 weekly operations castor 05/11/2012
From GridPP Wiki
Revision as of 09:32, 5 November 2012 by Matt viljoen (Talk | contribs)
Contents
Operations News
- Gen upgraded to 2.1.12-10.
- Hot standby hardware running lcgclsf04 (LHCb LSF) replaced with repaired original hardware
Operations Problems
- SRM DB problem brought down ATLAS for 6(?) hours on Sunday
Blocking Issues
Enabling central syslog collection of central service logs is needed before we turn off Amanda backups on all CASTOR headnodes
Planned, Scheduled and Cancelled Interventions
Entries in/planned to go to GOCDB none
Advanced Planning
Tasks
- Simplify and document Quattor templates to make them easier to maintain
- Test and certify 2.1.13-5 with simplified Quattor templates
Interventions
- Upgrade stagers from 2.1.12 to 2.1.13 and central services (NS,CUPV,VDQM) from 2.1.11 to 2.1.13
Staffing
- Castor on Call person
- Matthew
- Staff absence/out of the office:
- (Mon) Chris A/L
- (Fri) Brian at HEPSYSMAN