Difference between revisions of "RAL Tier1 weekly operations castor 28/11/2011"

From GridPP Wiki
Jump to: navigation, search
 
(No difference)

Latest revision as of 15:43, 28 November 2011

Operations News

  • Errata on LHCb SRMs updated and now Quattor is enabled

Operations Problems

  • On Mon/Tue night, ATLAS and LHCb instances stopped working for ~1.5h while a database backup failed due to a bad DMF mount point.

Blocking Issues

  • none

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB none

Advanced Planning

  • Move Tier1 instances to new Database infrastructure which with a Dataguard backup instance in R26
  • Upgrade SRMs to 2.11 which incorporates VOMS support
  • Certify 2.1.11 and evaluate the Transfer Manager (the new LSF replacement)
  • Quattorization of remaining SRM servers
  • Hardware upgrade, Quattorization and Upgrade to SL5 of Tier1 CASTOR headnodes

Staffing

  • Castor on Call person: Matthew
  • Staff absence/out of the office:
    • (Thu-Fri) Shaun at Toulouse
    • (Thu-Fri) Brian A/L