RAL Tier1 weekly operations castor 12/09/2011

From GridPP Wiki

Operations News

  • Now using new DB hardware running 10g for the Facilities DLF (the Stager will be moved soon)
  • Draining and decommissioning of old disk servers continue
  • Worked with Fabric Team to upgrade firmware on all tape-backed disk servers

Operations Problems

  • gdss474 fails to drain using the draining tool and is being drained manually
  • The ATLAS workload over the weekend, combined with draining, resulted in a callout, and draining was paused

Blocking Issues

none

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB: none

Advanced Planning

  • Move Tier1 instances to new Database infrastructure with a Dataguard backup instance in R26
  • Move Facilities DB instance to new Database hardware running 10g
  • Upgrade SRMs to 2.11, which incorporates VOMS support
  • Start migrating from T10KA to T10KC media later this year
  • Certify 2.1.11 and evaluate the new LSF replacement
  • Quattorization of remaining SRM servers
  • Hardware upgrade, Quattorization and upgrade to SL5 of Tier1 CASTOR headnodes

Staffing

  • Castor on Call person: Chris
  • Staff absence/out of the office:
    • Chris and Shaun at the SDB training course, Mon-Tue
    • Shaun and Matthew at CERN, Wed-Fri, for the CASTOR F2F and GridPP