RAL Tier1 weekly operations castor 04/04/2011

From GridPP Wiki
Jump to: navigation, search

Operations News

  • ATLAS, Gen, LHCb successfully upgraded to 2.1.10-0
  • 3rd SRM for LHCb back in production (lcg0680 which replaces broken lcgsrm0660)
  • Testing of SRM 2.10 against 2.1.10 progressing well

Operations Issues

  • Loss of one unmigrated t2k file on gdss502 during a disk corruption. Users informed.

Blocking Issues

  • Lack of production-class hardware running ORACLE 10g needs to be resolved prior to CASTOR for Facilities going into full production. Has arrived and we are awaiting installation.

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB

  • None planned

Advanced Planning

  • Upgrade of SRM to 2.10 which incorporates:
    • fix to report files on draining disk servers accessed by FTS to be NEARLINE not UNAVAILABLE
  • Upgrade of CASTOR clients on WNs to 2.1.10
  • Upgrade tape subsystem to 2.1.10-1 which allows us to support files >2TB
  • Move Tier1 instances to new Database infrastructure which with a Dataguard backup instance in R26
  • Upgrade Facilities instance to 2.1.10-0
  • Move Facilities instance to new Database hardware running 10g
  • Start migrating from T10KA to T10KC media later this year

Staffing

  • Castor on Call person: Shaun
  • Staff absence/out of the office:
    • Chris on A/L (all week)