Production Team Report 2011-03-07

From GridPP Wiki
Jump to: navigation, search

RAL Tier1 Production Team Report for 7th March 2011.

AoD This Week

Mon,Tue: John Wed - Fri: Tiju

Gareth in Lyons Mon - Wed

Last week

  • Gareth: AoD 1 day;
  • John: AoD (4 days);
  • Tiju: A/L (5 days);

Changes to Operating procedures

  • Procedure for dealing with files with bad checksums.
  • Changes in method of access to the SAM website. [[1][RT74271]]

Declared Outages in GOC DB

  • Wednesdasy 9th March: Upgrade of Castor nameserver (downtime)
  • Thursday 10th March: Atlas file renaming (At risk)

Advanced Warning - dates provisional

  • Wednesday 9th March: Castor Nameserver upgrade
  • Wednesday 9th March: Install isolating transformers in power supply to database disk array.
  • Atlas Castor file renaming (Thurday 10th March)
  • Tuesday 15th March network outage.
  • Monday 28th March: Castor Atlas Upgrade (2.1.9-10 OR 2.1.10-0)
  • Tuesday 29th March: Castor CMS Upgrade (2.1.9-10 OR 2.1.10-0)
  • Wednesday 30th March: Castor LHCb & GEN Upgrade (2.1.9-10 OR 2.1.10-0)
  • Thursday 31st: - Switch castor to new database in R26

Other Changes

  • Fabric:
    • Swap out the older of the pair of SAN switches in the Tier1 Oracle databases for its new replacement. (Requires FTS, LFC, 3D stop).
  • Database:
    • Re-visit non-Castor database multipathing
    • Increase shared memory for LUGH & SOMNUS.
  • Grid Services:
    • None
  • Castor:
    • Change ATLAS castor permissions to prevent users deleting data
  • Networks:
    • Firmware updates for central networking components (likely to have some short network breaks - maybe in March)