Production Team Report 2010-06-28

From GridPP Wiki
Jump to: navigation, search

RAL Tier1 Production Team Report for 28th June 2010.

AoD This Week

Mon & Tues: John Wed: & Thu: Gareth Fri: John

Last week

  • All: Departmental training day last Tuesday.
  • Gareth: AoD (1 day), Oracle SSC finance training, adding timesheets to SSC Oracle; Tracking EMC/UPS issues.
  • John: AoD (4 days), Created script to clean up redundant CRLs.
  • Tiju: Continued to set-up modem replacement for SURE; Tracked changes to SAM (now Nagios) test submission; Investigated pager problems.

Changes to Operating procedures

  • Note: Changes to disk server intervention procedure - Wiki updated. See:
  https://wiki.e-science.cclrc.ac.uk/web1/bin/view/EScienceInternal/CastorDiskServerIntervention
 

Declared Outages in GOC DB

  • Ongoing. June 28-30 OLD LHC Technical Stop Dates. Transformer checks. (Site At Risk). TX2 & TX3. At Risk on whole Tier1 from 08:30 on Monday 28th to 17:00 on Wednesday 30th.
  • July 19-22 NEW LHC Technical Stop Dates. Transformer checks. (Site At Risk). TX1 & TX4. At Risk on whole Tier1 from 08:30 on Monday 19th to 17:00 on Thursday 22nd July.

Advanced Warning

  • Tomorrow: (Tuesday 29th June) Move one power unit for one EMC array unit behind the non-Castor databases to UPS power.
  • Monday 5th or Tuesday 6th July - switch to T10KB (proposed) (Then Atlas tape recall test on 8th July).
  • Tuesday 20th July: Microcode update for tape robot
  • 1st August: Stop SL4 batch service
  • Fabric:
    • Double the network link to the tape robot stack (stack 12), postponed from the last TS. (Requires Castor stop).
    • Swap out the older of the pair of SAN switches in the Tier1 Oracle databases for its new replacement. (Requires FTS, LFC, 3D stop).
    • Multipath mods to stop errors. (Not yet sure of effect).
    • Swap Solaris tape controllers (for robot) over (?)
    • New kernels and glibc updates on non-castor Oracle RAC nodes. (Done for LUGH).
  • Database:
    • Restict access to FTS & LFC databases.
  • 'Grid Services:
    • Add Quatorised BDII to Top-BDII set. (Below threshold for technical stop).
  • Castor:
    • Possible SRM update
  • Networks:
    • Commissioning OPN link