Production Team Report 2010-06-14

From GridPP Wiki
Jump to: navigation, search

RAL Tier1 Production Team Report for 14th June 2010.

AoD This Week

Mon & Tues: Tiju Wed: Gareth Thu: John Fri: Tiju

Last week

  • Gareth: AoD (1 day), HEP SYSMAN planning & running, Following up with Dust in computer room.
  • John: AoD (3 days)
  • Tiju: Worked on Nagios replacement for SAM test, HEPSYSMAN, Dashboard updates

Changes to Operating procedures

  • None

Declared Outages in GOC DB

  • FTS 2.2.4 update. At Risk. Wednesday 16th June 08:00-12:00.

June 28-30 OLD LHC Technical Stop Dates

  • Transformer checks. (Site At Risk). TX2 & TX3. At Risk on whole Tier1 from 08:30 on Monday 28th to 17:00 on Wednesday 30th.

July 19-22 NEW LHC Technical Stop Dates

  • Transformer checks. (Site At Risk). TX1 & TX4. At Risk on whole Tier1 from 08:30 on Monday 19th to 17:00 on Thursday 22nd July.

Advanced Warning

  • Fabric:
    • Move one power unit for one EMC array unit behind the non-Castor databases to UPS power.
    • Double the network link to the tape robot stack (stack 12), postponed from the last TS. (Requires Castor stop).
    • Swap out the older of the pair of SAN switches in the Tier1 Oracle databases for its new replacement. (Requires FTS, LFC, 3D stop).
    • Multipath mods to stop errors. (Not yet sure of effect).
    • Microcode update for tape robot
    • Swap Solaris tape controllers (for robot) over (?)
    • New kernels and glibc updates on non-castor Oracle RAC nodes. (Done for LUGH).
  • Database:
    • nothing
  • Grid Services:
    • Stop SL4 batch service (Start August)
    • Add Quatorised BDII to Top-BDII set. (Below threshold for technical stop).
    • glite 3.2 WMS (Below threshold for technical stop).
  • Castor:
    • Possible SRM update
  • Networks:
    • Commissioning OPN link