Production Team Report 2010-09-27

From GridPP Wiki
Jump to: navigation, search

RAL Tier1 Production Team Report for 27th September 2010.

AoD This Week

Mon - Wed: Tiju Thu: Gareth Fri: Tiju

Last week

  • Gareth: AoD(1 Day), Common Technologies Workshop & preparation; Some planning for LHCb outage, Followed up with problem on Lancaster link, Disk Post Mortems.
  • John: AoD(4 Day), Work on castor Nagios tests - preparations for castor LHCb update.
  • Tiju: Review nagios configuration, Security task, Make offline copy of Wiki.

Changes to Operating procedures

  • None.

Declared Outages in GOC DB

  • LHCb Castor update (27-29 September).

Advanced Warning

  • Weekend 2/3 October: Power outage in atlas building.
  • Monday 18th October - R89 Transformer Checks.
  • Wednesday 20th October - UPS maintenance.
  • Monday 13th December - UPS test.

Other Changes

  • Fabric:
    • Double the network link to the tape robot stack (stack 12), postponed from the last TS. (Requires Castor stop).
    • Swap out the older of the pair of SAN switches in the Tier1 Oracle databases for its new replacement. (Requires FTS, LFC, 3D stop).
    • New kernels and glibc updates on non-castor Oracle RAC nodes. (Done for LUGH).
    • Update firmware in RAID controller cards for a batch of disk servers.
  • Database:
    • Re-visit non-Castor database multipathing
  • Grid Services:
    • New Quattorized front ends for FTS.
  • Castor:
    • Possible SRM update
    • Castor 2.1.9 upgrades
  • Networks:
    • None