Operations Report 10/10/2011

From GridPP Wiki
Jump to: navigation, search

Summary of Previous Week

  • CASTOR: Got the HW from Martin, now working on replicate the crash problem
  • CASTOR: Preprod and certification backup up and running
  • CASTOR: Fix for deadlocking on SRM
  • GC: Working on GC resilience

Operational Issues and Incidents

  • 3D/FTS: backup failure due to disk array controller problems

Plans for Week(s) Ahead

  • CASTOR: Replicate problems (and test potential firmware fix)
  • JUNO: Update the backup testing routine to test also Juno backups
  • GC: Work on GC resilience

Downtimes and At Risk

Description Start End Affected VO(s) Type

Development Priorities

  • Test and migrate to new HW

Requirements and Blocking Issues


OnCall

  • Carmine

Absences

  • Eddy: Tuesday
  • Keir: Off sick today