Operations Report 12/04/2010

From GridPP Wiki
Jump to: navigation, search

Summary of Previous Week

  • Testing CASTOR 2.1.9 Upgrade Procedures
  • Oracle Consultancy for CASTOR Architecture
  • Upgrade OS kernel on ATLAS 3D databases nodes


Operational Issues and Incidents

  • During OS kernel upgrade on ATLAS 3D we lost an ASM disk from both 3D databases ATLAS and LHCb.
  • Somnus (LFC/FTS) database went down for FTS data corruption. Investigation on going with Oracle Support.

Plans for Week(s) Ahead

  • Continue Testing CASTOR 2.1.9 Upgrade
  • Develop CASTOR Upgrade Fallback Plan
  • Test Hammerora freeware application to test load Oracle databases

Downtimes and At Risk

None


Development Priorities

  • Migrate ATLAS TAGs to 64bit systems
  • Investigate ORACLE replication technique for LFC/FTS resilience
  • Investigate hardware architecture, backup and recovery strategy, resilience and validation of restored backup.


Requirements and Blocking Issues

None

OnCall

  • Carmine/Keir

Absences

Carmine - Friday