Operations Report 12/10/2009

From GridPP Wiki
Jump to: navigation, search

Summary of Previous Week

Developments

  • CASTOR Databases Recovered and Moved to Old Disk Array
  • LFC/FTS Services Moved to Orisa Database
  • LHCB 3D Database Moved to Old Disk Array
  • ATLAS 3D Database Restored - Currently Being Re-Synced with CERN

Operational Issues and Incidents

  • CASTOR: Databases down because of disk arrays problems from 4/10 to 8/10
  • 3D: Database because of disk array problems from 6/10 to 9/10
  • LFC/FTS Databse down due to disk array problem (6/10) Restored Service on Orisa DB (7/10)

Plans for Week(s) Ahead

  • CASTOR: Analyse Last Weeks Recovery Operations
  • CASTOR:Investigate Moving Pluto Database Backup Area to Bulk Array
  • CASTOR: Implement Copy of Redo Logs on Server as well as ASM (Improve Resilience)
  • 3D: Re-sync ATLAS Database with CERN

Downtimes and At Risk

Description Start End Affected VO(s)

Development Priorities

  • CASTOR Database Monitoring
  • Migrate ATLAS TAGs to 64bit systems
  • Investigate ORACLE replication technique for LFC/FTS resilience

Requirements and Blocking Issues

Description Required By Priority Status
Hardware for Tag databases Medium Waiting
Hardware to test LFC database replication Medium/high Waiting

OnCall

  • Richard Sinclair