Operations Report 19/10/2009


Summary of Previous Week

Developments

  • CASTOR: Name server restored up to 3 October
  • 3D: ATLAS database has been resynchronised
  • CASTOR: We gained a much better understanding of the problems encountered when restoring Pluto from backup.

Operational Issues and Incidents

  • CASTOR: Pluto restore from backup failed.


Plans for Week(s) Ahead

  • Work out a strategy for testing backups
  • Obtain a workaround from Oracle for the problem encountered when restoring Pluto


Downtimes and At Risk

Description | Start | End | Affected VO(s)

Development Priorities

  • CASTOR Database Monitoring
  • Migrate ATLAS TAGs to 64-bit systems
  • Investigate Oracle replication techniques for LFC/FTS resilience

Requirements and Blocking Issues

Description | Required By | Priority | Status
Hardware for TAG databases | | Medium | Waiting
Hardware to test LFC database replication | | Medium/High | Waiting

OnCall

  • Keir Hawker