RAL Tier1 weekly operations Fabric 20110404

From GridPP Wiki
Jump to: navigation, search

Developments

  • All:
  • Martin:
  • Ian:
  • Tim:
  • James A:
  • Cheney
    • DMF DR testing (failed)
    • set up backups for isis
  • Kash:
    • Drive replacement.
    • Fixing broken WNs.
    • Decommissioning old batch systems.(R 27)
    • Test room review. (Now monthly)
    • gdss496 need to install smartd tool.
    • Sent firmware details of Jetstor 3 system to VSPL.
    • gdss481 and gdss488 re-created raid array to fix failed stripes.
    • Update firmware on Jetstor systems.(ongoing) Updated on three.
    • gdss502 found bad blocks while initializing raid array.
    • R410 updated firmware on all drives.
    • Couple of motherboard replaced by Dell engineer in CV10 batch systems.
    • SL08 testing started again by James T.
    • Labelling racks and systems in UPS and HPD room.
    • gdss426 re-create raid array and test system.


Operational Issues and Incidents

Index Description Start End Severity Affected VO(s)

Summary of plans for week ahead

Scheduled and Cancelled Down Times

Type=Down/At Risk/Cancelled entries in/planned to go to GOCDB

Component Description Start End Affected VO(s) Type

Development priorities

  • All
  • Martin:
  • Ian:


  • Tim:
  • Cheney
    • DMF DR
  • James A:
  • Kash:
    • Drive replacement.
    • Fixing broken WNs.
    • Hardware failure metrics continue.
    • Continue SL08 testing.
    • Continuous decommissioning old batch systems.(R 27)
    • Continue Labelling racks and systems in UPS and HPD room.

Absences

Fabric On-Call

  • Monday - Sunday

Advanced Warning of Requirements and Blocking issues

Services Issues


RAL Tier1 weekly operations fabric

Category:RAL_Tier1