RAL Tier1 weekly operations Fabric 20110404
From GridPP Wiki
Contents
Developments
- All:
- Martin:
- Ian:
- Tim:
- James A:
- Cheney
- DMF DR testing (failed)
- set up backups for isis
- Kash:
- Drive replacement.
- Fixing broken WNs.
- Decommissioning old batch systems.(R 27)
- Test room review. (Now monthly)
- gdss496 need to install smartd tool.
- Sent firmware details of Jetstor 3 system to VSPL.
- gdss481 and gdss488 re-created raid array to fix failed stripes.
- Update firmware on Jetstor systems.(ongoing) Updated on three.
- gdss502 found bad blocks while initializing raid array.
- R410 updated firmware on all drives.
- Couple of motherboard replaced by Dell engineer in CV10 batch systems.
- SL08 testing started again by James T.
- Labelling racks and systems in UPS and HPD room.
- gdss426 re-create raid array and test system.
Operational Issues and Incidents
Index | Description | Start | End | Severity | Affected VO(s) |
---|
Summary of plans for week ahead
Scheduled and Cancelled Down Times
Type=Down/At Risk/Cancelled entries in/planned to go to GOCDB
Component | Description | Start | End | Affected VO(s) | Type |
---|
Development priorities
- All
- Martin:
- Ian:
- Tim:
- Cheney
- DMF DR
- James A:
- Kash:
- Drive replacement.
- Fixing broken WNs.
- Hardware failure metrics continue.
- Continue SL08 testing.
- Continuous decommissioning old batch systems.(R 27)
- Continue Labelling racks and systems in UPS and HPD room.
Absences
Fabric On-Call
- Monday - Sunday