RAL T1 weekly ops Fabric 20111024
From GridPP Wiki
Developments
- All:
- Tim:
- James A:
- Cheney:
- Kash:
- Drive replacement.
- Fixing broken WNs.
- Decommissioning old disk servers/batch systems (Viglen 2006 started).
- gdss296 started acceptance testing.
- Update wiki page on how to report faults to vendors.
- gdss353 given back to Castor team.
- gdss456: re-create RAID array.
- lcgcts13 stuck while initializing the iDRAC controller; Dell suggests updating the firmware.
- Appointment with Physio.
- gdss396 started 7-day acceptance testing.
- gdss295 started 7-day acceptance testing.
- Create change control for updating firmware on Adaptec controllers on Viglen 2009 disk servers.
- gdss487 given back to Castor team.
- Martin:
- Ian:
Operational Issues and Incidents
| Index | Description | Start | End | Severity | Affected VO(s) |
|---|---|---|---|---|---|
Summary of plans for week ahead
Scheduled and Cancelled Down Times
Entries of type Down/At Risk/Cancelled that are in, or planned to go to, GOCDB
| Component | Description | Start | End | Affected VO(s) | Type |
|---|---|---|---|---|---|
Development priorities
- All:
- Tim:
- Cheney:
- James A:
- Kash:
- Drive replacement.
- Fixing broken WNs.
- Hardware failure review and metrics continue.
- Continue decommissioning old disk servers/batch systems (R 27).
- Continue labelling racks and systems in UPS and HPD room.
- Martin:
- Ian:
Absences