RAL T1 weekly ops Fabric 20111010

From GridPP Wiki
Jump to: navigation, search

Developments

  • All:
  • Tim:
  • James A:
  • Cheney
    • install of acsls
    • upgrade solaris to accommodate acsls
    • recarve disks to accommodate solaris upgrade
    • documentation of the above (now on twiki)
    • fix stuck requests to solarb
    • fix stuck database backups
    • fix tsbn stats and metrics
    • sanity check tsbn stats
    • attended chris's castor load workshop
    • improve website for backups reporting
    • tweak iptables firewalls
    • fix c2probe for atlas
    • regenerate webalizer stats for solarb
    • fix solarb sticky bit bug on /tmp


  • Kash:

Annual Leave.

  • Martin:
  • Ian:


Operational Issues and Incidents

Index Description Start End Severity Affected VO(s)

Summary of plans for week ahead

Scheduled and Cancelled Down Times

Type=Down/At Risk/Cancelled entries in/planned to go to GOCDB

Component Description Start End Affected VO(s) Type

Development priorities

  • All
  • Tim:
  • Cheney
    • have a quattor day - errata / logrotates / amanda /
    • fix network-connection-count script
  • James A:
  • Kash:
    • Drive replacement.
    • Fixing broken WNs.
    • Hardware failure review and metrics continue.
    • Continuous decommissioning old disk servers/batch systems.(R 27)
    • Continue Labelling racks and systems in UPS and HPD room.


  • Martin:
  • Ian:

Absences

    • cheney out on friday

Fabric On-Call

Advanced Warning of Requirements and Blocking issues

Services Issues


RAL Tier1 weekly operations fabric

Category:RAL_Tier1