RAL Tier1 weekly operations Fabric 20100913

From GridPP Wiki

Developments

  • All:
  • Martin:
  • Ian:
  • Tim:
    • Ongoing repack tickling
    • DMF hangs
    • Monthly stats
    • Non-LHC tape library usage
    • Castor "stuff"
  • Jonathan:
  • James A:
    • Benchmarking nodes for tender evaluation.
    • Two days of SINDES work.
    • Continued moving repos away from touch.
    • Moved five servers to the UPS room for Grid Services.
    • Tidied test area.
    • Updated Quattor worker node scripting to handle new cluster layout.
    • Added power cabling to Castor Rack G.
  • James T:
    • CASTOR 2.1.9 upgrade test
    • SL09 re-cabling for vendor testing
    • Disk server fixing in Kash's absence
    • OPB planning with James A. and Duncan
    • AoD on Friday
    • New loggers
  • Cheney:
    • busily quatting the facilities castor
    • replace drives on rhubarb
    • rename buxton to kiki for patching


  • Kash:
    • Away all week (A/L).

Absences

  • Jonathan on partial retirement (not in on Monday and Friday)
  • Cheney at the dentist on Monday morning and on leave on Tuesday.

Operational Issues and Incidents

Index | Description | Start | End | Severity | Affected VO(s)

Summary of plans for week ahead

Scheduled and Cancelled Down Times

Type=Down/At Risk/Cancelled entries in/planned to go to GOCDB

Component | Description | Start | End | Affected VO(s) | Type

Development priorities

  • All:
  • Martin:
  • Ian:
  • Tim:
  • Cheney:
    • quatt the facilities castor
  • Jonathan:
  • James T:
    • CASTOR 2.1.9 upgrade testing
    • Preparation for Atlas power outage
      • New loggers
      • Replacements for gdss51, csfnfs5{5,8}
    • Streamline 2009 testing
    • A/L Friday PM
  • James A:
    • Two days of SINDES work.
    • Finishing benchmarking nodes for tender.
    • Continuing moving repos away from touch.
  • Kash:
    • Drive replacement.
    • Fixing broken WNs.
    • Start acceptance test on gdss380.
    • Replaced RAID card in gdss477, as the original was borrowed for gdss473.
    • Catch up with James Thorne.
    • Liaise with Streamline in James T's absence for SL09 disk server testing.
    • Chase vendor about logs/fault for gdss470 and gdss475.
    • Update daily status of Streamline 2009 disk server testing.
    • Continue decommissioning old batch systems (R 27).

Absences

  • Jonathan on partial retirement (not in on Monday and Friday)
  • James T A/L Friday PM

Fabric On-Call

  • Kashif Hafeez

Advanced Warning of Requirements and Blocking issues

Services Issues

