RAL T1 weekly ops Fabric 20110627

From GridPP Wiki

Developments

  • All:
    • Away Day
  • Tim:
  • James A:
  • Cheney:
  • Kash:
    • Drive replacement.
    • Fixing broken WNs.
    • Decommissioning old disk servers/batch systems.
    • gdss256 read-only file system.
    • Firmware update on all Viglen 2007 disk servers (ongoing).
    • Update firmware on Jetstor systems (ongoing); three updated so far.
    • Try to send a faulty drive from SL08 batch to Areca.
    • Running 'verify fix' on SL09 disk servers with bad blocks on drives.
    • Re-create and configure RAID array of 5-7 CV 05 disk servers after decommissioning (done).
    • Quattor02: swap drive in port 1 with R410.


  • Martin:
    • Database testing for DB team
    • Disk ITT
    • Common Ops project
  • Ian:
    • EqualLogic evaluation
    • Quattor template clean-up with James
    • Meeting and planning with Nuffield bursary student
    • Preparation for Erasmus student
    • Refactoring CVMFS config in Quattor
    • Generated June errata templates
    • Setup on Hyper-V VMM with Dave Drummond


Operational Issues and Incidents

Index | Description | Start | End | Severity | Affected VO(s)

Summary of plans for week ahead

Scheduled and Cancelled Down Times

Type=Down/At Risk/Cancelled entries in/planned to go to GOCDB

Component | Description | Start | End | Affected VO(s) | Type

Development priorities

  • All:
  • Tim:
  • Cheney:
  • James A:
  • Kash:
    • Drive replacement.
    • Fixing broken WNs.
    • Continue hardware failure metrics.
    • Continue SL08 testing.
    • Continue decommissioning old disk servers/batch systems (R 27).
    • Continue labelling racks and systems in UPS and HPD room.


  • Martin:
    • Prep for HEPSysMan
    • Disk ITT
    • DB testing work as needed
  • Ian:
    • Finalise COp WP3 report
    • Take delivery of and start tests on new EqualLogic array (?)
    • Attend HEPSysMan and give talk(s)
    • Set up local storage hypervisor(s)

Absences

Fabric On-Call

  • Ian Monday-Tuesday; Kash Wednesday-Sunday

Advanced Warning of Requirements and Blocking issues

Services Issues


RAL Tier1 weekly operations fabric

Category:RAL_Tier1