RAL Tier1 weekly operations castor 23/07/2012

From GridPP Wiki
Jump to: navigation, search

Operations News

  • Preprod database upgraded to 2.1.12-4 and testing started
  • Castormon now with improved logs and support for multiple transfermanagers
  • No draining problems found in testing June errata - we haven't been able to replicate the problems we saw in production
  • files reported as vanished by LHCb - they were actually legitimately deleted by a LHCb end user

Operations Problems

  • CMS have been maxing out their TM slots - causing delays on reads. We have doubled the slots and increased the timeout threefold.

Blocking Issues

none

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB none

Advanced Planning

Tasks

  • Test and certify 2.1.12-4 (Matthew, Chris)

Interventions

  • Upgrade repack to 2.1.12-4 (Jul)
  • Upgrade to 2.1.12 on Tier1 instances once we are happy with TM and TG in performance (Sep)

Staffing

  • Castor on Call person
    • Matthew (Mon-Thu)
    • Chris (Fri-Sun)
  • Staff absence/out of the office:
    • Matthew A/L (Wed) and in DL (Fri)