Difference between revisions of "RAL Tier1 weekly operations castor 27/09/2010"

From GridPP Wiki
Jump to: navigation, search
 
(No difference)

Latest revision as of 09:55, 4 October 2010

Work previous week

  • Matthew:
    • Upgrade planning
    • gridftp-int xroot and parallel transfers testing
    • CoD duties
  • Shaun:
    • ..
  • Chris:
    • Preparation for Castor Upgrade
    • Castor Facilities work
  • Richard:
    • Ran 2.1.9 functional test suite on newly upgraded 2.1.9 LHCB instance
  • Brian:
    • ..
  • Jens:
    • Work on double checking post-upgrade dynamic information data
    • Discussions on tape publishing

Operations Issues

  • Bad job efficiency for LHCb continues. We will increase memory on one SRM next week and plan to add a new SRM node. We also hope that gridftp-int over xroot will improve matters in 2.1.9.
  • LSF daemons stopped on gdss81 (atlasStripInput). Heavy load killed LSF again after restarting. Decreased job slots from 50 to 15 helped.

PreProd


  • ..

Blocking issues

  • Any ongoing production problems at present will jepordize the timeline for starting 2.1.9 upgrades at the end of this month.

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB

Description Start End Type Affected VO(s)
Update LHCb to 2.1.9 27/09/2010 08:00 29/09/2010 18:00 Downtime LHCb
Update Gen to 2.1.9 (STC) 25/10/2010 08:00 27/10/2010 18:00 Downtime Gen
Update CMS to 2.1.9 (STC) 08/11/2010 08:00 10/11/2010 18:00 Downtime CMS
Update ATLAS to 2.1.9 (STC) 22/11/2010 08:00 24/11/2010 18:00 Downtime ATLAS

Advanced Planning

  • Upgrade to 2.1.9 2010

Staffing

  • Castor on Call person: Shaun
  • Staff absences:
    • ..