RAL Tier1 weekly operations castor 27/09/2010
From GridPP Wiki
Revision as of 09:55, 4 October 2010 by Chris kruk (Talk | contribs)
Work previous week
- Matthew:
- Upgrade planning
- gridftp-int xroot and parallel transfers testing
- CoD duties
- Shaun:
- ..
- Chris:
- Preparation for Castor Upgrade
- Castor Facilities work
- Richard:
- Ran 2.1.9 functional test suite on newly upgraded 2.1.9 LHCb instance
- Brian:
- ..
- Jens:
- Double-checking post-upgrade dynamic information data
- Discussions on tape publishing
Operations Issues
- Bad job efficiency for LHCb continues. We will increase memory on one SRM next week and plan to add a new SRM node. We also hope that gridftp-int over xroot will improve matters in 2.1.9.
- LSF daemons stopped on gdss81 (atlasStripInput). After a restart, heavy load killed LSF again. Decreasing job slots from 50 to 15 helped.
PreProd
- ..
Blocking issues
- Any ongoing production problems will jeopardize the timeline for starting 2.1.9 upgrades at the end of this month.
Planned, Scheduled and Cancelled Interventions
Entries in/planned to go to GOCDB
Description | Start | End | Type | Affected VO(s) |
---|---|---|---|---|
Update LHCb to 2.1.9 | 27/09/2010 08:00 | 29/09/2010 18:00 | Downtime | LHCb |
Update Gen to 2.1.9 (STC) | 25/10/2010 08:00 | 27/10/2010 18:00 | Downtime | Gen |
Update CMS to 2.1.9 (STC) | 08/11/2010 08:00 | 10/11/2010 18:00 | Downtime | CMS |
Update ATLAS to 2.1.9 (STC) | 22/11/2010 08:00 | 24/11/2010 18:00 | Downtime | ATLAS |
Advanced Planning
- Upgrade to 2.1.9 in 2010
Staffing
- Castor on Call person: Shaun
- Staff absences:
- ..