RAL Tier1 weekly operations Grid 20110822
From GridPP Wiki
Revision as of 11:01, 22 August 2011 by Andrew lahiff (Talk | contribs)
Operational Issues
Description | Start | End | Affected VO(s) | Severity | Status |
---|---|---|---|---|---|
Downtimes
Description | Hosts | Type | Start | End | Affected VO(s) |
---|---|---|---|---|---|
Blocking Issues
Description | Requested Date | Required By Date | Priority | Status |
---|---|---|---|---|
Developments/Plans
Highlights for Tier-1 Ops Meeting
Highlights for Tier-1 VO Liaison Meeting
Detailed Individual Reports
Alastair
- Working on permission change. [Ongoing]
- Looking at Hammer Cloud test results across UK Cloud.
- Frontier: testing new API and monitoring packages, and helping deploy new box.
Andrew
- Diskserver deployment for CMS, ATLAS, Gen [Done]
- CMS ACL change; empty directory cleanup [Done]
- VOBOX upgrades to gLite update 28; UI upgrade to gLite update 29 [Done]
- Updated scripts due to Overwatch DB change [Done]
- Added VM APEL, UI, VOBOXs to testbed [Done]
- Set up new CMS tape families [Done]
- Prepare for capacity signoff meeting [To do]
Catalin
- glite-LB updates [done]
- work on VMs and HyperV [ongoing]
- LHCb VOBOX updates [done]
VO Reports
ALICE
ATLAS
CMS
- The software server was temporarily overloaded on 20th August by a period of higher-than-normal job start rate, causing some JobRobot and production jobs to fail.
LHCb
OnCall/AoD Cover
- Primary OnCall: Catalin (Mon - Fri)
- Grid OnCall: Andrew (Sat - Sun)