RAL Tier1 weekly operations Grid 20110822
Operational Issues
Description | Start | End | Affected VO(s) | Severity | Status
Downtimes
Description | Hosts | Type | Start | End | Affected VO(s)
Blocking Issues
Description | Requested Date | Required By Date | Priority | Status
Developments/Plans
Highlights for Tier-1 Ops Meeting
Highlights for Tier-1 VO Liaison Meeting
Detailed Individual Reports
Alastair
- Working on permission change. [Ongoing]
- Looking at Hammer Cloud test results across UK Cloud.
- Frontier: testing new API, monitoring packages and helping deploy new box.
Andrew
- Diskserver deployment for CMS, ATLAS, Gen [Done]
- CMS ACL change; empty directory cleanup [Done]
- VOBOX upgrades to gLite update 28; UI upgrade to gLite update 29 [Done]
- Updated scripts due to Overwatch DB change [Done]
- Added VM APEL, UI, VOBOXs to testbed [Done]
- Set up new CMS tape families [Done]
- Prepare for capacity signoff meeting [To do]
Catalin
- glite-LB updates [done]
- work on VMs and HyperV [ongoing]
- LHCb VOBOX updates [done]
VO Reports
ALICE
ATLAS
CMS
- The software server was temporarily overloaded on 20th August, causing some JobRobot and production jobs to fail. The overload was caused by a period of higher-than-normal job start rate.
LHCb
OnCall/AoD Cover
OnCall Rota
- Primary OnCall: Catalin (Mon - Fri)
- Grid OnCall: Andrew (Sat - Sun)