RAL Tier1 weekly operations Grid 20110321
From GridPP Wiki
Operational Issues
Description
|
Start
|
End
|
Affected VO(s)
|
Severity
|
Status
|
Downtimes
Description
|
Hosts
|
Type
|
Start
|
End
|
Affected VO(s)
|
Blocking Issues
Description
|
Requested Date
|
Required By Date
|
Priority
|
Status
|
|
|
|
|
|
Developments/Plans
Highlights for Tier-1 Ops Meeting
Highlights for Tier-1 VO Liaison Meeting
Detailed Individual Reports
Alastair
- Working on ATLAS permission change. [On hold]
- Setting up xrootd for ATLAS at RAL.
- Talking to ALICE
- Looking into upgrading castor client on all WN.
- Disk pool merging and DB change.
- Cleaning up dark data [Ongoing]
- Writing change control [Done]
- Moving files! [Done!]
- Preparing for Beauty 2011 conference.
- Requested new VO box for ATLAS Frontier.
Andrew
- Investigating PhEDEx deletion problems for failed debug transfers [Done]
- Upgraded PhEDEx prod & debug instances to 4_0_0 [Done]
- Capacity planning system tidying, merging with UB Schedule [Ongoing]
- Testing, writing change-control for FTS destination site name change [Done]
- CMS Data Ops
- MC rereco at FNAL (WMS issues, problems caused by CERN Oracle outage) [Ongoing]
Catalin
- involved with CREAM CEs installation and configuration [ongoing]
- work on quattorised ATLAS Frontier installation [ongoing]
- investigate another problem/crash on lcgwms03 [done]
Derek
- Investigating BLParser isssues on lcgce09 [ongoing]
- Publishing whole node queue [done]
- Syncing with QWG templates [ongoing]
- Improving config of small vos in quattor [ongoing]
- Castor client update rollout [ongoing]
Matt
- Deploy testbed LFC and MyProxy. [New]
- Management of FTS groups. [New]
- Prep for training course (Mon-Wed next week). [New]
- Testing Hadoop instance. [Ongoing]
- Contact NFS users. [Ongoing]
Richard
- Dealing with fall-out from moving a top BDII into the UPS room. [Ongoing]
- Building an ARGUS server using the new QWG templates [Ongoing]
- Documenting various items preparatory to handover [Ongoing]
- CASTOR items:
- Running some stress tests on preprod instance. [Ongoing]
VO Reports
ALICE
ATLAS
CMS
LHCb
OnCall/AoD Cover
OnCall Rota
- Primary OnCall:
- Grid OnCall: Derek
- AoD: