RAL Tier1 weekly operations Grid 20110321

From GridPP Wiki
Jump to: navigation, search

Operational Issues

Description Start End Affected VO(s) Severity Status

Downtimes

Description Hosts Type Start End Affected VO(s)

Blocking Issues

Description Requested Date Required By Date Priority Status

Developments/Plans

Highlights for Tier-1 Ops Meeting

Highlights for Tier-1 VO Liaison Meeting

Detailed Individual Reports

Alastair

  • Working on ATLAS permission change. [On hold]
  • Setting up xrootd for ATLAS at RAL.
    • Talking to ALICE
    • Looking into upgrading castor client on all WN.
  • Disk pool merging and DB change.
    • Cleaning up dark data [Ongoing]
    • Writing change control [Done]
    • Moving files! [Done!]
  • Preparing for Beauty 2011 conference.
  • Requested new VO box for ATLAS Frontier.

Andrew

  • Investigating PhEDEx deletion problems for failed debug transfers [Done]
  • Upgraded PhEDEx prod & debug instances to 4_0_0 [Done]
  • Capacity planning system tidying, merging with UB Schedule [Ongoing]
  • Testing, writing change-control for FTS destination site name change [Done]
  • CMS Data Ops
    • MC rereco at FNAL (WMS issues, problems caused by CERN Oracle outage) [Ongoing]

Catalin

  • involved with CREAM CEs installation and configuration [ongoing]
  • work on quattorised ATLAS Frontier installation [ongoing]
  • investigate another problem/crash on lcgwms03 [done]

Derek

  • Investigating BLParser isssues on lcgce09 [ongoing]
  • Publishing whole node queue [done]
  • Syncing with QWG templates [ongoing]
  • Improving config of small vos in quattor [ongoing]
  • Castor client update rollout [ongoing]

Matt

  • Deploy testbed LFC and MyProxy. [New]
  • Management of FTS groups. [New]
  • Prep for training course (Mon-Wed next week). [New]
  • Testing Hadoop instance. [Ongoing]
  • Contact NFS users. [Ongoing]

Richard

  • Dealing with fall-out from moving a top BDII into the UPS room. [Ongoing]
  • Building an ARGUS server using the new QWG templates [Ongoing]
  • Documenting various items preparatory to handover [Ongoing]
  • CASTOR items:
    • Running some stress tests on preprod instance. [Ongoing]

VO Reports

ALICE

ATLAS

CMS

LHCb

OnCall/AoD Cover

OnCall Rota

  • Primary OnCall:
  • Grid OnCall: Derek
  • AoD: