RAL Tier1 weekly operations Grid 20110328

From GridPP Wiki
Jump to: navigation, search

Operational Issues

Description Start End Affected VO(s) Severity Status

Downtimes

Description Hosts Type Start End Affected VO(s)

Blocking Issues

Description Requested Date Required By Date Priority Status

Developments/Plans

Highlights for Tier-1 Ops Meeting

Highlights for Tier-1 VO Liaison Meeting

Detailed Individual Reports

Alastair

  • Working on ATLAS permission change. [On hold]
  • Setting up xrootd for ATLAS at RAL.
    • Talking to ALICE
    • Looking into upgrading castor client on all WN.
  • Disk pool merging and DB change.
    • Cleaning up dark data [Ongoing]
    • Writing change control [Done]
    • Moving files! [Done!]
  • Preparing for Beauty 2011 conference.
  • Requested new VO box for ATLAS Frontier.

Andrew

  • Deployed 18 diskservers to cmsNonProd [Done]
  • CMS VOBOX upgraded to glite version 21 [Done]
  • Capacity Planning System (added Q2 allocations; 2010 hardware updates; plot transparency issue fixed; UB schedule merging)
  • CMS Data Ops
    • MC rereco at FNAL, IN2P3 [Ongoing]

Catalin

  • involved with CREAM CEs installation and configuration [ongoing]
  • work on quattorised ATLAS Frontier installation [ongoing]
  • GridPP 26 - Brighton (Tue - Wed)

Derek

  • Investigating BLParser isssues on lcgce09 [done]
  • Syncing with QWG templates [ongoing]
  • Updating templates for change to QWG VO configuration [done]
  • Improving config of small vos in quattor [ongoing]
  • Castor client update rollout [done]
  • Deploying test quattorised WMS [ongoing]
  • CE Documentation [done]
  • GridPP Mon-Wed
  • Proposed decommission of lcgce02 on 4th Apr to H1

Matt

  • Arrange Grid Team Future meeting. [Done]
  • Deploy testbed LFC and MyProxy. [Ongoing]
  • Management of FTS groups. [Ongoing]
  • Prep for training course (Mon-Wed next week). [Done]
  • Testing Hadoop instance. [Ongoing]
  • Contact NFS users. [Ongoing]

Richard

  • New top BDII Quattor templates built and tested. Now plan to re-instate the two "missing" servers and then update the others. [Ongoing]
  • CASTOR items:
    • Writing up some documentation on stress testing and giving demos to Rich S, Matthew & Chris. [Ongoing]

VO Reports

ALICE

ATLAS

CMS

LHCb

OnCall/AoD Cover

OnCall Rota

  • Primary OnCall:
  • Grid OnCall: Matt/Derek
  • AoD: