RAL Tier1 weekly Operations Grid 20130211

From GridPP Wiki
Jump to: navigation, search

Operational Issues

Description Start End Affected VO(s) Severity Status

Downtimes

Description Hosts Type Start End Affected VO(s)

Blocking Issues

Description Requested Date Required By Date Priority Status
Site firewall hole adjustment for lcggwms02 (rt #107281) 2013-01-30 2013-02-06 Medium Request not submitted to networking yet

Developments/Plans

Highlights for Tier-1 Ops Meeting

Highlights for Tier-1 VO Liaison Meeting

Detailed Individual Reports

Andrew

  • Last week:
    • APEL upgrade testing + change control [done]
    • January accounting [done]
    • GridPP cloud (getting CRAB to submit to Condor)
    • CMS HLT farm testing [ongoing]
    • Setup SLURM test batch system [ongoing]
    • CMS processing
    • A/L on Tues
  • Coming week:
    • Preparations for batch system meeting
    • Preparations for upcoming capacity signoff meeting
    • Testing SLURM
    • GridPP cloud testing
    • CMS processing
    • A/L on Tues

Catalin

  • Last week
  • This week

Ian

  • Last week:
    • Kernel & errata updates on cvmfs replica
    • Discussion about departmental Public Engagement
    • Clean up of Tier1 core machine types in Quattor
    • Began looking at ceph config & S3
  • Coming week:
    • Further work on adding S3 support to ceph cluster
    • More errata
    • Experiment with errata kernel update workflow in Aquilon

James

  • Last Week
  • This Week

Orlin

  • Last Week
    • Install & Test EMI2/SL6 WNs on the gridSL6 queue [testing ongoing]
    • Clean up & optimize quattor configuration for EMI2/SL6 WNs [ongoing]
    • Bring the Testbed back in order, check the list of services [ongoing]
    • Install & Test Latest EMI2 update for SL5 WNs production WNs [testing ongoing]
    • Learn more about Cloud computing - Open Nebula; EC2; OpenStack & do some tests [ongoing]
    • Update the annual Job plan [ongoing]
    • Remove AFS from EMI2/SL6 WNs [done]
  • This Week
    • Move the gridWN and gridTest queues to EMI2/SL6 CREAMCE lcgce07 [to do]
    • Update the annual Job plan [ongoing]
    • Clean up & optimize quattor configuration for EMI2/SL6 WNs [ongoing]
    • Install & Test EMI2/SL6 WNs on the gridSL6 queue [testing ongoing]
    • Bring the Testbed back in order, check the list of services [ongoing]
    • Install & Test Latest EMI2 update for SL5 production WNs [testing ongoing]
    • Disable OR Remove AFS from EMI2/SL5 WNs [to do]
    • Learn more about Cloud computing - Open Nebula; EC2; OpenStack & do some tests [ongoing]

VO Reports

ALICE

ATLAS

CMS

LHCb

OnCall/AoD Cover

OnCall Rota

  • Grid OnCall: Andrew (Mon- Sun)

Absences

Andrew - Tues A/L

Catalin - All week A/L

James - Monday

Ian - Weds & Thurs PM A/L