RAL Tier1 weekly Operations Grid 20130204

From GridPP Wiki
Jump to: navigation, search

Operational Issues

Description Start End Affected VO(s) Severity Status

Downtimes

Description Hosts Type Start End Affected VO(s)

Blocking Issues

Description Requested Date Required By Date Priority Status
Site firewall hole adjustment for lcggwms02 (rt #107281) 2013-01-30 2013-02-06 Medium Request not submitted to networking yet

Developments/Plans

Highlights for Tier-1 Ops Meeting

Highlights for Tier-1 VO Liaison Meeting

Detailed Individual Reports

Andrew

  • Last week:
    • Maui upgrade to 3.3.1
    • Investigated 'cmsuspilot' jobs
    • Helped NA62 & ATLAS with FTS issues
    • Storage consistency check
    • CMS UK computing meeting
    • CMS processing
    • A/L on Tues
  • Coming week:
    • Try to get CRAB to work with RAL glideinWMS & GridPP cloud
    • HLT cloud & GridPP cloud testing
    • January accounting, incl fixing 1st half January
    • APEL upgrade to EMI-2
    • CMS processing
    • A/L on Tues

Catalin

  • Last week
    • CRISTAL 2 course
    • new topBDII nodes deployment
    • work on cvmfs@RAL for mice
  • This week
    • finalise new topBDII deployment
    • work on cvmfs@RAL for hone
    • consolidate cvmfs"RAL for small VOs

Ian

  • Last week:
    • Upgraded ceph cluster
    • Updated EMI and CA repositories
    • Looked at GSTAT erros in BDII output


  • Coming week:
    • Learn to use ncm-metaconfig
    • setting up S3 interface to ceph
    • Planning aquilon kernel update workflow
    • cleaning up RAL core machien types in Quattor

James

  • Last Week
  • This Week

Orlin

  • Last Week
  • Install & Test EMI2/SL6 WNs on the gridSL6 queue [testing ongoing]
  • Clean up & optimize quattor configuration for EMI2/SL6 WNs [ongoing]
  • Bring the Testbed back in order, check the list of services [ongoing]
  • Install & Test Latest EMI2 update for SL5 WNs production WNs [testing ongoing]
  • Migrate production UMD1/SL5 to EMI2/SL6 Argus Server [done]
  • Learn more about Cloud computing - Open Nebula; EC2; OpenStack & do some tests [ongoing]


  • This Week
  • Clean up & optimize quattor configuration for EMI2/SL6 WNs [ongoing]
  • Update the Work plan for 2012 [to do]
  • Install & Test EMI2/SL6 WNs on the gridSL6 queue [testing ongoing]
  • Bring the Testbed back in order, check the list of services [ongoing]
  • Install & Test Latest EMI2 update for SL5 production WNs [testing ongoing]
  • Remove AFS from EMI2/SL6 WNs [to do]
  • Disable OR Remove AFS from EMI2/SL5 WNs [to do]
  • Learn more about Cloud computing - Open Nebula; EC2; OpenStack & do some tests [ongoing]
  • Test a possibility of EMI2/SL6 WN - preinstalled cloud image with a batch-client [to do]
  • Test and compare jobs running on cloud/hypervisor with physical hardware [to do]
  • Focus on running software from CVMFS on SL6 WNs ---> ask Ian about it [to do]
  • Learn more about Frontier services and install test lcgvo01/02 frontiers following Alastair's wikipage [ongoing]

VO Reports

ALICE

ATLAS

CMS

LHCb

OnCall/AoD Cover

OnCall Rota

  • Grid OnCall: Catalin (Mon-Tue, Thu-Sat), Andrew (Wed & Sun)

Absences

Andrew - Tues A/L

James & Catalin - Weds A/L