RAL Tier1 weekly Operations Grid 20130128

From GridPP Wiki

Operational Issues

Description | Start | End | Affected VO(s) | Severity | Status

Downtimes

Description | Hosts | Type | Start | End | Affected VO(s)

Blocking Issues

Description | Requested Date | Required By Date | Priority | Status

Developments/Plans

Highlights for Tier-1 Ops Meeting

Highlights for Tier-1 VO Liaison Meeting

Detailed Individual Reports

Andrew

  • Last week:
    • Wrote FTS3 change control document
    • Carried out Maui 3.3.1 testing; wrote change control document
    • Built Torque 4.1.4; tried job query & job submission scale tests
    • CMS HLT farm testing
    • CMS processing
    • A/L on Tues, Fri
  • Coming week:
    • Deploy www-ftsmon
    • Upgrade PhEDEx
    • APEL UMD-2 upgrade testing, change control
    • CMS HLT farm testing
    • CMS processing
    • A/L on Tues
    • CMS UK computing meeting Thurs

Catalin

  • Last week:
    • Work on CVMFS for small VOs
    • SL6 EMI-2 topBDII installation and testing
  • This week:
    • CRISTAL 2

Ian

  • Last week:
    • First HEPiX planning meeting
    • Network planning
    • Looked at the new SL 5.9 Beta 1
    • Investigating LHCb job failures
  • Coming week:
    • Planning HEPiX talks
    • Investigating upgrading and expanding Ceph

James

  • Last Week
  • This Week

Orlin

  • Install & Test EMI2/SL6 WNs on the gridSL6 queue [testing ongoing]
  • Install & Configure lcgce12 as a separate cluster/queue: a dedicated CREAM CE for the gridSL6 queue [done]
  • Bring the testbed back into order, check the list of services [ongoing]
  • Create user accounts for NEISS VO [done]
  • Migrate the production Argus server from UMD1/SL5 to EMI2/SL6 [to do]
  • Learn more about cloud computing (OpenNebula, EC2, OpenStack) and run some tests [ongoing]
  • Install & Test Cloud Apache server serving munge keys to WNs [done]
  • Test the possibility of an EMI2/SL6 WN as a preinstalled cloud image with a batch client [to do]
  • Compare jobs running on cloud/hypervisor with jobs running on physical hardware [to do]
  • Focus on running software from CVMFS on SL6 WNs (ask Ian about it) [to do]
  • Learn more about Frontier services and install test lcgvo01/02 frontiers following Alastair's wiki page [ongoing]

VO Reports

ALICE

ATLAS

CMS

LHCb

OnCall/AoD Cover

OnCall Rota

  • Grid OnCall: Andrew (Mon-Sun)

Absences

  • Catalin: Mon-Wed (CRISTAL 2)
  • Andrew: Tues (Annual Leave)