RAL Tier1 weekly operations Grid 20101011

From GridPP Wiki
Revision as of 14:39, 11 October 2010 by Derek ross (Talk | contribs)

Operational Issues

Description | Start | End | Affected VO(s) | Severity | Status
lcgce01 | | | All WLCG VOs | Reduced resiliency | Replacement host lcgce09 in testing; will move into production shortly

Downtimes

Description Hosts Type Start End Affected VO(s)

Blocking Issues

Description Requested Date Required By Date Priority Status

Developments/Plans

Highlights for Tier-1 Ops Meeting

Highlights for Tier-1 VO Liaison Meeting

Detailed Individual Reports

Alastair

  • Working on ATLAS software server, testing CVMFS
    • 825 test jobs have been run.
    • lcg0805 has been set up for production-style testing; the queue still needs to be added to the ATLAS system.
    • Production tasks submitted.
  • Writing script to graph transfer times for FTS transfers
  • Working on HammerCloud test of CASTOR 2.1.9
    • Analysis queue set up
    • Need to copy DBrelease into pre-prod and replicate
  • Deploying gdss5xx series to atlasStripInput
  • Wrote checksumming script for disk servers

Andrew

  • Testing glite-APEL configuration/joining/publishing; writing change-control document [Ongoing]
  • Capacity planning project [Ongoing]
  • September accounting [Done]
  • Writing squid per-site monitoring scripts [Ongoing]
  • CMS data ops
    • Running data rereco at ASGC [Ongoing]

Catalin

  • Work on glite-LB Quattor profile [ongoing]
  • Investigate (x)ROOT(d) [ongoing]
  • Prepare deployment of production gLite 3.2 LB
  • Migrate remaining databases [done]

Derek

  • CREAM CE Quattor profile [ongoing]
  • Investigating CREAM CE instability [ongoing]
  • Synced Quattor templates to QWG
  • Deployed lcgce09 to prepare to replace lcgce01

Matt

  • Further testing of Quattorised gLite3.2 FTS FEs. [Ongoing]
  • Quattorisation of MyProxy nodes (write up Change Control). [Ongoing]
  • Test FTS SRM/GridFTP ratio configuration.
  • Prep for Tier-1 Resources meeting. [New]
  • Nagios logfile monitoring development. [New]

Richard

  • Prepping for kernel security patch on RAL site-level BDIIs
  • Developing a Quattor template for an ARGUS server
  • Working on the "team status page" being developed as an action from team awayday [ongoing]
  • Reviewing G/S process documentation [ongoing]
  • CASTOR items:


VO Reports

ALICE

ATLAS

CMS

LHCb

OnCall/AoD Cover

OnCall Rota

  • Primary OnCall: Catalin (Mon - Sun)
  • Grid OnCall: