RAL Tier1 weekly operations Overview 20100802

From GridPP Wiki
Jump to: navigation, search

Production Managers Report

Planned interventions and other operational issues Production Team Report 2010-08-02

LHC Schedule and Experiment Issues

https://wiki.e-science.cclrc.ac.uk/web1/bin/view/EScienceInternal/LhcSchedule

Changes

Approved Changes

Other considered changes

  • CMS move to T10K drives and media (review progress at end of August)
  • Unmount MINOS and SNO filesystems on csfnfs58 (Gareth and Jonathan to discuss)
  • CASTOR 2.1.9 upgrade (meeting to be scheduled for w/b August 16)
  • amanda backup rpm (to be discussed at future CASTOR/Fabric meeting)
  • Update WN to glite update 10 (to be assessed by Martin)
  • Increase walltime on grid2000M queue to 140 hours (revised change needed?)
  • Setting up CMS t1production role (waiting for confirmation of tests by CMS)

Pending scheduling or pending implementation

  • Close SL4 Batch farm (host ACLs changed)

Pending Review

  • Configure all LHCb disk servers to use WAN tcp tuning (associated GGUS ticket open)

Reviewed Changes

  • Restrict access to FTS/LFC production databases to trusted hosts
  • cutover new robot controller
  • Update RAL top-level BDII servers to required baseline version

Team Reports

Fabric

RAL Tier1 weekly operations Fabric 20100726

Grid Services

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_Grid_20100802

CASTOR

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor_02/08/2010

Database

http://www.gridpp.ac.uk/wiki/Operations_Report_02/08/2010