RAL Tier1 weekly operations Overview 20100222

Production Managers Report

Planned interventions and other operational issues Production Team Report 2010-02-22

LHC Schedule and Experiment Issues

Changes

Approved Changes

  • Database server monitoring - adds enhanced monitoring to database servers
  • Update database clusterware timeout from 10 minutes to 1 minute
  • Address Oracle security vulnerability (Java)

Other considered changes

  • Migration of CMS to T10KB media - separate meeting scheduled to consider plan in detail
  • Add Quattorised BDII - requested additional info in change form
  • Stop serving ATLAS pacman mirror on cmsmove02 - requested change form and more info
  • Migrate Nagios slave servers - requested completed change form

Pending assessment or approval

  • ns checksum trigger on CASTOR SRM
  • Install glexec on worker nodes
  • CASTOR mega database intervention

Reviewed Changes

None

Team Reports

Fabric

RAL Tier1 weekly operations Fabric 20100222

Grid Services

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_Grid_20100222

CASTOR

http://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor_22/02/2010

Database

http://www.gridpp.ac.uk/wiki/Operations_Report_22/02/2010