RAL Tier1 weekly operations castor 12/04/2010

From GridPP Wiki
Jump to: navigation, search

Summary of Previous Week

  • Matthew:
    • Preparing for GridPP24 CASTOR session
    • Install puppetmaster on new hardware
    • Depmon work
  • Shaun:
    • More work on castormon
    • Work on pre-prod
    • Prep for gridPP24
  • Chris:
    • Tested cold stand-by central castor servers
    • Disk server deployment duties
    • Castor 2.1.8/2/1.9 upgrade work
    • Doing work related to Tier1 Security Group project
  • Richard:
    • 2 days A/L
    • Catch-up and preparation for running stress tests on pre-prod
  • Brian:
    • ..
  • Jens:
    • Fixed path for ATLASGROUPDISK (a configo which had not been discovered till now)

Developments for this week

  • Matthew:
    • GridPP24 CASTOR session
    • PMB F2F
    • Install puppetmaster on new hardware
    • Testing quattorized disk server deployment
    • COD and depmon work
  • Shaun:
    • gridPP24
    • Testing SRM2.8-6
    • castormon work
  • Chris:
    • Test SL5 (64bit) disk server with xfs
    • Test cold stand-by central castor servers and then write documentation
    • Disk server deployment duties
    • Test Quattor disk server procedure and build castor disk server
    • Castor 2.1.8/2/1.9 upgrade work
    • Doing work related to Tier1 Security Group project
  • Richard:
    • Now indexes have been added to d/b tables in p/p, re-start running of stress tests on pre-prod
    • 2 days at GridPP at RHUL
  • Brian:
    • ..
  • Jens:
    • GridPP storage workshop and GridPP at RHUL

Operations Issues

  • fetch-crl missing from a few redeployed disk servers
  • congestion during busy period on Atlas due to limited number of disk servers and slots for d2d copy. Relieved by adding 7 disk servers to atlasStripDeg

Blocking issues

None

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB

None

Advanced Planning

  • Upgrade to 2.1.8/2.1.9 2010
  • Upgrade to SRM 2.8-6 after testing is complete
  • ATLAS want to know how much capacity is available in disabled servers (published as Capability). Low priority CIP change to do this.
  • CASTOR Instance for Non LHC 2010Q2
  • Install/enable gridftp-internal on Gen (Before 2.1.8 upgrade)

Staffing

  • Castor on Call person: Matthew
  • Staff absences:
    • Richard,Brian,Jens,Matthew variously at PMB F2F/GridPP/storage workshop (Tues-Thurs)
    • Matthew away Thurs pm