RAL Tier1 weekly operations castor 12/04/2010
From GridPP Wiki
Contents
Summary of Previous Week
- Matthew:
- Preparing for GridPP24 CASTOR session
- Install puppetmaster on new hardware
- Depmon work
- Shaun:
- More work on castormon
- Work on pre-prod
- Prep for gridPP24
- Chris:
- Tested cold stand-by central castor servers
- Disk server deployment duties
- Castor 2.1.8/2/1.9 upgrade work
- Doing work related to Tier1 Security Group project
- Richard:
- 2 days A/L
- Catch-up and preparation for running stress tests on pre-prod
- Brian:
- ..
- Jens:
- Fixed path for ATLASGROUPDISK (a configo which had not been discovered till now)
Developments for this week
- Matthew:
- GridPP24 CASTOR session
- PMB F2F
- Install puppetmaster on new hardware
- Testing quattorized disk server deployment
- COD and depmon work
- Shaun:
- gridPP24
- Testing SRM2.8-6
- castormon work
- Chris:
- Test SL5 (64bit) disk server with xfs
- Test cold stand-by central castor servers and then write documentation
- Disk server deployment duties
- Test Quattor disk server procedure and build castor disk server
- Castor 2.1.8/2/1.9 upgrade work
- Doing work related to Tier1 Security Group project
- Richard:
- Now indexes have been added to d/b tables in p/p, re-start running of stress tests on pre-prod
- 2 days at GridPP at RHUL
- Brian:
- ..
- Jens:
- GridPP storage workshop and GridPP at RHUL
Operations Issues
- fetch-crl missing from a few redeployed disk servers
- congestion during busy period on Atlas due to limited number of disk servers and slots for d2d copy. Relieved by adding 7 disk servers to atlasStripDeg
Blocking issues
None
Planned, Scheduled and Cancelled Interventions
Entries in/planned to go to GOCDB
None
Advanced Planning
- Upgrade to 2.1.8/2.1.9 2010
- Upgrade to SRM 2.8-6 after testing is complete
- ATLAS want to know how much capacity is available in disabled servers (published as Capability). Low priority CIP change to do this.
- CASTOR Instance for Non LHC 2010Q2
- Install/enable gridftp-internal on Gen (Before 2.1.8 upgrade)
Staffing
- Castor on Call person: Matthew
- Staff absences:
- Richard,Brian,Jens,Matthew variously at PMB F2F/GridPP/storage workshop (Tues-Thurs)
- Matthew away Thurs pm