RAL Tier1 weekly operations castor 28/4/2017

From GridPP Wiki
Jump to: navigation, search

Draft agenda

1. Problems encountered this week

2. Upgrades/improvements made this week

3. What are we planning to do next week?

4. Long-term project updates (if not already covered)

  1. SL7 upgrade on tape servers
  2. SRM upgrade to SL6/CASTOR 2.1.16
  3. SL5 elimination from CASTOR functional test boxes and tape verification server

5. Special topics

6. Actions

7. Anything for CASTOR-Fabric?

8. AoTechnicalB

9. Availability for next week

10. On-Call

11. AoOtherB

Operation problems

Bug in the 2.1.16 CASTOR SRM: srmbed is picking up requests from srmfed with delay that ranges from 1-10 sec

SAM tests fail; may have something to do with the large number of locking sessions on the CMS stager DB

Operation news

Plans for next week

RA to work on testing CASTOR 2.16-13 on preprod

Miguel to time the DB upgrade script

Long-term projects

CIP migration to aquilon and upgrade to SL6

SL6 upgrade on functional test boxes and tape verification server

Tape-server migration to aquilon and SL7 upgrade (on hold at the moment)

Actions

DB hardware upgrade tracking

Drain and decomission/recomission the 12 generation disk servers

Staffing

GP on call next week

CP away