RAL Tier1 weekly operations castor 09/1/2017

From GridPP Wiki
Jump to: navigation, search

Draft agenda

1. Problems encountered this week

2. Upgrades/improvements made this week

3. What are we planning to do next week?

4. Long-term project updates (if not already covered)

 1. Castor 2.1.15
 2. SL7 upgrade on tape servers

5. Special topics

6. Actions

7. Anything for CASTOR-Fabric?

8. AoTechnicalB

9. Availability for next week

10. On-Call

11. AoOtherB

Operation problems

There was a lag in file transfer from to StorageD to CASTOR

SAM tests failed over Christmas and today

gdss655 RT184514 and gdss780 RT184616 failed and removed from production

Operation news

The firmware upgrade on all V13 disk servers was completed RT182223

All production SRMs have been patched RT182223

Long-term projects

All componnents of Castor 2.1.15 are working. The upgrade of the nameserver is scheduled for 10/01/2017

First draft of castor tapeserver features completed and published for review.

Actions

Drain 10% of the 13 generation of disk servers (lhcbDst) for decommissioning RT181930

Create a RT ticket for a Nagios test of "zombie" transfer manager processes

Arrange a meeting with James to discuss the tapeserver features published in tapeserver-migration sandbox

Staffing

RA on call next week