Difference between revisions of "RAL Tier1 weekly operations castor 09/1/2017"
(Created page with "== Draft agenda == 1. Problems encountered this week 2. Upgrades/improvements made this week 3. What are we planning to do next week? 4. Long-term project updates (if not ...") |
(→Long-term projects) |
||
(4 intermediate revisions by one user not shown) | |||
Line 27: | Line 27: | ||
== Operation problems == | == Operation problems == | ||
+ | |||
+ | There was a lag in file transfer from to StorageD to CASTOR | ||
+ | |||
+ | SAM tests failed over Christmas and today | ||
+ | |||
+ | gdss655 [https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=184514 RT184514] and gdss780 [https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=184616 RT184616] failed and removed from production | ||
== Operation news == | == Operation news == | ||
+ | |||
+ | The firmware upgrade on all V13 disk servers was completed [https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=182223 RT182223] | ||
+ | |||
+ | All production SRMs have been patched [https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=182223 RT182223] | ||
+ | |||
+ | == Long-term projects == | ||
+ | |||
+ | All componnents of Castor 2.1.15 are working. The upgrade of the nameserver is scheduled for 10/01/2017 | ||
+ | |||
+ | First draft of castor tapeserver features completed and published for review. | ||
+ | |||
+ | == Actions == | ||
+ | |||
+ | Drain 10% of the 13 generation of disk servers (lhcbDst) for decommissioning [https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=181930 RT181930] | ||
+ | |||
+ | Create a RT ticket for a Nagios test of "zombie" transfer manager processes | ||
+ | |||
+ | Arrange a meeting with James to discuss the tapeserver features published in tapeserver-migration sandbox | ||
+ | |||
+ | == Staffing == | ||
+ | |||
+ | RA on call next week |
Latest revision as of 14:21, 9 January 2017
Contents
Draft agenda
1. Problems encountered this week
2. Upgrades/improvements made this week
3. What are we planning to do next week?
4. Long-term project updates (if not already covered)
1. Castor 2.1.15 2. SL7 upgrade on tape servers
5. Special topics
6. Actions
7. Anything for CASTOR-Fabric?
8. AoTechnicalB
9. Availability for next week
10. On-Call
11. AoOtherB
Operation problems
There was a lag in file transfer from to StorageD to CASTOR
SAM tests failed over Christmas and today
gdss655 RT184514 and gdss780 RT184616 failed and removed from production
Operation news
The firmware upgrade on all V13 disk servers was completed RT182223
All production SRMs have been patched RT182223
Long-term projects
All componnents of Castor 2.1.15 are working. The upgrade of the nameserver is scheduled for 10/01/2017
First draft of castor tapeserver features completed and published for review.
Actions
Drain 10% of the 13 generation of disk servers (lhcbDst) for decommissioning RT181930
Create a RT ticket for a Nagios test of "zombie" transfer manager processes
Arrange a meeting with James to discuss the tapeserver features published in tapeserver-migration sandbox
Staffing
RA on call next week