Difference between revisions of "RAL Tier1 weekly operations castor 17/2/2017"
(→Plans for next week) |
(→Operations problems) |
||
Line 30: | Line 30: | ||
CMS user time-out problems around 94-96%. | CMS user time-out problems around 94-96%. | ||
− | LHCb file transfers from worker to CASTOR turned out be due to LHCb file catalogs | + | LHCb file transfers from worker nodes to CASTOR turned out be due to LHCb file catalogs |
== Operations news == | == Operations news == |
Revision as of 11:54, 17 February 2017
Contents
Draft agenda
1. Problems encountered this week
2. Upgrades/improvements made this week
3. What are we planning to do next week?
4. Long-term project updates (if not already covered)
1. SL7 upgrade on tape servers 2. SRM upgrade to SL6/CASTOR 2.1.16
5. Special topics
6. Actions
7. Anything for CASTOR-Fabric?
8. AoTechnicalB
9. Availability for next week
10. On-Call
11. AoOtherB
Operations problems
CMS user time-out problems around 94-96%.
LHCb file transfers from worker nodes to CASTOR turned out be due to LHCb file catalogs
Operations news
Plans for next week
Rob will continue writing the CIP on aquilon
Long-term projects
CIP migration to aquilon and upgrade to SL6
SRM upgrade to SL6/CASTOR 2.1.16: An VM configured as SL6/2.1.16 SRM for preprod passed the CASTOR functional tests
Tape-server migration to aquilon and SL7 upgrade (on hold for the moment)
Actions
Drain 10% of the 13 generation of disk servers (lhcbDst) for decommissioning
Generate a CASTOR bug report for the the open DB cursors problem
Add GP to the mail of CASTOR overwatch script
Staffing
RA on call next week
GP away