RAL Tier1 weekly operations castor 26/10/2018
Latest revision as of 10:22, 26 October 2018
Standing agenda
1. Problems encountered this week
2. Upgrades/improvements made this week
3. What are we planning to do next week?
4. Long-term project updates (if not already covered)
5. Special topics
6. Actions
7. Review Fabric tasks
   1. Link
8. AoTechnicalB
9. Availability for next week
10. On-Call
11. AoOtherB
Operation problems
Failure of gdss732 and gdss739 (both from lhcbDst). Both are back in production.
Many checksum errors reported for files sent to CASTOR by Elastic tape for ingest (Kevin)
The new tape accounting system broke
Operation news
Facilities patching complete
All CASTOR d0t1 disk pools and atlasStripInput and cmsDisk patched
Plans for next few weeks
* Migrate ATLAS to the CASTOR WLCGTape instance
* Kernel patching of Eris
* Upgrade CASTOR_REPACK schema on pluto to 2.1.17-35
* Push gsi authentication for xrootd in production, instance by instance (Mon 29/10)
* Proceed with the cmsDisk decommissioning
Long-term projects
New CASTOR WLCGTape instance. Things that need doing: 1) Update the CASTOR ldif file; 2) Fix a misconfiguration on the Eris disk array (WLCGTape cannot be brought into production before this is done); 3) Create a separate xrootd redirector for ALICE
Actions
John to document changes in the CASTOR ldif file
Ask TimA whether the remaining dark data can be deleted and whether ATLAS still needs the cinstancedlf alias
Staffing
RA on call