RAL Tier1 weekly operations castor 02/11/2018
From GridPP Wiki
Revision as of 16:06, 5 November 2018 by Rob Appleyard 7f7797b74a (Talk | contribs)
Contents
Standing agenda
1. Problems encountered this week
2. Upgrades/improvements made this week
3. What are we planning to do next week?
4. Long-term project updates (if not already covered)
5. Special topics
6. Actions
7. Review Fabric tasks
1. Link
8. AoTechnicalB
9. Availability for next week
10. On-Call
11. AoOtherB
Operation problems
* NRPE RPMs not installed on ERIS nodes. Currently with John, ticket: https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=211056 * castor-functional-test1 is still running tests that it shouldn't and called out on Thursday night. Genius John identified the problem (still running tests against decommissioned disk pools), RA to implement a permanent fix.
Operation news
* wlcgTape is in prod.
* GSI authentication for xrootd in production (mon 29/10)
* na62 recovery operation ongoing using wlcgTape.
Plans for next few weeks
* Decommission disk servers from ATLAS d1t0.
* Move all needed disk servers from ATLAS d0t1 to wlcgTape (gdss893, gdss894, gdss895)
* Proceed with the cmsDisk decommissioning
Long-term projects
* New CASTOR WLCGTape instance. Things need doing: Create a seperate xrootd redirector for ALICE
Actions
* Ask TimA about whether remaining dark data can be deleted and whether ATLAS still needs the cinstancedlf alias
Staffing
* GP out until Friday.
* RA on call until Friday (then off to the US)