Difference between revisions of "RAL Tier1 weekly operations castor 07/12/2018"

From GridPP Wiki
Jump to: navigation, search
(Created page with "== Standing agenda == 1. Problems encountered this week 2. Upgrades/improvements made this week 3. What are we planning to do next week? 4. Long-term project updates (if n...")
 
(Operation problems)
Line 25: Line 25:
  
 
== Operation problems ==
 
== Operation problems ==
 +
 +
Argo tests for CMS temporarily failed while preparing for CMS migration
  
 
== Operation news ==
 
== Operation news ==

Revision as of 09:56, 7 December 2018

Standing agenda

1. Problems encountered this week

2. Upgrades/improvements made this week

3. What are we planning to do next week?

4. Long-term project updates (if not already covered)

5. Special topics

6. Actions

7. Review Fabric tasks

  1.   Link

8. AoTechnicalB

9. Availability for next week

10. On-Call

11. AoOtherB

Operation problems

Argo tests for CMS temporarily failed while preparing for CMS migration

Operation news

 * Almost all CMS files have been deleted from on cmsDisk
 * Recovery of na62 files declared successfull by the VO
 * Decommission ATLAS headnodes
 * Decommission xrootd-cms-manager

Plans for next few weeks

  * Proceed with the cmsDisk decommissioning
  * Complete kernel patching on CASTOR hosts
  * Oracle/kernel patching for CASTOR Facilities DB
  * Deploy new disk servers for Facilities

Long-term projects

  * New CASTOR WLCGTape instance. Things need doing: Create a seperate xrootd redirector for ALICE
  * CASTOR disk server migration to Aquilon: gdss742 has been compiled with a draft aquilon profile
    but there are problems with the SL7 installation RT216885 

Actions

Staffing

  * RA out until 10/12