Difference between revisions of "RAL Tier1 weekly operations castor 20/05/2016"

From GridPP Wiki
Jump to: navigation, search
(Created page with " == Operation news == Automated workflow for disk server deployment has been disabled New CASTOR functional testing using xrootd will be enabled on Monday 23/5/2016 == CASTO...")
 
(CASTOR issues)
Line 18: Line 18:
 
rebuilding  
 
rebuilding  
  
GDSS727 (production D1T0 CMS disk server)FSProbe Error
+
GDSS727 (production D1T0 CMS disk server) FSProbe Error
It has been removed from Production and Overwatch Updated (Gareth)
+
It has been removed from Production and Overwatch Updated (Gareth, [https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=172141])
  
 
Ongoing work on the upgrade to CASTOR 2.1.15 on preprod
 
Ongoing work on the upgrade to CASTOR 2.1.15 on preprod

Revision as of 10:30, 20 May 2016

Operation news

Automated workflow for disk server deployment has been disabled New CASTOR functional testing using xrootd will be enabled on Monday 23/5/2016

CASTOR issues

Heavy wokload on the Atlas scracth disk resulting in almost nothing being achieved

Full recover from the tape robot and air condition problems

Double put start on CASTOR facilities

Some work to be done on the imrovement of the logic of the new draining script

gdss664 was brought back to production on 18/05/2016 at ca. 15:00 folowing a sucessfull rebuilding

GDSS727 (production D1T0 CMS disk server) FSProbe Error It has been removed from Production and Overwatch Updated (Gareth, [1])

Ongoing work on the upgrade to CASTOR 2.1.15 on preprod

GP to talk to Andrew Lahif about a SL7 upgrade on the worker nodes

SRM DB duplicates removal script is under testing

Will test the newly created taple families for ATLAS