Difference between revisions of "RAL Tier1 weekly operations castor 05/08/2016"

From GridPP Wiki

Revision as of 10:22, 5 August 2016

Minutes from the previous meeting

Operation problems

gdss678 failed and went out of production

Operation news

All 9 new Dell tape-backed disk servers have been deployed into CASTOR

Long-term projects

Good progress has been made with the CASTOR 2.1.15 upgrade. The gridFTP transfer problem was fixed and a configuration check bug was isolated. Stress, functional and xroot tests will be scheduled.

George will liaise with Bruno so that he can understand better the technical requirements of the SL7 upgrade on the tape servers

Staffing

RA on call

Alison away

Actions

RA disk servers requiring RAID update - locate servers and plan for update with fabric

RA to decide what to do with the persistent data (for the daily test) that is still on GenScratch

RA to update the doc for xroot certificates

GP to present the stress test results of gdss596 configured with the WAN tuning parameters

Operation problems

gdss634 (atlasTape) and gdss651, gdss763 (preprod) failed and went out of production. gdss634 had all its hard drives replaced and is currently under acceptance testing.

A large number of GridFTP transfers on a 2011 lhcb server resulted in a reduction of performance. Solution: global tightening of the transfermanager weightings. For details see here

Operation news

The name server dump script for ATLAS appears to work and was successfully cronified. See RT 154144 (https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=154144)

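The 2014 disk servers from Ceph are in the last stage of conversion to CASTOR. See RT 173922 (https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=173922&results=c4fe907db036834dc966d3a86199515c)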