Difference between revisions of "RAL Tier1 weekly operations castor 29/07/2016"

From GridPP Wiki

Revision as of 12:55, 29 July 2016

Minutes from the previous meeting

Operation problems

gdss650 (LHCB d1t0) and gdss634 (atlasTape) went out of production due to hard drive and RAID card problems respectively

Operation news

Three Dell 2015 disk servers were deployed into atlasTape. Six more are under deployment into cmsTape and lhcbRawRdst

Long-term projects

xroot tests for the 2.1.15 upgrade are in progress. A phone meeting will be arranged with Giuseppe next week

Migration to aquilon and SL7 upgrade. GP has created a VM on the cloud and will start adding tape server features on aquilon

Staffing

CP on call next week

Actions

RA disk servers requiring RAID update - locate servers and plan for update with fabric

RA to decide what to do with the persistent data (for daily test) that is still on GenScratch

RA to update the doc for xroot certificates

GP to present the stress test results of gdss596 configured with the WAN tuning parameters

Completed actions

CASTOR TEAM Durham / Leicester Dirac data - need to create separate tape pools / uid / gid

GP to review with RA the mailing lists he is on

Operation problems

gdss678 failed and went out of production

Operation news

All 9 new Dell tape-backed disk servers have been deployed into CASTOR

Long-term projects

Good progress has been made with the 2.1.15 upgrade. The gridFTP transfer problem was fixed and a configuration check bug was isolated. Stress, functional and xroot tests will be scheduled

George will liaise with Bruno so that he can better understand the technical requirements of the SL7 upgrade on the tape servers

Staffing

RA on call

Alison, Andrey away

Actions

RA disk servers requiring RAID update - locate servers and plan for update with fabric

RA to decide what to do with the persistent data (for daily test) that is still on GenScratch

RA to update the doc for xroot certificates

GP to present the stress test results of gdss596 configured with the WAN tuning parameters