RAL Tier1 weekly operations castor 29/07/2016

Draft Agenda

  1. Problems encountered this week
  2. Upgrades/improvements made this week
  3. What are we planning to do next week?
  4. Long-term project updates (if not already covered)
    1. Facilities drive reallocation
    2. 2.1.15
  5. Special topics
  6. Actions
  7. Anything for CASTOR-Fabric?
  8. AoTechnicalB
  9. Availability for next week
  10. On-Call
  11. AoOtherB

Minutes from the previous meeting

Operation problems

gdss650 (LHCb d1t0) and gdss634 (atlasTape) went out of production due to hard drive and RAID card problems, respectively

Operation news

Three Dell 2015 disk servers were deployed into atlasTape. Six more are being deployed into cmsTape and lhcbRawRdst

Long-term projects

xroot tests for the 2.1.15 upgrade are in progress. A phone meeting will be arranged with Giuseppe next week

Migration to Aquilon and SL7 upgrade: GP has created a VM on the cloud and will start adding tape server features in Aquilon

Staffing

CP on call next week

Actions

RA to locate the disk servers requiring a RAID update and plan the update with Fabric

RA to decide what to do with the persistent data (for the daily test) that is still on GenScratch

RA to update the doc for xroot certificates

GP to present the stress test results of gdss596 configured with the WAN tuning parameters

Completed actions

CASTOR team: Durham / Leicester Dirac data - need to create separate tape pools / uid / gid

GP to review with RA the mailing lists he is on

Operation problems

gdss678 failed and went out of production

Operation news

All 9 new Dell tape-backed disk servers have been deployed into CASTOR

Long-term projects

Good progress has been made with the CASTOR 2.1.15 upgrade. The gridFTP transfer problem was fixed and a configuration check bug was isolated. Stress, functional and xroot tests will be scheduled

George will liaise with Bruno so that he can better understand the technical requirements of the SL7 upgrade on the tape servers

Staffing

RA on call

Alison away

Actions

RA to locate the disk servers requiring a RAID update and plan the update with Fabric

RA to decide what to do with the persistent data (for the daily test) that is still on GenScratch

RA to update the doc for xroot certificates

GP to present the stress test results of gdss596 configured with the WAN tuning parameters