RAL Tier1 weekly operations castor 14/10/2016

From GridPP Wiki

Draft agenda

1. Problems encountered this week

2. Upgrades/improvements made this week

3. What are we planning to do next week?

4. Long-term project updates (if not already covered)

    1. Castor 2.1.15

    2. SL7 upgrade on tape servers

5. Special topics

6. Actions

7. Anything for CASTOR-Fabric?

8. AoTechnicalB

9. Availability for next week

10. On-Call

11. AoOtherB

Operation problems

Gdss730 cmsdisk – disabled; expected to return RO Friday afternoon. UPDATE: looking touch and go at the moment; will discuss with Chetan towards the end of the day.

Gdss744 atlasdisk – currently RO and will stay that way until next week

Gdss612 – has had battery

Lcgclsf03 – gen lsf, failed drive replaced on 12th

Andrey mentioned an issue with the preprod DB and manually entering some data for the preprod environment. I expect Rob will know more about this.

Operation news

Long-term projects

Castor 2.1.15 upgrade has been postponed until January 2017

Development continues on migrating CASTOR tape servers to Aquilon

Actions

GP to present the WAN tuning effect on transfer rates

Get deadlines from Fabric team for OCF/CV14 hand over to CASTOR

RA/GP to deploy the former Ceph OCF14 servers into aliceDisk (see RAL disk server deployment plan by Alastair)

Talk to AL about the issue with unrouted files to tape in CMS

Check if there is a Nagios test that checks for facilities tape drives being down

Staffing

George P holiday

Andrey – out wed

Rob – currently off ill?

Oncall – CP could do the week but not the weekend.