RAL Tier1 weekly operations castor 19/04/2010

From GridPP Wiki
Jump to: navigation, search

Summary of Previous Week

  • Matthew:
    • PMB F2F, GridPP24
    • setting up new puppetmaster
    • CoD work
  • Shaun:
    • ..
  • Chris:
    • Tested cold stand-by central castor servers
    • Disk server deployment duties
    • Castor 2.1.8/2/1.9 upgrade work
    • Worked on Certification instance
  • Richard:
    • 2 days at GridpPP 24
    • Continued stress testing on pre-prod instance
  • Brian:
    • ..
  • Jens:
    • Nothing much castorific; at GridPP most of the week.

Developments for this week

  • Matthew:
    • building and testing new puppetmaster server
    • testing quattorized disk server deployment
  • Shaun:
    • ..
  • Chris:
    • Test SL5 (64bit) disk server with xfs
    • Test cold stand-by central castor servers and then write documentation
    • Disk server deployment duties
    • Test Quattor disk server procedure and build castor disk server
    • Castor 2.1.8/2/1.9 upgrade work
    • Doing work related to Tier1 Security Group project
  • Richard:
    • Stress testing on pre-prod instance
  • Brian:
    • ..
  • Jens:
    • See if I can finish one or more CIP development strands. Hack, hack.

Operations Issues

  • CMS migrations stopped due to cmsWanIn & cmsFarmRead shared tape pools stopping each others streams being created, during CMS reprocessing, while cmsWanIn was quiet and cmsFarmRead was busy. cmsWanIn mighunter schedule changed from 5->30 minutes which fixed the problem.
  • passwd file on gdss92 dropped entry for stager which stopped atlasFarm recalls to this machine
  • hone gsoap errors due to an lsf group not being created

Blocking issues

None

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB

None

Advanced Planning

  • Upgrade to 2.1.8/2.1.9 2010
  • Upgrade to SRM 2.8-6 after testing is complete
  • ATLAS want to know how much capacity is available in disabled servers (published as Capability). Low priority CIP change to do this.
  • CASTOR Instance for Non LHC 2010Q2
  • Install/enable gridftp-internal on Gen (Before 2.1.8 upgrade)

Staffing

  • Castor on Call person: Chris
  • Staff absences:
    • ..