Difference between revisions of "RAL Tier1 weekly operations castor 28/06/2010"

From GridPP Wiki
Jump to: navigation, search
 
(No difference)

Latest revision as of 14:09, 28 June 2010

Summary of Previous Week

  • Matthew:
    • MICE D1T2 testing
    • Working with CMS to find best solution to their migration problem
    • DBWF meeting
    • CoD work
    • Changed CMS synchronization from daily to hourly
    • Started the 2.1.9 upgrade change control document
  • Shaun:
    • ..
  • Chris:
    • at CERN
  • Richard:
    • Trying to get new functional tests working
    • Some updates on the pre-prod test CIP machine
    • Spotted some "unreported" SRM test failures
  • Brian:
    • ..
  • Jens:
    • ..

Developments for this week

  • Matthew:
    • CoD work
    • 2.1.9 change control document
    • arranging for access of BADC people to Gen
  • Shaun:
    • ..
  • Chris:
    • ..
  • Richard:
    • Condensing benchmarking metrics into a spreadsheet
    • Adding an extra CIP box as a testbed for changes
    • Run new 2.1.9 functional test suite
    • 2 days A/L
  • Brian:
    • ..
  • Jens:
    • ..

Operations Issues

  • high CMS migration queue due to a large number of temporary files (unmerged files + logs) that should not go to tape. We have agreed to split cmsFarm and move 200TB to a new D1T0 service class (cmsTemp) to avoid the problem in the future
  • Some gridftp failures noticed on gdss547 (atlasScratchDisk). Restarting xinetd fixed the problem
  • voms mapping caused Alice OPS test to run as wrong person - tests failed.

Blocking issues

Lack of a switch which will be needed for the new Facilities instance

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB

None

Advanced Planning

  • Upgrade to 2.1.8/2.1.9 2010

Staffing

  • Castor on Call person: Matt
  • Staff absences: