RAL Tier1 weekly operations castor 17/03/2014

From GridPP Wiki

Revision as of 09:58, 17 March 2014

Operations News

  • Tape servers have been upgraded to 2.1.14-11 - following advice from CERN and successful testing of one tape server.
  • Testing of 2.1.14 is ongoing. We plan to upgrade Facilities on 1 Apr and the Tier 1 NS a few weeks later.

Operations Problems

  • CMS was brought down twice on Friday morning by an unusual problem that appeared under high load. It appears similar to https://savannah.cern.ch/support/?132773, an incident at CERN in which a single subrequest without a corresponding Client entry caused problems.

Blocking Issues

  • none

Planned, Scheduled and Cancelled Interventions

Entries in/planned to go to GOCDB


Advanced Planning

Tasks

  • CASTOR 2.1.14 + SL5/6 testing. The change control has gone through today with few problems.
  • iptables to be installed on lcgcviewer01 to harden the logging system against the injection of junk data by security scans.
  • Quattor cleanup process is ongoing.
  • Installation of new Preprod headnodes.

Interventions

  • none

Staffing

  • CASTOR on-call person:
    • Matthew
  • Staff absence/out of the office:
    • (Mon-Wed) Bruno at QWG meeting
    • (Mon-Fri) Shaun at EUDAT meeting then ISGC
    • (Thu-Fri) Rob at ISGC