Tier1 Operations Report 2019-09-18

From GridPP Wiki

RAL Tier1 Operations Report for 18th September 2019

Review of issues during the week 11th September 2019 to 18th September 2019.
  • TBA


Current operational status and issues
Notable Changes made since the last meeting.
  • NTR
Entries in GOC DB starting since the last report.
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason
All | 27719 | Yes | At Risk | 11-09-19 0530 | 11-09-19 0700 | 90 mins | Network routing hardware link change
Declared in the GOC DB
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason
- | - | - | - | - | - | - | -
  • No ongoing downtime
Advance warning for other interventions
The following items are being discussed and are still to be formally scheduled and announced.


Listing by category:

  • DNS servers will be rolled out within the Tier1 network.
Open GGUS Tickets
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope
143231 | USER | other | RAL-LCG2 | urgent | EGI CVMFS Service | in progress | 2019-09-18 09:28:00 | CVMFS repo dirac.egi.eu updates are not propagated | EGI
143225 | USER | cms | RAL-LCG2 | very urgent | NGI_UK | in progress | 2019-09-17 08:25:00 | some of RAL FTS servers are not running? | WLCG
143218 | TEAM | lhcb | RAL-LCG2 | urgent | NGI_UK | in progress | 2019-09-16 06:45:00 | FTS3 transfers problem to GRIDKA for transfers executing at RAL FTS3 server | WLCG
142835 | USER | snoplus.snolab.ca | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-09-16 14:41:00 | Connection Issues | EGI
142689 | USER | cms | RAL-LCG2 | very urgent | NGI_UK | waiting for reply | 2019-09-12 12:09:00 | Transfer failing to RAL_Disk | WLCG
142350 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | in progress | 2019-09-17 10:03:00 | Problem accessing some LHCb files at RAL | WLCG
140447 | USER | dteam | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-09-11 10:37:00 | packet loss outbound from RAL-LCG2 over IPv6 | EGI
GGUS Tickets Closed Last week


Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope
142981 | USER | mice | RAL-LCG2 | less urgent | NGI_UK | solved | 2019-09-04 13:02:00 | mice; LFC to DFC transition | EGI
142955 | USER | ops | RAL-LCG2 | less urgent | NGI_UK | verified | 2019-09-05 13:15:00 | [Rod Dashboard] Issues detected at RAL-LCG2 | EGI
142782 | TEAM | lhcb | RAL-LCG2 | very urgent | NGI_UK | verified | 2019-09-04 12:32:00 | FTS3 transfers Failed to RAL-RDST at RAL-LCG2 | WLCG
142751 | USER | snoplus.snolab.ca | RAL-LCG2 | top priority | NGI_UK | closed | 2019-09-04 23:59:00 | Data transfer failure and proxy issue | EGI


Availability Report

Day | Atlas | CMS | LHCB | Alice | Comments
2019-09-04 | 100 | 100 | 100 | 91 |
2019-09-05 | 100 | 99 | 100 | 100 |
2019-09-06 | 100 | 100 | 100 | 100 |
2019-09-07 | 100 | 100 | 91 | 92 |
2019-09-08 | 100 | 100 | 0 | 0 |
2019-09-09 | 100 | 100 | 63 | 64 |
2019-09-10 | 100 | 100 | 100 | 100 |
HammerCloud Test Report
Target Availability for each site is 97.0%
Day | Atlas HC | CMS HC | Comment
2019-09-04 | 96 | 99 |
2019-09-05 | 100 | 99 |
2019-09-06 | 100 | 98 |
2019-09-07 | 100 | 99 |
2019-09-08 | 100 | 100 |
2019-09-09 | 93 | 100 |
2019-09-10 | 100 | 100 |

Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud
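
The daily HammerCloud figures can be checked against the stated 97.0% target with a few lines of Python. This is a minimal sketch, not part of any GridPP or HammerCloud tooling: the values are copied from the HammerCloud table in this report, the names `hammercloud`, `days_below_target` and `weekly_mean` are hypothetical, and applying the 97.0% target to each daily figure is an assumption.

```python
TARGET = 97.0  # target availability (%) quoted in this report

# Daily HammerCloud results copied from the report: (day, Atlas HC, CMS HC)
hammercloud = [
    ("2019-09-04", 96, 99),
    ("2019-09-05", 100, 99),
    ("2019-09-06", 100, 98),
    ("2019-09-07", 100, 99),
    ("2019-09-08", 100, 100),
    ("2019-09-09", 93, 100),
    ("2019-09-10", 100, 100),
]


def days_below_target(rows, column, target=TARGET):
    """Return the days on which a column (1 = Atlas HC, 2 = CMS HC) fell below target."""
    return [row[0] for row in rows if row[column] < target]


def weekly_mean(rows, column):
    """Return the arithmetic mean of a column over all listed days."""
    return sum(row[column] for row in rows) / len(rows)


if __name__ == "__main__":
    for name, column in (("Atlas HC", 1), ("CMS HC", 2)):
        below = days_below_target(hammercloud, column)
        mean = weekly_mean(hammercloud, column)
        print(f"{name}: mean {mean:.1f}%, days below {TARGET}%: {below}")
```

Under that assumption, Atlas HC falls below target on 2019-09-04 and 2019-09-09, while CMS HC stays at or above it on every listed day.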

Notes from Meeting.