Difference between revisions of "Tier1 Operations Report 2019-08-14"

From GridPP Wiki
Jump to: navigation, search
()
()
Line 127: Line 127:
  
  
 
+
{| border=1 align=center
 +
|- bgcolor="#7c8aaf"
 +
! Ticket-ID
 +
! Type
 +
! VO
 +
! Site
 +
! Priority
 +
! Responsible Unit
 +
! Status
 +
! Last Update
 +
! Subject
 +
! Scope
 +
|-
 +
| 142710
 +
| TEAM
 +
| lhcb
 +
| RAL-LCG2
 +
| very urgent
 +
| NGI_UK
 +
| waiting for reply
 +
| 2019-08-14 10:14:00
 +
| Staging problems
 +
| WLCG
 +
|-
 +
| 142689
 +
| USER
 +
| cms
 +
| RAL-LCG2
 +
| urgent
 +
| NGI_UK
 +
| in progress
 +
| 2019-08-13 10:17:00
 +
| Transfer failing to RAL_Disk
 +
| WLCG
 +
|-
 +
| 142350
 +
| TEAM
 +
| lhcb
 +
| RAL-LCG2
 +
| top priority
 +
| NGI_UK
 +
| in progress
 +
| 2019-08-14 09:03:00
 +
| Proble accessing some LHCb files at RAL
 +
| WLCG
 +
|-
 +
| 142337
 +
| TEAM
 +
| lhcb
 +
| RAL-LCG2
 +
| very urgent
 +
| NGI_UK
 +
| waiting for reply
 +
| 2019-08-07 15:27:00
 +
| Pilots Failed at RAL-LCG2
 +
| WLCG
 +
|-
 +
| 140447
 +
| USER
 +
| dteam
 +
| RAL-LCG2
 +
| less urgent
 +
| NGI_UK
 +
| on hold
 +
| 2019-07-10 13:41:00
 +
| packet loss outbound from RAL-LCG2 over IPv6
 +
| EGI
 +
|-
 +
| 140220
 +
| USER
 +
| mice
 +
| RAL-LCG2
 +
| less urgent
 +
| NGI_UK
 +
| waiting for reply
 +
| 2019-07-31 15:27:00
 +
| mice LFC to DFC transition
 +
| EGI
 +
|}
  
  

Revision as of 11:43, 14 August 2019

RAL Tier1 Operations Report for 14 August 2019

Review of Issues during the week 25th July2019 to the 31st July 2019.
  • NTR


Current operational status and issues
Notable Changes made since the last meeting.
  • NTR
Entries in GOC DB starting since the last report.
Service ID Scheduled? Outage/At Risk Start End Duration Reason
Declared in the GOC DB
Service ID Scheduled? Outage/At Risk Start End Duration Reason
- - - - - - - -
  • No ongoing downtime
Advanced warning for other interventions
The following items are being discussed and are still to be formally scheduled and announced.


Listing by category:

  • DNS servers will be rolled out within the Tier1 network.
Open GGUS Tickets


Ticket-ID Type VO Site Priority Responsible Unit Status Last Update Subject Scope
142710 TEAM lhcb RAL-LCG2 very urgent NGI_UK waiting for reply 2019-08-14 10:14:00 Staging problems WLCG
142689 USER cms RAL-LCG2 urgent NGI_UK in progress 2019-08-13 10:17:00 Transfer failing to RAL_Disk WLCG
142350 TEAM lhcb RAL-LCG2 top priority NGI_UK in progress 2019-08-14 09:03:00 Proble accessing some LHCb files at RAL WLCG
142337 TEAM lhcb RAL-LCG2 very urgent NGI_UK waiting for reply 2019-08-07 15:27:00 Pilots Failed at RAL-LCG2 WLCG
140447 USER dteam RAL-LCG2 less urgent NGI_UK on hold 2019-07-10 13:41:00 packet loss outbound from RAL-LCG2 over IPv6 EGI
140220 USER mice RAL-LCG2 less urgent NGI_UK waiting for reply 2019-07-31 15:27:00 mice LFC to DFC transition EGI



GGUS Tickets Closed Last week
Ticket-ID Type VO Site Priority Responsible Unit Status Last Update Subject Scope
142520 USER cms RAL-LCG2 urgent NGI_UK solved 2019-07-31 14:03:00 T1_UK_RAL is failing SAM tests WLCG
142516 ALARM none RAL-LCG2 top priority NGI_UK verified 2019-08-05 12:46:00 This TEST ALARM has been raised for testing GGUS alarm work flow after a new GGUS release. WLCG
142241 TEAM atlas RAL-LCG2 less urgent NGI_UK closed 2019-07-31 23:59:00 ATLAS-RAL-Frontier service degraded WLCG
142203 TEAM atlas RAL-LCG2 urgent NGI_UK solved 2019-07-31 12:43:00 RAL-LCG2_MCORE jobs failing WLCG


Availability Report


Day Atlas CMS LHCB Alice Comments
2019-07-31 100 44 100 100
2019-08-01 100 100 77 77
2019-08-02 100 100 100 100
2019-08-03 100 100 100 96
2019-08-04 100 100 100 100
2019-08-05 100 100 100 100
2019-08-06 100 100 100 100
Hammercloud Test Report
Target Availability for each site is 97.0%
Day Atlas HC CMS HC Comment
2019-07-31 96 100
2019-08-01 98 100
2019-08-02 97 100
2019-08-03 97 100
2019-08-04 97 100
2019-08-05 97 96
2019-08-06 97 100

Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud

Notes from Meeting.