Difference between revisions of "Tier1 Operations Report 2019-11-20"

From GridPP Wiki
Line 233: Line 233:
  ! Scope
  |-
- | 143967
+ | 143917
  | USER
  | cms
Line 239: Line 239:
  | urgent
  | NGI_UK
- | solved
+ | closed
- | 2019-11-09 00:17:00
+ | 2019-11-19 23:59:00
- | T1_UK_RAL is failing SAM - SRM, XRD
+ | Transfers failing to T1_UK_RAL_Disk
  | WLCG
  |-
- | 143965
+ | 143876
- | TEAM
- | atlas
- | RAL-LCG2
- | urgent
- | NGI_UK
- | solved
- | 2019-11-08 11:42:00
- | RAL-LCG2: TRANSFER [70] TRANSFER globus_ftp_client: the server responded with an error 421
- | WLCG
- |-
- | 143916
  | USER
  | cms
Line 261: Line 250:
  | urgent
  | NGI_UK
- | solved
+ | closed
- | 2019-11-11 08:39:00
+ | 2019-11-15 23:59:00
- | Transfers failing to T1_UK_RAL_Disk
+ | T1_UK_RAL HammerCloud cannot reach files via xrootd
  | WLCG
  |-
- | 143869
+ | 143838
  | TEAM
- | lhcb
+ | atlas
  | RAL-LCG2
- | very urgent
+ | less urgent
  | NGI_UK
- | verified
+ | closed
- | 2019-11-06 11:31:00
+ | 2019-11-15 23:59:00
- | (again) file transfers low efficiency
+ | RAL-LCG2: TRANSFER an end-of-file was reached globus_xio: An end of file occurred
  | WLCG
  |-
- | 143774
+ | 143834
  | USER
  | cms
Line 284: Line 273:
  | NGI_UK
  | closed
- | 2019-11-08 23:59:00
+ | 2019-11-13 23:59:00
- | cernvmfs.gridpp.rl.ac.uk inaccessible over IPv6
+ | transfers failing to T1_UK_RAL_Disk
- | EGI
- |-
- | 143767
- | USER
- | cms
- | RAL-LCG2
- | urgent
- | NGI_UK
- | solved
- | 2019-11-11 08:24:00
- | FIle read issues for Workflows where data is located at T1_UK_RAL
  | WLCG
  |-
- | 143765
+ | 143645
- | USER
+ | TEAM
- | cms
+ | lhcb
  | RAL-LCG2
- | urgent
+ | top priority
  | NGI_UK
- | closed
+ | verified
- | 2019-11-07 23:59:00
+ | 2019-11-19 08:30:00
- | RAL redirector unsubscribed from  federation
+ | Jobs Failed to access files at RAL-LCG2
  | WLCG
  |}
+
+
  <!-- **********************End Availability Report************************** ----->
  <!-- *********************************************************************** ----->

Revision as of 12:58, 20 November 2019

RAL Tier1 Operations Report for 20th November 2019

Review of issues during the week from 13th November 2019 to 19th November 2019.
  • N


Current operational status and issues
Notable Changes made since the last meeting.
  • NTR
Entries in GOC DB starting since the last report.
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason
Declared in the GOC DB
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason
- | - | - | - | - | - | - | -
  • No ongoing downtime
Advance warning for other interventions
The following items are being discussed and are still to be formally scheduled and announced.


Listing by category:


Open GGUS Tickets
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope
144024 | USER | cms | RAL-LCG2 | very urgent | NGI_UK | waiting for reply | 2019-11-15 15:39:00 | File Read Issues where files are located at RAL | WLCG
144015 | USER | other | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-11-20 10:08:00 | Stalled LSST jobs at RAL | EGI
143762 | TEAM | lhcb | RAL-LCG2 | urgent | NGI_UK | in progress | 2019-10-23 14:12:00 | Stop using sl6 queues at RAL | WLCG
143669 | USER | snoplus.snolab.ca | RAL-LCG2 | urgent | NGI_UK | on hold | 2019-11-18 09:13:00 | SNO+ LFC to DFC migration | EGI
143323 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | on hold | 2019-11-18 14:16:00 | File deletion at RAL ECHO | WLCG
142350 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | on hold | 2019-11-18 14:59:00 | Proble accessing some LHCb files at RAL | WLCG



GGUS Tickets Closed Last week
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope
143917 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-11-19 23:59:00 | Transfers failing to T1_UK_RAL_Disk | WLCG
143876 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-11-15 23:59:00 | T1_UK_RAL HammerCloud cannot reach files via xrootd | WLCG
143838 | TEAM | atlas | RAL-LCG2 | less urgent | NGI_UK | closed | 2019-11-15 23:59:00 | RAL-LCG2: TRANSFER an end-of-file was reached globus_xio: An end of file occurred | WLCG
143834 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-11-13 23:59:00 | transfers failing to T1_UK_RAL_Disk | WLCG
143645 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | verified | 2019-11-19 08:30:00 | Jobs Failed to access files at RAL-LCG2 | WLCG


Availability Report

Day Atlas CMS LHCB Alice
2019-11-06 100 97 100 100
2019-11-07 100 87 100 100
2019-11-08 100 100 100 100
2019-11-09 100 100 100 100
2019-11-10 100 100 100 100
2019-11-11 100 100 100 100
2019-11-12 100 100 100 100
HammerCloud Test Report
Target Availability for each site is 97.0%
Day | Atlas HC | CMS HC | Comment
2019-11-06 | 100 | 98 |
2019-11-07 | 100 | n/a |
2019-11-08 | 100 | n/a |
2019-11-09 | 0 | 93 |
2019-11-10 | 0 | n/a |
2019-11-11 | 0 | 88 |
2019-11-12 | 100 | 88 |

Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud
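
As an illustration only, the HammerCloud figures above can be compared against the 97.0% target with a short script. This is a minimal sketch, not part of the Tier1 tooling: the dictionary layout and the function name days_below_target are hypothetical, the values are copied by hand from the table, "n/a" entries are represented as None and skipped, and applying the target to each day individually is an assumption.

```python
# Target quoted in the report; applying it per day is an assumption of this sketch.
TARGET = 97.0

# Daily HammerCloud figures copied from the table above; None stands for "n/a".
hammercloud = {
    "2019-11-06": {"Atlas HC": 100, "CMS HC": 98},
    "2019-11-07": {"Atlas HC": 100, "CMS HC": None},
    "2019-11-08": {"Atlas HC": 100, "CMS HC": None},
    "2019-11-09": {"Atlas HC": 0, "CMS HC": 93},
    "2019-11-10": {"Atlas HC": 0, "CMS HC": None},
    "2019-11-11": {"Atlas HC": 0, "CMS HC": 88},
    "2019-11-12": {"Atlas HC": 100, "CMS HC": 88},
}


def days_below_target(results, target=TARGET):
    """Return {column: [days below target]}, ignoring n/a (None) entries."""
    below = {}
    for day in sorted(results):
        for column, value in results[day].items():
            if value is not None and value < target:
                below.setdefault(column, []).append(day)
    return below


if __name__ == "__main__":
    for column, days in days_below_target(hammercloud).items():
        print(f"{column} below {TARGET}% on: {', '.join(days)}")
```

With the figures as listed, this would flag Atlas HC on 2019-11-09 to 2019-11-11 and CMS HC on 2019-11-09, 2019-11-11 and 2019-11-12.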

Notes from Meeting.