Difference between revisions of "Tier1 Operations Report 2019-10-30"

From GridPP Wiki
Jump to: navigation, search
()
()
 
Line 237: Line 237:
 
! Subject
 
! Subject
 
! Scope
 
! Scope
 +
|-
 +
| 143838
 +
| TEAM
 +
| atlas
 +
| RAL-LCG2
 +
| less urgent
 +
| NGI_UK
 +
| solved
 +
| 2019-10-30 11:21:00
 +
| RAL-LCG2: TRANSFER an end-of-file was reached globus_xio: An end of file occurred
 +
| WLCG
 +
|-
 +
| 143834
 +
| USER
 +
| cms
 +
| RAL-LCG2
 +
| urgent
 +
| NGI_UK
 +
| solved
 +
| 2019-10-30 11:48:00
 +
| transfers failing to T1_UK_RAL_Disk
 +
| WLCG
 +
|-
 +
| 143831
 +
| TEAM
 +
| lhcb
 +
| RAL-LCG2
 +
| very urgent
 +
| NGI_UK
 +
| solved
 +
| 2019-10-30 11:49:00
 +
| low efficiency at gsiftp://gridftp.echo.stfc.ac.uk
 +
| WLCG
 
|-
 
|-
 
| 143818
 
| 143818

Latest revision as of 11:57, 30 October 2019

RAL Tier1 Operations Report for 30th October 2019

Review of Issues during the week 23rd October 2019 to the 29th October 2019.
  • Echo IOPS contention leading to failing trasnfers ( in particular into ECHO)
  • Ipv6 CVMFS issue reappeared and been resolved.
  • DUNE CASTOR intervention
  • ALICE migrating data from CASTOR DISK to ECHO using new gateways.


Current operational status and issues
Notable Changes made since the last meeting.
  • NTR
Entries in GOC DB starting since the last report.
Service ID Scheduled? Outage/At Risk Start End Duration Reason
Declared in the GOC DB
Service ID Scheduled? Outage/At Risk Start End Duration Reason
- - - - - - - -
  • No ongoing downtime
Advanced warning for other interventions
The following items are being discussed and are still to be formally scheduled and announced.


Listing by category:

  • DNS servers will be rolled out within the Tier1 network.
Open GGUS Tickets
Ticket-ID Type VO Site Priority Responsible Unit Status Last Update Subject Scope
143767 USER cms RAL-LCG2 urgent NGI_UK in progress 2019-10-29 07:33:00 FIle read issues for Workflows where data is located at T1_UK_RAL WLCG
143762 TEAM lhcb RAL-LCG2 urgent NGI_UK in progress 2019-10-23 14:12:00 Stop using sl6 queues at RAL WLCG
143669 USER snoplus.snolab.ca RAL-LCG2 urgent NGI_UK in progress 2019-10-18 14:25:00 SNO+ LFC to DFC migration EGI
143645 TEAM lhcb RAL-LCG2 top priority NGI_UK in progress 2019-10-21 11:02:00 Jobs Failed to access files at RAL-LCG2 WLCG
143323 TEAM lhcb RAL-LCG2 top priority NGI_UK in progress 2019-10-14 08:45:00 File deletion at RAL ECHO WLCG
142350 TEAM lhcb RAL-LCG2 top priority NGI_UK in progress 2019-10-28 17:14:00 Proble accessing some LHCb files at RAL WLCG



GGUS Tickets Closed Last week
Ticket-ID Type VO Site Priority Responsible Unit Status Last Update Subject Scope
143838 TEAM atlas RAL-LCG2 less urgent NGI_UK solved 2019-10-30 11:21:00 RAL-LCG2: TRANSFER an end-of-file was reached globus_xio: An end of file occurred WLCG
143834 USER cms RAL-LCG2 urgent NGI_UK solved 2019-10-30 11:48:00 transfers failing to T1_UK_RAL_Disk WLCG
143831 TEAM lhcb RAL-LCG2 very urgent NGI_UK solved 2019-10-30 11:49:00 low efficiency at gsiftp://gridftp.echo.stfc.ac.uk WLCG
143818 TEAM lhcb RAL-LCG2 very urgent NGI_UK verified 2019-10-28 09:23:00 Data transfers problem at RAL-LCG2 WLCG
143774 USER cms RAL-LCG2 urgent NGI_UK solved 2019-10-25 11:00:00 cernvmfs.gridpp.rl.ac.uk inaccessible over IPv6 EGI
143765 USER cms RAL-LCG2 urgent NGI_UK solved 2019-10-23 16:26:00 RAL redirector unsubscribed from federation WLCG
143569 TEAM atlas RAL-LCG2 top priority NGI_UK closed 2019-10-23 23:59:00 Problem with FTS at RAL WLCG
143565 USER cms RAL-LCG2 urgent NGI_UK closed 2019-10-23 23:59:00 RAL FTS is Down WLCG




Availability Report

Day Atlas CMS LHCB Alice Comments
2019-10-23 100 100 100 100
2019-10-24 100 100 100 100
2019-10-25 100 100 100 100
2019-10-26 100 100 100 100
2019-10-27 100 100 100 100
2019-10-28 100 98 100 100
2019-10-29 100 100 100 100
Hammercloud Test Report
Target Availability for each site is 97.0%
Day Atlas HC CMS HC Comment
2019-10-23 100 98
2019-10-24 100 98
2019-10-25 100 96
2019-10-26 100 88
2019-10-27 100 68
2019-10-28 97 58
2019-10-29 100 58

Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud

Notes from Meeting.