Difference between revisions of "Tier1 Operations Report 2019-11-20"

From GridPP Wiki

Revision as of 13:03, 20 November 2019

RAL Tier1 Operations Report for 20th November 2019

Review of Issues during the week 13th November 2019 to the 19th November 2019.
  • Reboot of the XRootd GW
  • Test FTS DB restarted after failure
  • LSST jobs running


Current operational status and issues
Notable Changes made since the last meeting.
  • NTR (nothing to report)
Entries in GOC DB starting since the last report.
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason
Declared in the GOC DB
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason
- | - | - | - | - | - | - | -
  • No ongoing downtime
Advance warning for other interventions
The following items are being discussed and are still to be formally scheduled and announced.


Listing by category:


Open GGUS Tickets
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope
144024 | USER | cms | RAL-LCG2 | very urgent | NGI_UK | waiting for reply | 2019-11-15 15:39:00 | File Read Issues where files are located at RAL | WLCG
144015 | USER | other | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-11-20 10:08:00 | Stalled LSST jobs at RAL | EGI
143762 | TEAM | lhcb | RAL-LCG2 | urgent | NGI_UK | in progress | 2019-10-23 14:12:00 | Stop using sl6 queues at RAL | WLCG
143669 | USER | snoplus.snolab.ca | RAL-LCG2 | urgent | NGI_UK | on hold | 2019-11-18 09:13:00 | SNO+ LFC to DFC migration | EGI
143323 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | on hold | 2019-11-18 14:16:00 | File deletion at RAL ECHO | WLCG
142350 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | on hold | 2019-11-18 14:59:00 | Problem accessing some LHCb files at RAL | WLCG



GGUS Tickets Closed Last week
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope
143917 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-11-19 23:59:00 | Transfers failing to T1_UK_RAL_Disk | WLCG
143876 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-11-15 23:59:00 | T1_UK_RAL HammerCloud cannot reach files via xrootd | WLCG
143838 | TEAM | atlas | RAL-LCG2 | less urgent | NGI_UK | closed | 2019-11-15 23:59:00 | RAL-LCG2: TRANSFER an end-of-file was reached globus_xio: An end of file occurred | WLCG
143834 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-11-13 23:59:00 | transfers failing to T1_UK_RAL_Disk | WLCG
143645 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | verified | 2019-11-19 08:30:00 | Jobs Failed to access files at RAL-LCG2 | WLCG


Availability Report


Day | Atlas | CMS | LHCB | Alice | Comments
2019-11-13 | 100 | 100 | 100 | 100 |
2019-11-14 | 100 | 100 | 100 | 98 |
2019-11-15 | 100 | 100 | 100 | 100 |
2019-11-16 | 100 | 100 | 100 | 100 |
2019-11-17 | 100 | 100 | 100 | 100 |
2019-11-18 | 100 | 99 | 100 | 100 |
2019-11-19 | 100 | 100 | 100 | 100 |
HammerCloud Test Report
Target Availability for each site is 97.0%
Day | Atlas HC | CMS HC | Comment
2019-11-13 | 100 | 98 |
2019-11-14 | 100 | 99 |
2019-11-15 | 100 | 98 |
2019-11-16 | 100 | 97 |
2019-11-17 | 100 | 98 |
2019-11-18 | 96 | 98 |
2019-11-19 | 100 | 98 |

Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud

Notes from Meeting.