Difference between revisions of "Tier1 Operations Report 2019-07-24"
From GridPP Wiki
(→) |
(→) |
||
Line 260: | Line 260: | ||
Availability Report | Availability Report | ||
|} | |} | ||
− | |||
{| border=1 align=center | {| border=1 align=center | ||
|- bgcolor="#7c8aaf" | |- bgcolor="#7c8aaf" | ||
Line 270: | Line 269: | ||
! Comments | ! Comments | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-17 |
+ | | 100 | ||
| 100 | | 100 | ||
| 100 | | 100 | ||
| 100 | | 100 | ||
− | |||
| | | | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-18 |
| 100 | | 100 | ||
| 100 | | 100 | ||
Line 284: | Line 283: | ||
| | | | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-19 |
| 100 | | 100 | ||
| 100 | | 100 | ||
Line 291: | Line 290: | ||
| | | | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-20 |
| 100 | | 100 | ||
| 100 | | 100 | ||
Line 298: | Line 297: | ||
| | | | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-21 |
+ | | 100 | ||
| 100 | | 100 | ||
| 100 | | 100 | ||
− | |||
| 100 | | 100 | ||
| | | | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-22 |
+ | | 100 | ||
+ | | 100 | ||
+ | | 100 | ||
+ | | 100 | ||
+ | | | ||
+ | |- | ||
+ | | 2019-07-23 | ||
+ | | 100 | ||
+ | | 100 | ||
| 100 | | 100 | ||
− | |||
| 100 | | 100 | ||
− | |||
| | | | ||
|- | |- | ||
− | | 2019-07- | + | | 2019-07-24 |
| 100 | | 100 | ||
| 100 | | 100 |
Revision as of 11:46, 24 July 2019
RAL Tier1 Operations Report for 24th July 2019
Review of Issues during the week 17th July2019 to the 24th July 2019. |
- All Tier-1 production systems and processes are currently operating optimally within known and recorded parameters as agreed by all primary stake holders. (Read as "Business as Usual"!)
Current operational status and issues |
Notable Changes made since the last meeting. |
- NTR
Entries in GOC DB starting since the last report. |
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason |
---|---|---|---|---|---|---|---|
- | - | - | - | - | - | - | - |
Declared in the GOC DB |
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason |
---|---|---|---|---|---|---|---|
- | - | - | - | - | - | - | - |
- No ongoing downtime
Advanced warning for other interventions |
The following items are being discussed and are still to be formally scheduled and announced. |
Listing by category:
- DNS servers will be rolled out within the Tier1 network.
Open GGUS Tickets |
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope |
---|---|---|---|---|---|---|---|---|---|
142350 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | in progress | 2019-07-24 10:45:00 | Proble accessing some LHCb files at RAL | WLCG |
142337 | TEAM | lhcb | RAL-LCG2 | very urgent | NGI_UK | in progress | 2019-07-19 12:14:00 | Pilots Failed at RAL-LCG2 | WLCG |
142203 | TEAM | atlas | RAL-LCG2 | urgent | NGI_UK | on hold | 2019-07-19 07:58:00 | RAL-LCG2_MCORE jobs failing | WLCG |
140447 | USER | dteam | RAL-LCG2 | less urgent | NGI_UK | on hold | 2019-07-10 13:41:00 | packet loss outbound from RAL-LCG2 over IPv6 | EGI |
140220 | USER | mice | RAL-LCG2 | less urgent | NGI_UK | waiting for reply | 2019-07-17 15:52:00 | mice LFC to DFC transition | EGI |
GGUS Tickets Closed Last week |
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope |
---|---|---|---|---|---|---|---|---|---|
141990 | USER | cms | RAL-LCG2 | urgent | NGI_UK | closed | 2019-07-23 23:59:00 | Intermittent HC failures at T1_UK_RAL | WLCG |
141968 | USER | cms | RAL-LCG2 | very urgent | NGI_UK | closed | 2019-07-18 23:59:00 | SAM (CE) and Hammer Cloud Failures at T1_UK_RAL | WLCG |
139672 | USER | other | RAL-LCG2 | urgent | NGI_UK | closed | 2019-07-23 23:59:00 | No LIGO pilots running at RAL | EGI |
Availability Report |
Day | Atlas | CMS | LHCB | Alice | Comments |
---|---|---|---|---|---|
2019-07-17 | 100 | 100 | 100 | 100 | |
2019-07-18 | 100 | 100 | 100 | 100 | |
2019-07-19 | 100 | 100 | 100 | 100 | |
2019-07-20 | 100 | 100 | 100 | 100 | |
2019-07-21 | 100 | 100 | 100 | 100 | |
2019-07-22 | 100 | 100 | 100 | 100 | |
2019-07-23 | 100 | 100 | 100 | 100 | |
2019-07-24 | 100 | 100 | 100 | 100 |
Hammercloud Test Report |
Target Availability for each site is 97.0% |
Day | Atlas HC | CMS HC | Comment |
---|---|---|---|
2019-06-10 | 92 | 93 | |
2019-06-11 | 83 | 89 | |
2019-06-12 | 100 | 96 | |
2019-06-13 | 100 | 100 | |
2019-06-14 | 100 | 84 | |
2019-07-15 | 100 | 100 | |
2019-07-16 | 95 | 100 |
Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud
Notes from Meeting. |