Difference between revisions of "Tier1 Operations Report 2019-08-07"
From GridPP Wiki
(→) |
(→) |
||
Line 199: | Line 199: | ||
| style="background-color: #b7f1ce; border-bottom: 1px solid silver; text-align: center; font-size: 1em; font-weight: bold; margin-top: 0; margin-bottom: 0; padding-top: 0.1em; padding-bottom: 0.1em;" | GGUS Tickets Closed Last week | | style="background-color: #b7f1ce; border-bottom: 1px solid silver; text-align: center; font-size: 1em; font-weight: bold; margin-top: 0; margin-bottom: 0; padding-top: 0.1em; padding-bottom: 0.1em;" | GGUS Tickets Closed Last week | ||
|} | |} | ||
− | |||
{| border=1 align=center | {| border=1 align=center | ||
Line 214: | Line 213: | ||
! Scope | ! Scope | ||
|- | |- | ||
− | | | + | | 142520 |
| USER | | USER | ||
| cms | | cms | ||
Line 220: | Line 219: | ||
| urgent | | urgent | ||
| NGI_UK | | NGI_UK | ||
− | | | + | | solved |
− | | 2019-07- | + | | 2019-07-31 14:03:00 |
− | | | + | | T1_UK_RAL is failing SAM tests |
| WLCG | | WLCG | ||
|- | |- | ||
− | | | + | | 142516 |
− | | | + | | ALARM |
− | | | + | | none |
| RAL-LCG2 | | RAL-LCG2 | ||
− | | urgent | + | | top priority |
+ | | NGI_UK | ||
+ | | verified | ||
+ | | 2019-08-05 12:46:00 | ||
+ | | This TEST ALARM has been raised for testing GGUS alarm work flow after a new GGUS release. | ||
+ | | WLCG | ||
+ | |- | ||
+ | | 142241 | ||
+ | | TEAM | ||
+ | | atlas | ||
+ | | RAL-LCG2 | ||
+ | | less urgent | ||
| NGI_UK | | NGI_UK | ||
| closed | | closed | ||
− | | 2019-07- | + | | 2019-07-31 23:59:00 |
− | | | + | | ATLAS-RAL-Frontier service degraded |
− | | | + | | WLCG |
|- | |- | ||
− | | | + | | 142203 |
− | | | + | | TEAM |
− | | | + | | atlas |
| RAL-LCG2 | | RAL-LCG2 | ||
| urgent | | urgent | ||
| NGI_UK | | NGI_UK | ||
− | | | + | | solved |
− | | 2019-07- | + | | 2019-07-31 12:43:00 |
− | | | + | | RAL-LCG2_MCORE jobs failing |
| WLCG | | WLCG | ||
|} | |} | ||
+ | |||
<!-- **********************End Availability Report************************** -----> | <!-- **********************End Availability Report************************** -----> | ||
<!-- *********************************************************************** -----> | <!-- *********************************************************************** -----> |
Revision as of 10:36, 7 August 2019
RAL Tier1 Operations Report for 07st August 2019
Review of Issues during the week 25th July2019 to the 31st July 2019. |
- I
Current operational status and issues |
Notable Changes made since the last meeting. |
- Production FTS instance upgraded
Entries in GOC DB starting since the last report. |
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason |
---|---|---|---|---|---|---|---|
- | - | - | - | - | - | - | - |
Declared in the GOC DB |
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason |
---|---|---|---|---|---|---|---|
- | - | - | - | - | - | - | - |
- No ongoing downtime
Advanced warning for other interventions |
The following items are being discussed and are still to be formally scheduled and announced. |
Listing by category:
- DNS servers will be rolled out within the Tier1 network.
Open GGUS Tickets |
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope |
---|---|---|---|---|---|---|---|---|---|
142350 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | in progress | 2019-08-06 14:48:00 | Proble accessing some LHCb files at RAL | WLCG |
142337 | TEAM | lhcb | RAL-LCG2 | very urgent | NGI_UK | waiting for reply | 2019-07-31 15:13:00 | Pilots Failed at RAL-LCG2 | WLCG |
140447 | USER | dteam | RAL-LCG2 | less urgent | NGI_UK | on hold | 2019-07-10 13:41:00 | packet loss outbound from RAL-LCG2 over IPv6 | EGI |
140220 | USER | mice | RAL-LCG2 | less urgent | NGI_UK | waiting for reply | 2019-07-31 15:27:00 | mice LFC to DFC transition | EGI |
GGUS Tickets Closed Last week |
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope |
---|---|---|---|---|---|---|---|---|---|
142520 | USER | cms | RAL-LCG2 | urgent | NGI_UK | solved | 2019-07-31 14:03:00 | T1_UK_RAL is failing SAM tests | WLCG |
142516 | ALARM | none | RAL-LCG2 | top priority | NGI_UK | verified | 2019-08-05 12:46:00 | This TEST ALARM has been raised for testing GGUS alarm work flow after a new GGUS release. | WLCG |
142241 | TEAM | atlas | RAL-LCG2 | less urgent | NGI_UK | closed | 2019-07-31 23:59:00 | ATLAS-RAL-Frontier service degraded | WLCG |
142203 | TEAM | atlas | RAL-LCG2 | urgent | NGI_UK | solved | 2019-07-31 12:43:00 | RAL-LCG2_MCORE jobs failing | WLCG |
Availability Report |
Day | Atlas | CMS | LHCB | Alice | Comments |
---|---|---|---|---|---|
2019-07-24 | 100 | 100 | 100 | 100 | |
2019-07-25 | 100 | 100 | 100 | 100 | |
2019-07-26 | 100 | 100 | 100 | 100 | |
2019-07-27 | 100 | 100 | 100 | 100 | |
2019-07-28 | 100 | 100 | 100 | 100 | |
2019-07-29 | 100 | 100 | 100 | 100 | |
2019-07-30 | 100 | 91 | 100 | 100 |
Hammercloud Test Report |
Target Availability for each site is 97.0% |
Day | Atlas HC | CMS HC | Comment |
---|---|---|---|
2019-06-24 | 100 | 100 | |
2019-06-25 | 62 | 100 | |
2019-06-26 | 100 | n/a | |
2019-06-27 | 100 | 100 | |
2019-06-28 | 100 | 100 | |
2019-07-29 | 100 | 96 | |
2019-07-30 | 96 | 96 |
Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud
Notes from Meeting. |