Difference between revisions of "Tier1 Operations Report 2019-06-24"
From GridPP Wiki
(→) |
|||
Line 180: | Line 180: | ||
GGUS Tickets (Snapshot taken during morning of the meeting). | GGUS Tickets (Snapshot taken during morning of the meeting). | ||
|} | |} | ||
+ | |||
+ | |||
{| border=1 align=center | {| border=1 align=center | ||
|- bgcolor="#7c8aaf" | |- bgcolor="#7c8aaf" | ||
Line 193: | Line 195: | ||
! Scope | ! Scope | ||
|- | |- | ||
− | | | + | | 141872 |
+ | | TEAM | ||
+ | | lhcb | ||
+ | | RAL-LCG2 | ||
+ | | top priority | ||
+ | | NGI_UK | ||
+ | | in progress | ||
+ | | 2019-06-26 08:29:00 | ||
+ | | srm-lhcb.gridpp.rl.ac.uk seems in a bad state (time out) | ||
+ | | WLCG | ||
+ | |- | ||
+ | | 141838 | ||
| USER | | USER | ||
| cms | | cms | ||
Line 200: | Line 213: | ||
| NGI_UK | | NGI_UK | ||
| in progress | | in progress | ||
− | | 2019-06- | + | | 2019-06-24 11:13:00 |
− | | | + | | Transfers failing from CERN Tape to RAL Disk |
| WLCG | | WLCG | ||
|- | |- | ||
Line 214: | Line 227: | ||
| Permissions on RAL SE | | Permissions on RAL SE | ||
| EGI | | EGI | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
|- | |- | ||
| 140870 | | 140870 | ||
Line 243: | Line 234: | ||
| less urgent | | less urgent | ||
| NGI_UK | | NGI_UK | ||
− | | | + | | in progress |
− | | 2019-06- | + | | 2019-06-20 14:35:00 |
| Files vanished from RAL tape? | | Files vanished from RAL tape? | ||
| EGI | | EGI | ||
Line 266: | Line 257: | ||
| NGI_UK | | NGI_UK | ||
| in progress | | in progress | ||
− | | 2019-06- | + | | 2019-06-25 13:03:00 |
| mice LFC to DFC transition | | mice LFC to DFC transition | ||
| EGI | | EGI |
Revision as of 09:05, 26 June 2019
RAL Tier1 Operations Report for 24th June 2019
Review of Issues during the week 10th June 2019 to the 17th June 2019. |
Current operational status and issues |
Resolved Castor Disk Server Issues |
Machine | VO | DiskPool | dxtx | Comments |
---|---|---|---|---|
- | - | - | - |
Ongoing Castor Disk Server Issues |
Machine | VO | DiskPool | dxtx | Comments |
---|---|---|---|---|
- | - | - | - | - |
Limits on concurrent batch system jobs. |
Notable Changes made since the last meeting. |
- NTR
Entries in GOC DB starting since the last report. |
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason |
---|---|---|---|---|---|---|---|
- | - | - | - | - | - | - | - |
Declared in the GOC DB |
Service | ID | Scheduled? | Outage/At Risk | Start | End | Duration | Reason |
---|---|---|---|---|---|---|---|
- | - | - | - | - | - | - | - |
- No ongoing downtime
Advanced warning for other interventions |
The following items are being discussed and are still to be formally scheduled and announced. |
Listing by category:
- DNS servers will be rolled out within the Tier1 network.
Open
GGUS Tickets (Snapshot taken during morning of the meeting). |
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope |
---|---|---|---|---|---|---|---|---|---|
141872 | TEAM | lhcb | RAL-LCG2 | top priority | NGI_UK | in progress | 2019-06-26 08:29:00 | srm-lhcb.gridpp.rl.ac.uk seems in a bad state (time out) | WLCG |
141838 | USER | cms | RAL-LCG2 | urgent | NGI_UK | in progress | 2019-06-24 11:13:00 | Transfers failing from CERN Tape to RAL Disk | WLCG |
141608 | USER | snoplus.snolab.ca | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-06-06 08:55:00 | Permissions on RAL SE | EGI |
140870 | USER | t2k.org | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-06-20 14:35:00 | Files vanished from RAL tape? | EGI |
140447 | USER | dteam | RAL-LCG2 | less urgent | NGI_UK | on hold | 2019-05-22 14:20:00 | packet loss outbound from RAL-LCG2 over IPv6 | EGI |
140220 | USER | mice | RAL-LCG2 | less urgent | NGI_UK | in progress | 2019-06-25 13:03:00 | mice LFC to DFC transition | EGI |
139672 | USER | other | RAL-LCG2 | urgent | NGI_UK | waiting for reply | 2019-06-17 08:24:00 | No LIGO pilots running at RAL | EGI |
GGUS Tickets Closed Last week |
Ticket-ID | Type | VO | Site | Priority | Responsible Unit | Status | Last Update | Subject | Scope |
---|---|---|---|---|---|---|---|---|---|
141704 | USER | cms | RAL-LCG2 | less urgent | NGI_UK | solved | 2019-06-13 14:16:00 | PhedEX transfer 1799773 | WLCG |
141262 | TEAM | lhcb | RAL-LCG2 | very urgent | NGI_UK | verified | 2019-06-12 16:02:00 | Users are getting [FATAL] Auth failed | WLCG |
Availability Report |
Day | Atlas | Atlas-Echo | CMS | LHCB | Alice | OPS | Comments |
---|---|---|---|---|---|---|---|
2019-06-11 | 100 | na | 87 | 100 | 100 | na | |
2019-06-12 | 100 | na | 95 | 100 | 100 | na | |
2019-06-13 | 100 | na | 97 | 100 | 100 | na | |
2019-06-14 | 100 | na | 100 | 100 | 100 | na | |
2019-06-15 | 100 | na | 100 | 100 | 100 | na | |
2019-06-16 | 100 | na | 97 | 100 | 100 | na | |
2019-06-17 | 100 | na | 100 | 100 | 100 | na |
Hammercloud Test Report |
Target Availability for each site is 97.0% | Red <90% | Orange <97% |
Day | Atlas HC | CMS HC | Comment | |||||
---|---|---|---|---|---|---|---|---|
2019-06-11 | 100 | 98 | ||||||
2019-06-12 | 100 | 100 | ||||||
2019-06-13 | 100 | 100 | ||||||
2019-06-14 | 100 | 96 | ||||||
2019-06-15 | 100 | 97 | ||||||
2019-06-16 | 100 | 98 | 2019-06-17 | 100 | 98 |
Key: Atlas HC = Atlas HammerCloud (Queue RAL-LCG2_UCORE, Template 841); CMS HC = CMS HammerCloud
Notes from Meeting. |