Difference between revisions of "Operations Bulletin Latest"
(→) |
(→) |
||
Line 485: | Line 485: | ||
===== ===== | ===== ===== | ||
<!-- ******************Edit start********************* -----> | <!-- ******************Edit start********************* -----> | ||
+ | '''Monday 4th February 2019, 14.30 GMT'''<br /> | ||
+ | 41 Open UK Tickets this month. | ||
+ | '''NGI'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139506 (4/2)<br /> | ||
+ | The NGI got a ticket regarding Birmingham's availability figures, which are thrown by the decommissioning of their SE. We need to formulate a reponse, but we should perhaps ask for an A/R recomputation for January for the site. Assigned (4/2) | ||
+ | '''OXFORD'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139431 (30/1)<br /> | ||
+ | A request from CMS to updated the site's site-local-config. Being looked at. In progress (31/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138647 (3/12/18)<br /> | ||
+ | Ticket tracking the t2k DFC migration at Oxford. Kashif has supplied the best file dump that he can without DOME installed. Daniela has asked the VO if they can enact a "clean slate" solution at Oxford to make life easier for all. In progress (31/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131615 (3/11/17)<br /> | ||
+ | Oxford's IPv6 ticket. Kashif has kept this up to date, with some semi-positive news - things are moving in the right direction, however slowly. On Hold (7/1) | ||
+ | |||
+ | '''BRISTOL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139410 (30/1)<br /> | ||
+ | CMS ticket for transfer failures from Florida to the site. Investigation suggests that this might be an IPv6 issue. In progress (4/2) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131613 (3/11/17)<br /> | ||
+ | Bristol's IPv6 ticket. Good progress here, but more holes needed to be poked in the site's v6 firewall. We'll need to check the PS mesh (still all grey for Bristol's v6 endpoints at time of writing). In progress (4/2) | ||
+ | |||
+ | '''BIRMINGHAM'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=137801 (17/10/18)<br /> | ||
+ | Ticket tracking the decommissioning of the Birmingham DPM. The node was removed from gocdb and switched off last week. I can't remember how long these tickets need to be kept open - I should look that up really. Just remember to keep your logs for 90 days Mark! In progress (30/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138894 (17/12/18)<br /> | ||
+ | This ROD ticket for the decommissioned SE might have hit a problem - Mark removed the server from the gocdb but there's still an alarm on the dashboard... On Hold (9/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138244 (12/11/18)<br /> | ||
+ | Meanwhile since killing off the old DPM completely the Birmingham Availability/Reliability figures have started to fix themselves. On Hold (1/2) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131612 (3/11/17)<br /> | ||
+ | Birmingham's v6 ticket. Some good news just before Christmas, hopefully Mark will be able to start dual-stacking once he's cleared his plate a bit. On Hold (24/12/18) | ||
+ | |||
+ | '''GLASGOW'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131611 (3/11/17)<br /> | ||
+ | Only the v6 ticket at Glasgow. Last update (today) was a request for info from the v6 ticket watchers. In progress (4/12/18) | ||
+ | |||
+ | '''EDINBURGH'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139240 (21/1)<br /> | ||
+ | An LHCB ticket about jobs failing, tracked to a "black hole" node that was took offline. Last update was waiting on the VO to confirm if the problem has gone away, which they were having problems doing due to having "issues" at the time. If there's no word from LHCB soon then I would close this ticket. In progress (22/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138243 (12/1/18)<br /> | ||
+ | An availability ticket. I'm a little confused as to why there's still an alarm on the dashboard, as the argo page looks to my eyes like the site has had >85% availability over the last 30 days (only one non-100% day). On Hold (1/2) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131610 (3/11/17)<br /> | ||
+ | ECDF's v6 ticket. Some positive news back in early December, the ticket could do with an update. In progress (4/12/18) | ||
+ | |||
+ | '''DURHAM'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131609 (3/11/17)<br /> | ||
+ | Another site with just the v6 ticket. Last update was the start of December, any news from your network team at all? On Hold (4/12/18) | ||
+ | |||
+ | '''SHEFFIELD'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138649 (3/12/18)<br /> | ||
+ | Sheffield's t2k DFC migration ticket. The site's status is the same as Oxford, and was included in Daniela's query to t2k in that ticket. In progress (9/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131608 (3/11/17)<br /> | ||
+ | Sheffield's v6 ticket. In great need of an update. In progress (30/10) | ||
+ | |||
+ | '''MANCHESTER'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131607 (3/11/17)<br /> | ||
+ | Only the v6 ticket at Manchester too. Things were looking good towards the end of last year, any news? In progress (27/11/18) | ||
+ | |||
+ | '''LIVERPOOL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139411 (30/1)<br /> | ||
+ | A request from Biomed querying if they still need to use the -s option to use the site's space token (note that they're still using lcg tools). John replied that currently this is still the case, but in the DOME future it won't be (due to quotatokens being applied to a directory). On Hold (1/2) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138648 (3/12/18)<br /> | ||
+ | Liverpool's t2k DFC migration ticket. Unlike the other two sites Liverpool is planning on migrating to DOME soonish, so they might not require a "clean slate solution". On Hold (18/12/18) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131606 (3/11/17)<br /> | ||
+ | Liverpool's v6 ticket. Last report had the networking team look at this in the New Year (so now-ish) to dual stack the storage, whilst the perfsonars are happily dual-stacked already. Please update the ticket once you know more (whoch will hopefully be soon-ish). In Progress (5/12/18) | ||
+ | |||
+ | '''LANCASTER'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=137996 (30/10/18)<br /> | ||
+ | A ROD ticket for an http test failure caused by DPM not quite handling http file moves quite right. Waiting on an updated version of DPM to get into epel - I will ask the devs today how that's going. On Hold (14/1) | ||
+ | |||
+ | '''UCL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139101 (8/1)<br /> | ||
+ | A ROD ticket for APEL publishing test failures. Ben has called Andrew McNab in for help installing things. In Progress (30/1) | ||
+ | |||
+ | '''RHUL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=131603 (7/11/17)<br /> | ||
+ | Just the v6 ticket at RHUL too. Simon confirms that there's been no news on this front. In progress (23/1) | ||
+ | |||
+ | '''QMUL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139430 (30/1)<br /> | ||
+ | Another CMS ticket to update the site-local-config. Daniela has sorted it and has asked CMS to confirm. Waiting for reply (4/2) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139097 (7/1)<br /> | ||
+ | LHCB seeing data transfer problems, but this was a while ago. Dan has asked if problems persist. Waiting for reply (30/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138364 (19/11/18)<br /> | ||
+ | QM's t2k DFC migration ticket. Dan was ready to do the data moving bit, just asked for a confirmation of that needed to be done. Is the move underway Dan? In progress (16/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=134573 (17/4/19)<br /> | ||
+ | CMS request to install singularity. Dan is rolling this into the move to C7, which was in the testing phase last November. Any recent news? On Hold (5/11/18) | ||
+ | |||
+ | '''IMPERIAL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139454 (31/1)<br /> | ||
+ | A ticket from a t2k user having trouble accessing post-DFC migration data at RALPP - which for reasons had to be routed to Imperial. Daniela can't spot any problems, so it looks like a user side issue. Although it might be worth checking the t2k.org .lsc files at RALPP. Assigned (should be something else) (31/1) | ||
+ | |||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=138359 (19/11/18)<br /> | ||
+ | Daniela runs such a tight ship at IC that she has to assign other issues to her site - this is the DFC migration master ticket. On Hold (22/1) | ||
+ | |||
+ | '''BRUNEL'''<br /> | ||
+ | https://ggus.eu/?mode=ticket_info&ticket_id=139344 (28/1)<br /> | ||
+ | CMS transfer failures at Brunel. The storage is working fine, but it looks like some files aren't at Brunel that CMS things should be at Brunel, with no explanation of where they went. It's being investigated. In progress (4/2) | ||
+ | |||
+ | '''100IT''' still have ticket: [https://ggus.eu/?mode=ticket_info&ticket_id=137306 137306] (last update 16/1) | ||
+ | |||
+ | '''TIER 1'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138361 138361] (19/11/18)<br /> | ||
+ | The Tier 1's t2k DFC migration ticket. The ticket looks done with, just waiting on t2k to see if things are okay. That seems to be a little unclear, but that might be a VO side problem. In progress (31/1) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138665 138665] (4/12/18)<br /> | ||
+ | The original mice LFC ticket, on hold whilst the above is sorted out. | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=139476 139476] (1/2)<br /> | ||
+ | With the MICE LFC dead in the water this is the request for a dump to migrate to the DFC. In progress (4/2) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=139306 139306] (24/1)<br /> | ||
+ | A request from Duncan to upgrade the RAL perfsonar hosts (and fix some configs). In progress (29/1) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138891 138891] (17/12)<br /> | ||
+ | A ROD availability ticket that looks a bit off - John thinks this is due to invalid tests being run and has opened a counter ticket: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=139198 139198] - from that the test in question is due to be removed this week. On Hold (16/1) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=139477 139477] (1/2)<br /> | ||
+ | A ROD ticket for a couple of sickly ARC CEs. One node is fixed, the other was already on the naughty step for having a high load (possibly from the A-REX slapd process), and it's being poked and prodded. In progress (4/2) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138500 138500] (26/11/18)<br /> | ||
+ | CMS transfers from T2_PL_SWIERK failing. File transfer experts were about to be called in, and the ticket is now On Hold. Is it going to be a tough one to debug? On Hold (30/1) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138033 138033] (1/11/18)<br /> | ||
+ | Atlas ticket for singuarlity job failures at RAL. Still lots of back and forth here, with great efforts from James and Alessandra. In progress (31/1) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=139414 139414] (30/1)<br /> | ||
+ | LHCB jobs seg faulting. It appears these errors all occurred on VMs, and now those VMs have passed on the errors have disappeared too. As there's no way to easily proceed (VM necromancy isn't a thing afaik) then it looks like this one can be closed. In progress (4/2) | ||
<!-- ******************Edit stop********************* -----> | <!-- ******************Edit stop********************* -----> | ||
|} | |} |
Revision as of 17:13, 4 February 2019
Week commencing Monday 28th January 2019 |
Task Areas |
|
|
Meeting Summaries |
|
|