Difference between revisions of "Operations Bulletin Latest"
(→) |
(→) |
||
Line 455: | Line 455: | ||
===== ===== | ===== ===== | ||
<!-- ******************Edit start********************* -----> | <!-- ******************Edit start********************* -----> | ||
− | '''Monday | + | '''Monday 12th September 2016, 15.00 GMT'''<br /> |
+ | 33 Open UK tickets. | ||
− | + | '''SUSSEX'''<br /> | |
+ | I'm just going to skim the Sussex tickets, I'll contact Jeremy M about these again offline.<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122772 122772] (On Hold) - Atlas webdav/xroot ticket.<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123230 123230] (In progress) - Atlas transfer failures, was partially fixed as of 15/8.<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123740 123740] (Assigned) - Recent low availability ticket.<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123733 123733] (Assigned) - Ops SRM-LS test failures.<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122614 122614] (In progress) - Technically a ticket to the NGI concerning the Sussex status, not a Sussex ticket.<br /> | ||
+ | '''RALPP'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123859 123859] (12/9)<br /> | ||
+ | CMS noticed that the Phedex agents appear to be down at RALPP. Fresh this afternoon. Assigned (12/9) | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123858 123858] (12/9)<br /> | ||
+ | A duplicate of above, I believe because you run 2 Phedex boxes? Assigned (12/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123804 123804] (9/9)<br /> | ||
+ | Low availability ticket, Chris provided an explanation and notes that there have been no tests since - possibly related to the problems noticed in this morning's EGI broadcast. In progress (9/9) | ||
+ | |||
+ | '''OXFORD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=121924 121924] (2/6)<br /> | ||
+ | A ticket from Duncan concerning a drop in perfsonar throughput rates at Oxford. Currently on hold - any ideas perhaps when you'll get round to looking at this? On hold (10/8) | ||
+ | |||
+ | '''BRISTOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123860 123860] (12/9)<br /> | ||
+ | Another fresh "Phedex is down" ticket from CMS. Assigned (12/9) | ||
+ | |||
+ | '''BIRMINGHAM'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122771 122771] (11/7)<br /> | ||
+ | Atlas ticket requesting xroot and webdav endpoints. The submitter requests an update. In progress (12/9) | ||
+ | |||
+ | '''GLASGOW'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=120351 120351] (22/3)<br /> | ||
+ | Enabling LSST at Glasgow. Any news, or plans for having any news soonish? On hold (19/7) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122378 122378] (28/6)<br /> | ||
+ | No perfsonar results for Glasgow, the server appeared borked so David took it down. Is it soon due to rise from the ashes soonish? On hold (28/6) | ||
+ | |||
+ | '''EDINBURGH'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123732 123732] (5/9)<br /> | ||
+ | Nagios ticket for ce6 and ce7, which I believe are defunct CEs? Marcus has put the ticket on hold until Andy is back. On hold (5/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123164 123164] (28/7)<br /> | ||
+ | Another nagios ticket, a glue2.validate one this time. Andy was confused about where this was coming from, and I assume the alarm is still happening. Perhaps the glue-validator will yield some clues? Waiting for reply (29/8) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122653 122653] (7/7)<br /> | ||
+ | The third nagios ticket for ECDF, this one covers the odd saga of the ARCHER queue that would ideally be "IN PRODUCTION, NOT MONITORED". Waiting for reply (this probably should on hold instead) (5/9) | ||
+ | |||
+ | '''DURHAM'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123810 123810] (9/9)<br /> | ||
+ | LHCB noticed that the arc CEs were producing an incorrect number of running/waiting jobs (was this the catalyst for the tb-support thread on the same subject?). The ticket could do with acknowledgment, perhaps someone in the know could lend Durham a hand. Assigned (9/9) | ||
+ | |||
+ | '''SHEFFIELD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123851 123851] (12/9)<br /> | ||
+ | APEL-pub ROD ticket. Matt R has rerun the apel publishing scripts by hand and is awaiting the results, if this doesn't work we might need to ask the apel people how things are looking their end. In progress (12/9) | ||
+ | |||
+ | '''MANCHESTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123813 123813] (10/9)<br /> | ||
+ | Atlas deletion errors, probably due to a downed disk server that has been brought back up. In progress (12/9) | ||
+ | |||
+ | '''LANCASTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123789 123789] (8/9)<br /> | ||
+ | LHCB jobs failing at Lancaster, probably because they're sensitive to some problems we had with our NFS server housing home and sandbox areas. We've hopefully soothed our problems by upping the number of nfs threads, we're in the wait and see period. In progress (12/9) | ||
+ | |||
+ | '''UCL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123734 123734] (5/9)<br /> | ||
+ | ROD apel ticket for UCL. Ben is investigating, it looks like some VAC boxes are having trouble talking to the network. In progress (7/9) | ||
+ | |||
+ | '''QMUL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123400 123400] (15/8)<br /> | ||
+ | Low availability ticket for QM, but Daniela notes the alarms appear to have cleared so this should be able to be closed. On hold (12/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123451 123451] (18/8)<br /> | ||
+ | LHCB had problems with a QM CE, which Dan noticed was fubared and needed a reinstall. Should hopefully be back online soon? On hold (18/8) | ||
+ | |||
+ | '''BRUNEL'''<br /> | ||
+ | Raul solved the two tickets before I could get to them, nicely done. | ||
+ | |||
+ | '''100IT''' have a ticket - [https://ggus.eu/?mode=ticket_info&ticket_id=123753 123753] (6/9) but you don't have to bring yourselves to look at it. | ||
+ | |||
+ | '''The TIER 1'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123794 123794] (9/9)<br /> | ||
+ | Atlas noticed a lot of analysis job failures, after some prodding it turned out that the culprit was one dodgy worker node. Case closed? In progress (9/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122827 122827] (12/7)<br /> | ||
+ | Sno+ asking for more disk, this has developed into some discussion and Alastair has added a few more points. Matt M has expanded on the Sno+ needs, and has decided to make more use of Tier 2 space for Sno+, which many or may not keep them ticking over until Echo is upon is. In progress (24/8) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=120350 120350] (22/3)<br /> | ||
+ | Enabling LSST at RAL. "Proper" test jobs are failed, Alessandra has put the ticket on hold until the issue can be debugged properly. | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123504 123504] (19/8)<br /> | ||
+ | Jon Perkins noticed that the WMSes at RAL didn't seem to be updating proxies at Sheffield for some long T2K jobs. A conversation was started but seemed to have stalled, it seemed some weird resource matching errors were going on. The landscape might have changed in the last 3 weeks however. In progress (23/8) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122364 122364] (27/6)<br /> | ||
+ | cvmfs support for solidexperiment.org. After some solid progress the ticket is on hold waiting for someone VO side to try to roll out some experiment software in anger. On hold (24/8) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=117683 117683] (18/11/2015)<br /> | ||
+ | Developing glue2 support for Castor. Any update will do?! On hold (5/4) | ||
+ | |||
+ | '''NGI'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=119995 119995] (7/3)<br /> | ||
+ | The culling of the uncertified sites. The old Glasgow Pre-production service was mentioned too. In progress (23/8) | ||
+ | |||
+ | '''So long EFDA-JET'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122198 122198] (17/6)<br /> | ||
+ | The jet decommissioning ticket. The other related ticket (123291) was closed as I wrote this report, so it won't be long until this ticket should be closed. Bye Jet! On Hold (1/9) | ||
<!-- ******************Edit stop********************* -----> | <!-- ******************Edit stop********************* -----> |
Revision as of 16:06, 12 September 2016
Week commencing Monday 12th September 2016 |
Task Areas |
|
|
Meeting Summaries |
|
|