Difference between revisions of "Operations Bulletin Latest"
(→) |
(→) |
||
Line 453: | Line 453: | ||
===== ===== | ===== ===== | ||
<!-- ******************Edit start********************* -----> | <!-- ******************Edit start********************* -----> | ||
+ | '''Monday 3rd October 2016, 15.15 BST'''<br /> | ||
+ | 31 Open UK tickets this month. | ||
+ | '''SUSSEX'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122614 122614] (6/7)<br /> | ||
+ | An NGI ticket concerning the availability problems at Sussex, afaics this is just waiting on a month or so smooth running so we can sign off on the time of troubles. On Hold (19/9) | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123740 123740] (6/9)<br /> | ||
+ | A common or garden ROD availability ticket, on hold as is the SOP. Looks like things are quite green down Brighton way, so hopefully this can be closed soon. On Hold (20/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122772 122772](11/7)<br /> | ||
+ | Atlas ticket asking about webdav and xroot endpoints. On Hold until Sussex get an admin to do this - if it becomes a problem before then we will need to offer assistance. (By we I mean someone with a STORM who has a clue about what's going on...). On hold (26/7) | ||
+ | |||
+ | '''OXFORD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=121924 121924] (2/6)<br /> | ||
+ | Duncan spotted a drop in perfsonar rates at Oxford. Put on hold due to staff shortages, this ticket could do with an update (even a null one). On Hold (10/8) | ||
+ | |||
+ | '''BRISTOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124051 124051] (25/9)<br /> | ||
+ | An lhcb job submission problem at Bristol. Some cunning investigation traced the problem to a change of default client in the v5.1 arc tools (gridftp to a-rex), so Winnie and Lukasz are asking for the new ports to be open. They are however worried that they will need to upgrade their CE to match the major version of the arc tools for things to work smoothly. In progress (27/9) | ||
+ | |||
+ | '''BIRMINGHAM'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122771 122771] (11/7)<br /> | ||
+ | Another atlas xrootd/webdav deployment ticket. At last update Matt was in position to start rolling out these changes - any news? In progress (20/9) | ||
+ | |||
+ | '''GLASGOW'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124052 124052] (25/9)<br /> | ||
+ | A ticket from lhcb about the much reported on ARC CE job publishing problems. Discussed last week, but I'm always up for more discussion! On hold (26/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=120351 120351] (22/3)<br /> | ||
+ | Enabling LSST. Gareth has rolled out support to one of the Glasgow ARC CEs and is ready for testing (the CE for LSST, not Gareth himself). Proper job. Waiting for reply (28/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122378 122378] (28/6)<br /> | ||
+ | Glasgow's perfsonars being out of commission. Rebuild ETA was in the next week or so, still on course for that? On Hold (19/9) | ||
+ | |||
+ | '''EDINBURGH'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123996 123996] (20/9)<br /> | ||
+ | LHCB jobs being murdered by the ECDF batch system, found due to them being submitted without a default wall time. Marcus made the batch system less murdery and pilots have stopped being killed, so it looks like this ticket can be closed. In progress (23/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123732 123732] (5/9)<br /> | ||
+ | ce5 and ce7 (both creams) throwing up ROD alarms, it looks like their state needs to be reviewed. Here's a friendly nudge to review it. Hint hint. On hold (20/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122653 122653] (7/7)<br /> | ||
+ | The other ROD ticket, regarding the archer facing CE. I don't think this has had any progress on it. Is it causing (more of) a problem for the ROD yet? Waiting for reply (probably should be On Hold instead) (26/7) | ||
+ | |||
+ | '''SHEFFIELD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124003 124003] (20/9)<br /> | ||
+ | Atlas transfer problems to Sheffield - Elena is ongoing a server rebalancing exercise to smooth this out but this will take time. On Hold (28/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124036 124036] (23/9)<br /> | ||
+ | An expired argus certificate caused Sheffield to get a ROD availability ticket, but it's being handled and on-holded so all's well. On hold (29/9) | ||
+ | |||
+ | '''MANCHESTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123870 123870] (13/9)<br /> | ||
+ | Duncan ticketed over poor perfsonar throughput results to Manchester. Marked in progress, but any news in the investigation? In progress (14/9) | ||
+ | |||
+ | '''LIVERPOOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123962 123962] (19/9)<br /> | ||
+ | As discussed over the last few weeks, Biomed had trouble using the Liverpool SE as the shared area had filled up. John helpfully gave a quick introduction to space tokens to help them out, but no news from biomed since. A re-poke is likely in order. In progress (20/9) | ||
+ | |||
+ | '''IMPERIAL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123959 123959] (18/9)<br /> | ||
+ | Sno+ DIRAC jobs failing without logs, likely due to being killed when exceeding memory limits. No update on this for a while - but then it will probably need to be looked at in a different light if we no longer have a Matt M. In progress (19/9) | ||
+ | |||
+ | '''BRUNEL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123947 123947] (16/9)<br /> | ||
+ | CMS asking Brunel to do some investigation into some issues they see at other sites. I think some conclusions have been made but I'm not sure what they are! The ticket is very long - a testament to the effort put in. In progress (30/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124153 124153] (29/9)<br /> | ||
+ | A very fresh ROD SRM-Put ticket. Assigned (29/9) | ||
+ | |||
+ | '''100IT'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123753 123753] (6/9)<br /> | ||
+ | I think this ticket should be closed, I'll prod it to make sure, don't want them cluttering up our GGUS! In progress (19/9) | ||
+ | |||
+ | '''THE TIER 1'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124183 124183] (2/10)<br /> | ||
+ | An "ALARM" ticket from lhcb, with problems seen copying from the RAL WNs to the RAL BUFFER. Looks like the problem was solved before Sunday teatime, so I think this ticket can be closed. In progress (2/10) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124188 124188] (3/10)<br /> | ||
+ | atlas reckoned a frontier squid at RAL is down. The picture looks confusing, and perhaps the ticket should be waiting for reply so atlas double check their results. In progress (3/10) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=120350 120350] (22/3)<br /> | ||
+ | Enabling LSST at RAL. Test jobs failed, but in-depth debugging has yet to occur - on hold till then (probably until after San Franscisco). On hold (12/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122827 122827] (12/7)<br /> | ||
+ | Sno+ requesting more disk space. There has been some discussion, but as with the Imperial ticket this will need to be reviewed. Waiting for reply (probably should be On Hold instead really) (19/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=123504 123504] (19/8)<br /> | ||
+ | T2K proxy problems between the RAL WMS and Sheffield. Another ticket that might be in an orphaned state after recent news of people leaving. Waiting for reply (20/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122364 122364] (27/6)<br /> | ||
+ | cvmfs support for the solidexperiment.org VO. On hold awaiting the VO to gain some traction, any signs of solid progress? No rush if there isn't... yet! On Hold (24/8) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=121687 121687] (20/5)<br /> | ||
+ | Packet loss on the RAL perfsonar, which was due an update once the alloted time has passed and a key bit of routing kit was replaced (as John pointed out in the mini-update on Friday). On Hold (30/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=117683 117683] (18/11/15)<br /> | ||
+ | glue2 publishing for castor. Really, really, really could do with an update - even a null one! On hold (17/2) | ||
+ | |||
+ | '''NGI'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=119995 119995] (7/3)<br /> | ||
+ | Culling the uncertified (sites). UKI-ScotGrid-Gla-PPS was confirmed no longer needed by people at Glasgow, so this is nearly done. In progress (19/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122198 122198] (17/6)<br /> | ||
+ | Decommissioning JET ticket, just waiting for 90 days before the site can be officially removed. On hold (19/9) | ||
<!-- ******************Edit stop********************* -----> | <!-- ******************Edit stop********************* -----> |
Revision as of 15:52, 3 October 2016
Week commencing Monday 3rd October 2016 |
Task Areas |
|
|
Meeting Summaries |
|
|