Difference between revisions of "Operations Bulletin Latest"
Line 395: | Line 395: | ||
===== ===== | ===== ===== | ||
<!-- ******************Edit start********************* -----> | <!-- ******************Edit start********************* -----> | ||
+ | '''Monday 2nd October 2017, 15.00 BST'''<br /> | ||
+ | 23 Open UK Tickets this month. | ||
+ | '''SUSSEX'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=122772 122772] (11/7/16)<br /> | ||
+ | Sussex have just the one ticket: the ATLAS xroot/httpd one. Last word was that Leo had contacted Dan at QM for help. In progress (5/9) | ||
+ | '''RALPP'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130264 130264] (28/8)<br /> | ||
+ | Biomed CE publishing ticket - Chris is waiting on the corresponding Brunel ticket ([https://ggus.eu/?mode=ticket_info&ticket_id=130263 130263]) to see how the fix offered works out. In progress (26/9) | ||
+ | '''OXFORD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130032 130032] (11/8)<br /> | ||
+ | LHCB jobs failing on upload. The ticket occurred at a time of multiple SE troubles, and there's not been any news throughout September so first port of call will be to see if the problems persist. In progress (23/8) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130173 130173] (22/8)<br /> | ||
+ | A ticket from Duncan about incomplete perfsonar results, with the likely solution being to reinstall to CentOS7. Kashif is on it. In progress (26/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=129931 129931] (4/8)<br /> | ||
+ | http SAM tests failing - for unknown reasons. Kashif hopes an update of the headnode will fix things. On hold (19/9) | ||
+ | |||
+ | '''CAMBRIDGE'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130787 130787] (28/9)<br /> | ||
+ | LHCB pilots failing at Cambridge. It looks like lhcb jobs are failing due to hitting a CPU time limit, although no changes have been made site side to break things. John proposed increasing the CPU limits. Waiting for reply (28/9) | ||
+ | |||
+ | '''BRISTOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130646 130646] (20/9)<br /> | ||
+ | Low CMS xroot HC rates. After some clarification on the problem Lukasz is looking at the xroot logs. Did you see anything? In progress (26/9) | ||
+ | |||
+ | '''BIRMINGHAM'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=129930 129930] (4/8)<br /> | ||
+ | The Birmingham version of the Oxford http SAM test ticket. Although symptoms are slightly different it's equally hard to debug. Any news, or perhaps we can try to rally the troops for another bash at helping? In progress (16/8) | ||
+ | |||
+ | '''MANCHESTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130868 130868] (2/10)<br /> | ||
+ | A fresh ROD CE submission test failure ticket. Assigned (2/10) | ||
+ | |||
+ | '''LIVERPOOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130518 130518] (12/9)<br /> | ||
+ | One of those ROD availability tickets we all loathe. Steve kept us all in the loop, and hopefully things will go green soon. On hold (2/10) | ||
+ | |||
+ | '''LANCASTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130753 130753] (26/9)<br /> | ||
+ | Setting up na62 at Lancaster. Things seem to be working for them after the usual back-and-forth, and we just need some more jobs to flow. On hold (26/9) | ||
+ | |||
+ | '''QMUL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130262 130262] (28/8)<br /> | ||
+ | Another biomed publishing ticket, although this one is aimed at the SE. After some feedback from Biomed it looks like it's the glue2 bit that's broken. In progress (26/9) | ||
+ | |||
+ | '''IMPERIAL''' (well, Dirac)<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130202 130202] (24/8)<br /> | ||
+ | Not really an IC ticket, but one to all na62 sites - about na62 jobs waiting too long. Dan included a useful [https://na62.gla.ac.uk/index.php?task=stats&view=sitehealth link] in the ticket - there's feedback from RAL in the ticket too. Is it just fairshare (fairly) stopping na62 jobs running as quickly as they'd like? In progress (27/9) | ||
+ | |||
+ | '''BRUNEL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130742 130742] (26/9)<br /> | ||
+ | LHCB noticing pilots failing at Brunel - Raul points out that the CE is being replaced so the problems should be going away. In progress (26/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130263 130263] (28/8)<br /> | ||
+ | Biomed ticket about negative running jobs being published at Brunel - the ARC devs are involved and Raul has kindly offered to test what they have. Any news from them? In progress (13/9) | ||
+ | |||
+ | '''TIER 1'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=128991 128991] (16/6)<br /> | ||
+ | Solidexperiment.org tape support. The Castor tape is ready for testing, just waiting on word from the VO (aka Janusz). But Janusz is busy in a control room somewhere... Waiting for reply (13/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130467 130467] (10/9)<br /> | ||
+ | CMS SAM tests failing at RAL, due to a lack of space on CASTOR. Chris has identified a bunch of dark data and set about purging it, ready for another round of consistency checks. In progress (25/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130782 130782] (27/9)<br /> | ||
+ | A request from lhcb to deploy the latest version of heposlibs (which contains a dependency on git). The request has been passed along at the Tier 1. In progress (28/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130193 130193] (23/8)<br /> | ||
+ | CMS staging of files taking too long from RAL tape, possibly due to a bunch of corrupt files (although manual copies of some problem files are working). George has asked CMS to try again and, if problems persist, to send a list of dodgy files. Waiting for reply (29/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=130207 130207] (24/8)<br /> | ||
+ | MICE seeing timeouts copying to CASTOR. Gareth provided an in-depth update in the ticket, which is being kept open whilst the GenTape disk pool is upgraded. In progress (6/9) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=127597 127597] (7/4)<br /> | ||
+ | A CMS ticket to check networking and xroot performance, held up for a while waiting on the RAL networking team. There is some recent movement to repoke the networkers. On hold (2/10) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=124876 124876] (7/11/16)<br /> | ||
+ | ECHO SAM tests failing due to a problem with the tests - no movement on the counter-ticket ([https://ggus.eu/?mode=ticket_info&ticket_id=125026 125026]) since April - it likely needs a kick. On hold (1/1) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=117683 117683] (18/11/15)<br /> | ||
+ | Getting GLUE 2 working for CASTOR. Intermittently worked on as time allows. The ticket could do with an update this quarter. On hold (6/7) | ||
<!-- ******************Edit stop********************* -----> | <!-- ******************Edit stop********************* -----> | ||
|} | |} |
Revision as of 15:28, 2 October 2017
Week commencing Monday 25th September 2017 |
Task Areas |
|
|
Meeting Summaries |
|
|