Monday 24th April 2017, 14.30 GMT
36 Open UK tickets this week.
NGI
127588 (7/4)
As mentioned before Easter, this is a yearly review of the information in the gocdb. The deadline for this is Friday. Perhaps a swift wiki-table is in order to check the status? The really important security contact information has been check this year thanks to the security challenge, so we're good on one front at least. In progress (10/4)
126808 (24/2)
WMS usage ticket - not sure where we want to leave this and I don't think much will happen on it for a while. In progress (20/3)
SUSSEX
122772 (11/6/16)
Atlas webdav/xroot access ticket. I understand thanks to Dan's efforts there was some pre-gridpp progress on this. Can we have what was achieved "in writing" please! On Hold (6/3)
127767 (18/4)
ROD availability ticket from last week - not noticed by the site yet by the looks of it. Assigned (18/4)
127768 (18/4)
The likely cause of the low availability, a ROD "out of date CA test" ticket. Thanks to Daniela for providing some wisdom. In Progress (24/4)
125503 (9/12/16)
Sno+ file access ticket. A strategy was developed a while ago to try to tackle this, any further plans/problems? In progress (30/1)
RALPP
127555 (7/4)
A availability ticket due to the arc monitoring shenanigans. Chris has on held it as per the tradition. On hold (7/4)
OXFORD
127778 (19/4)
A CMS ticket, but I'm not sure it should have been assigned to the site. Some jobs that will never go anywhere somehow ended up at Oxford. Not the site's problem, but the conversation about Oxford's status within CMS production should be interesting. In progress (19/4)
BRISTOL
126865 (28/2)
A CMS related ticket from Daniela, regarding ipv6 phedex transfers from Bristol's SE. There's a question left hanging on the ticket regarding the IPv6 status of the cern gfal2 tools. Anyone have any knowledge about this? In progress (31/3)
127783 (19/4)
Some CMS sam test failures have re-cropped up. Although the link shows all green (or at least sickly yellowy-green) for me so perhaps this can be closed again? Reopened (21/4) Update - solved again, likely a problem with the SAM tests occasionally not running soon enough.
126864 (28/2)
Request to enable LZ at Bristol. How this progressing? In progress (31/3)
BIRMINGHAM
127319 (27/3)
Another Availability ticket, checking the argo link the Easter period has been full of Unknown's for Birmingham, so I didn't on hold the ticket yet. In progress (In progress (3/3)
GLASGOW
124052 (25/9/16)
The cursed ARC job publishing ticket- Gareth braved the bad mojo haunting this ticket to provide an update, hoping to be free of it by the end of May. On hold (4/4)
DURHAM
127832 (21/4)
LHCB job submission to Durham seems to be failing. Hope it's not all gone horribly wrong. Assigned (21/4) Update - In progress, Oliver notes some ldap problems causing trouble, but these are being fixed.
SHEFFIELD
127766 (18/4)
Another ROD availability ticket, tests have turned green (after the website was fixed) so just needs to be waited out. On hold (19/4)
MANCHESTER
127644 (10/4)
After a brief config mishap some icecube GPU jobs ran on some non-GPU nodes. Alessandra pounced on the issue, and asks if the ticket can be closed. Waiting for reply (20/4)
QMUL
126261 (30/1)
One of the QM ces not working for biomed, at last check the problem persists- probably due to the other woes befalling ce04. In progress (4/4)
127445 (1/4)
A biomed ticket for the other CE, which is also still not working for biomed. In progress (24/4)
126650 (15/2)
cern@school pilot problems. The initial problem was fixed, but one of the CEs is still playing up after getting into a bad state running out of disk. In progress (19/4)
127551 (6/4)
A Sno+ ticket, where Sno+ jobs were having problems competing with atlas for disk space on the nodes - compounded by jobs not cleaning up properly. Not limited to QM (Lancaster had the same issue), but these issues seem to have passed - for now. In progress (19/4)
127352 (28/3)
An Icecube ticket regarding job problems on a GPU node. Dan is taking this node out to use for centos7 testing, which the VO was happy with. This ticket is in limbo now, either needing on holding or solving. In progress (31/3)
127144 (15/3)
LHCB having trouble with ce04 also - with the CE's recent problems I don't know if things would be looking better on this front? Waiting fore reply (31/3)
BRUNEL
127518 (5/4)
The last of the CMS tickets asking to remove rfio from site's storage.xml. Raul will deal with this when he's back in the UK, but thanks Daniela for the kind offer of help/being a scapegoat. Are you back yet Raul? In progress (12/4)
127117 (13/3)
A CMS request to upgrade the spacemon client, which Raul didn't get round to before his hols. Could do with an update when you can get to it. In progress (14/3) Update - Raul is trying to tackle this, but has asked for some help as the DPM instructions seem quite atlas specific.
THE IMPERIAL CLOUD (which is sadly not a Star Wars Super-Weapon)
127620 (9/4)
David from snoplus has noticed jobs landing and failing on the Imperial Cloud, and has asked what the heck it is (but he asked more politely). Simon has spotted the problems and disabled the cloud site so it won't eat more jobs, hoping to fix it this week. On Hold (12/4)
100IT have 2 tickets:
127827
127539
But they're being handled fine.
THE TIER ONE
127597 (7/4)
A request from CMS to test xrootd and networking performance at RAL after noticing a large drop in job efficiency when access data offsite. Andrew L spotted that this was likely due to using "lazy download", and this is being removed from across the WNs but it is noted that this is needed for CEPH... In progress (12/4)
127240 (21/3)
Another CMS ticket, for a Run2 staging test. Tests were done, Sebastian from CMS has asked for access to some site monitoring plots so they can compare what they see to the "real values". In progress (12/4)
126905 (2/3)
Finishing off commissioning the solidexperiment.org cvmfs server. Just needs one last check user-side to make sure that the statum replication took and job's a good'un. Waiting for reply (21/4)
127388 (29/3)
An lhcb user is having trouble accessing files at RAL. The user has provided details of how they are trying to access the files using root - it's been donkey's years since I've mucked about with root - is the problem simply that castor won't accept xroot connections like this? In progress (20/4)
127612 (8/4)
An LHCB ticket where all the RAL CEs rejected LHCB jobs. There were some issues for a while, but things petered out over Easter. Any news? Hopefully all is well. In progress (12/4)
127598 (7/4)
A CMS ticket from Chris, regarding a cunning plan that was likely set in motion at GridPP38 - the setting up of a UK XrootD Redirector at RAL to pair with the one at Imperial. Likely stalled due to Easter, but Simon's added some notes on their config changes. In progress (19/4)
124876 (7/11/16)
ROD gridftp tests failing for CEPH, due to a problem with the tests. Alaistair poked the ticket to fix the tests (https://www.ggus.org/index.php?mode=ticket_info&ticket_id=125026) - these fixes haven't been implemented yet. On hold (1/1)
117683 (18/11/15)
Castor Glue2 publishing. As expected slow progress here due to the lack of effort available, but the last update was promising. On hold (2/3)
|