Monday 3rd April 2017, 14.30 BST
30 UK tickets this month
SUSSEX
122772 (11/7/16)
Atlas webdav/xroot ticket. Any luck, or would you like a hand at GridPP this week? On hold (26/1)
125503 (9/12/16)
Sno+ ticket about file access problems due to a wrong SE name in the LFC. Any word on this too? I think a plan was put in place. In progress (30/1)
RALPP
126902 (2/3)
CMS ticket, I got a bit lost trying to follow it but a moot point as CMS indicate it can be closed. In progress (3/4)
BRISTOL
126864 (28/2)
Request to enable LZ, Daniela has provided the requested information. In progress (31/3) Update - solved
126865 (28/2)
A CMS ticket from Daniela, concerning ipv6 transfer failures to/from Bristol. Things were looking better, although there is an outstanding question that Winnie highlighted about the CERN setup that perhaps Duncan or someone could answer? In progress (31/3)
BIRMINGHAM
127319 (27/3)
A low-availability ticket. Whilst these are boring it needs to be tended (i.e. put In Progress or On Hold). Assigned (27/3) In progress - Mark cites a misbehaving DHCP server causing hassle.
GLASGOW
124052 (25/9)
LHCB ticket concerning incorrect job publishing, to be fixed in the next generation of ARC CEs deployed at Glasgow. Sadly the time has come for another update, even if it's a totally dry one. On Hold (31/1)
127160 (16/3)
An availability ticket. Nothing more to say then that. On hold (16/3)
SHEFFIELD
127210 (19/3)
Atlas transfer timeout failures. After coming out of downtime failures persist. Perhaps a similar problem to what we saw at Lancaster last week? As per the post to the storage list those issues were apparently soothed by increasing the DPM threads. In progress (3/4)
MANCHESTER
127464 (3/4)
A very fresh atlas deletion error ticket. In progress (3/4)
127384 (29/3)
LSST authorisation failure ticket. Alessandra has tracked down hopefully all the config errors that crept in during the move from svn to git. Hopefully this is nearly sorted. In progress (31/3)
LIVERPOOL
124819 (3/11/16)
AFS ticket. After the firewall ports were opened the submitter provided some feedback, but no news back from the site. Perhaps just put this ticket out of its misery (like what will soonish happen for AFS itself)? In progress (13/2)
127353 (28/3)
Steve bravely rolled out a small Centos7 test cluster and Sno+ job accidentally landed on it - they kept it that way to test things out but sadly it looks like their tests failed and have asked for their jobs to not land on the test cluster anymore. In progress (2/4)
126956 (6/3)
Availability ticket due to the annoying ARC monitoring issues. On hold (27/3)
QMUL
127352 (28/3)
Icecube jobs failing on a QM GPU node - the likely cause has been spotted (old AMD libs sitting on the system with a new nvidia card in it) but it might be a little while till this is fixed. Dan has proposed using this as an opportunity to roll out a Centos7 test node which Icecube were okay with. In progress (31/3)
127144 (15/3)
LHCB saw problems with ce04, which Dan reckons were caused by load and has asked if there are still problems. Waiting for reply (31/3)
126261 (30/1)
A biomed ticket for ce04, although they rechecked if this was still a problem during the aforementioned load problems. There seems to be other errors too though- maybe related to the biomed infrastructure? In progress (31/3)
126650 (15/2)
cern@school errors due to a misconfig in the VO usernames (slurm only does lowercase usernames!). Dan has rolled out the new users and Daniela has rolled out some tests jobs. In progress (31/3)
127445 (1/4)
Another biomed submission error ticket, I'm not sure if this is a duplicate of 126261. It looks like a similar error (on ce5 this time though). Assigned (1/4)
BRUNEL
127117 (13/3)
A request from CMS to upgrade the spacemon client. Raul was on it. Any luck with this? Although I've just remembered that Raul is in a different hemisphere so that question might fall on a deaf inbox. In progress (14/3)
127126 (14/3)
Availability ticket, again by the looks of it due to the ARC monitoring playing up. On hold (27/3)
TIER 1
127251 (21/3)
A ticket from an atlas user concerning transfers into castor have trouble and some errors the user is seeing. John has requested more information as the files themselves seem present and correct, but someone who has some idea as to what the error messages listed by the submitter mean would be handy. Waiting for reply (27/3) Update - closed as likely a problem with the user's code.
127449 (2/4)
One of the RAL ARCs wasn't working well for LHCB - but the problems appear to have passed and the ticket can be closed now. In progress (3/4)
126905 (2/3)
CVMFS commissioning for the SOLID experiment. With effort from Daniela and Catalin things all look to be working for solid now with /cvmfs/solidexperiment.egi.eu exported nicely and uploadable to by the VO. Looks like another ticket can be closed. Waiting for reply (29/3) Update - before it gets closed there has been a request for some extra information from Catalin.
127388 (29/3)
LHCB troubles accessing some files at RAL. Have these issues passed with the other castor problems from the weekend? In progress (3/4)
127240 (21/3)
CMS request to run staging tests in prep for Run 2. There was a request from CMS for access to some monitoring plots, I assume for the transfer rates between buffers, but it wasn't very clear. In progress (27/3)
126184 (26/1)
Atlas request for site monitoring input. Alessandra went over this in last week's atlas uk meeting. It's not too late to have your say in the google docs. In progress (7/2)
124876 (7/11)
ROD ticket concerning tests to the RAL echo instance. Alastair's counter ticket (ticket 125026) hasn't had an update since last year - I think it needs a kick. On Hold (1/1)
117683 (18/11/15)
Castor Glue 2 publishing. Rob reported some good progress. On Hold (2/3)
NGI
126808 (24/2)
WMS usage ticket - mainly involving Imperial and the Tier 1. There was some worry from Daniela regarding the closure of old WMS tickets due to it being "no longer supported", but there were reassurances that security bugs would be fixed. Are you feeling reassured? In progress (20/3)
|