Monday 1st December 2014, 14.30 GMT
34 Open UK Tickets this month. Quite a few of them are from Duncan, asking sites to please reinstall their perfsonar hosts.
Simon F ticketed the CA concerning a possible problem with the ticket reminder system. JK has responded with a reply, and asked that similar tickets in the future use the helpdesk at firstname.lastname@example.org rather then GGUS (and definitely don't use both!). He's looking into it at his end, and has asked Simon to check the spam filters. Assigned (should be In Progress?) (1/12) Update - in progress now, and Jens has been roped in to the ticket as well - there was a problem after all (see JK's email).
Duncan has reminded Matt RB to reinstall his Perfsonar with the latest release. Matt reckons he'll get to this the first half of this week. Nothing more to say. In Progress (26/11)
Another perfsonar ticket, Bristol's perfsonar seems ill, but Duncan gave the URL for the Sheffield perfsonar. Probably just a copy and paste error when he wrote the ticket though. In progress (26/11) Update - Winnie confirms that the perfsonar has been reinstalled, poked and prodded. Things are still off with the box, and the site firewall admins are being consulted - but if it isn't a firewall problem Winnie would appreciate assistance debugging the problem.
CMS pilots losing connection at Bristol. The Bristol admins are still looking at this, and the problems are still happening. They've asked some questions (which likely will need a ticket status switch), and have tried disabling IPv6 on their workers for the time being to cross another factor off the list. On Hold (27/11)
Duncan has also asked Birmingham to update their perfsonar boxen- no reply from Matt or Mark yet. Maybe they missed the ticket. Assigned (26/11)
Another request to upgrade perfsonar boxes. Gareth has replied, hopefully it'll get done this week. In progress (26/11)
The Edinburgh "please upgrade your Perfsonar" ticket. Wahid has replied with the ECDF stance on perfsonar, and put the ticket On Hold. On Hold (26/11)
ECDF's glexec tarball ticket. Same position as last month I'm afraid. On Hold (29/8)
Durham's perfsonar results going just plain weird. The Durham chaps have reinstalled their perfsonar, but as expected things are still odd. Hope to test a new routing arrangement later this week. Is that still on course? On hold (12/11)
Atlas have ticketed Manchester about the same issue again (see 110366), which boils down to lost files not being able to be declared lost due to the rucio migration. Not much that can be done Manchester side until the file deletion service is back up at full swing- On Hold the ticket? In progress (1/12)
A ticket for the voms service host at Manchester, detailing the change in VO manager for vo.helios-vo.eu. Bit of confusion with the new VO manager's certificate to be used for this, this ticket might need some shepherding, perhaps even On Holding if it gets too close to Christmas. In Progress (21/11)
Atlas have noticed that the Liverpool DPM has some kind of webdav access problem, browsing works but downloads didn't. This was on purpose as a security, but John enabled http access offsite from the disk nodes. There was some discussion in the ticket about http/https access within DPM, but I suspect this ticket is done unless these points need to be thrashed out a bit. In progress (26/11)
I upgraded my DPM to 1.8.9, and all I got was this ticket! Lancaster's failing the second half of the getTURL test due to what I believe is an incompatibility with the latest DPM version and the SAM tests (and I wasn't rolling back to pass nagios tests!). Waiting on a new set of tests to be rolled out. On Hold (1/12)
Lancaster's bad perfsonar performance ticket. No win after upgrading to the latest perfsonar, hope to run some other tests in the pre-Christmas quiet period.
Lancaster's glexec tarball ticket. No news - my hope is to work on this in the two week per-Christmas quiet period, same as our perfsonar problem. On Hold (14/11)
Atlas have noticed transfer problems to UCL. Ben is trying to investigate, and Wahid is lending a hand. In Progress (28/11)
UCL's "please reinstall your perfsonar" ticket. In progress (26/11)
Nagios ticket for UCL, concerning glexec test failures. Ben has replied that he is trying to debug their glexec installation. In progress (28/11)
UCL's glexec ticket. Ben's working on it, but the site got hit by problems last week. In progress (24/11)
Another atlas httpd access ticket, although this one is quite different from the Liverpool one as it appears they are trying from within a job. I don't think this has been noticed by the QM chaps yet. Assigned (25/11) Update - In Progress now, Dan's checking if https should be working. Elena has involved uk cloud support.
The not-really-a-QM problem snoplus/suse/srmcp ticket. We discussed how to handle this last week, but no news - it seems we're waiting for Matt M to re-engage? Waiting for reply (20/11)
Brunel's "please reinstall your perfsonar" ticket. Raul is on it. In progress (26/11)
The Jet LHCB job failure ticket. If ever there was a candidate for setting a ticket to unsolved, this is it. On Hold (1/10)
Our commercial cloud site's vmcatcher ticket. After Owen's help it looks like things are on the up, but the images still aren't being published. An interesting link was posted with the instructions how to do that. In progress (28/11)
THE TIER 1
CMS Pilots losing connectivity at RAL, sister to the Bristol ticket. Not much news, but Andrew L has a plan to discuss the problem with the HTCondor devs at CERN when he's there. On Hold (27/11)
Sno+ not being able to copy files out of RAL with the gfal tools. It appears to be a non-snoplus specific gfal problem. Perhaps an install problem with wrong versions of gfal2-utils? Andrew L is going to contact the gfal2 devs for help. On hold (26/11)
Inconsistant published BDII/SRM storage numbers. Has been discussed recently in the Ops meeting, a conversation is ongoing with the Castor devs about this, but there wasn't much noise from them at last check. The ticket could do with a mini-update, even if it's "nothing to see here, move along". On Hold (3/11)
Some CMS users having trouble with the RAL FTS REST web interface. Everything seems to be fixed now, so it looks like this ticket can be closed. In progress (27/11)
Duncan has ticketed the Tier 1 regarding not being able to access the LFC via his browser. Catalin confirmed that the problem was occurring for him for his non-dteam identities. Things seem to be working for Chris though. How goes it? In progress (27/11)
CMS glexec errors at the Tier 1. Andrew is back on the case, but needs to test things out first before rolling them out. In progress (27/11)
Another CMS ticket, this time AAA tests failing at RAL. Andrew L asked for the testing scripts so that RAL can test themselves - Duncan provided a link that will help point the way. In progress (26/11)
And the last ticket, the Tier 1's "please upgrade your perfsonar" ticket. In progress (26/11)