Monday 12th August 2013, 14.00 BST</br>
55 Open UK tickets this week. Let's take a couple of deep breathes, then go through them all. Yep, all of them!
NGI tickets:
Unresponsive VOs. (5/7):</br>
https://ggus.eu/ws/ticket_info.php?ticket=95442</br>
Master Ticket, on hold.</br>
https://ggus.eu/ws/ticket_info.php?ticket=95473 </br>
The gridpp ticket, Jeremy is onto this, hoping to wrap it up soon. In progress (12/8) SOLVED</br>
https://ggus.eu/ws/ticket_info.php?ticket=95472</br>
minos ticket. The state of the minos VO is still unknown, but suspected to be defunct. This was set in progress by a ticket manager, although technically it still doesn't have a home. In progress (26/7)</br>
https://ggus.eu/ws/ticket_info.php?ticket=95469</br>
Supernemo ticket. Gianfranco has confirmed it's "not his problem" anymore, and given a few names to try to contact at UCL. In progress (29/7)
NGS decommissioning.</br>
https://ggus.eu/ws/ticket_info.php?ticket=95833 ral-ngs2</br>
https://ggus.eu/ws/ticket_info.php?ticket=96141 oxford-ngs2 </br>
https://ggus.eu/ws/ticket_info.php?ticket=96128 manchester-ngs2</br>
https://ggus.eu/ws/ticket_info.php?ticket=96538 NGS-SHEFFIELD</br>
Nothing to see here really, on hold until JK back from leave) and it doesn't really affect us.
Other NGI tickets:</br>
https://ggus.eu/ws/ticket_info.php?ticket=94780</br>
Cloud Site Creation request. The NGI has been asked for an update, JK has asked others for feedback but is currently on leave. In progress (probably should be on hold if JK isn't back for a while) (5/8)
gLExec tickets. (1/7):</br>
SUSSEX https://ggus.eu/ws/ticket_info.php?ticket=95309 Some progress. On hold (23/7)</br>
CAMBRIDGE https://ggus.eu/ws/ticket_info.php?ticket=95306 Get to it in late summer. On hold (9/7)</br>
BRISTOL https://ggus.eu/ws/ticket_info.php?ticket=95305 After the current work-pile is conquered. Also as an aside, going for an arc ce? Interesting. On hold (11/7)</br>
BIRMINGHAM https://ggus.eu/ws/ticket_info.php?ticket=95304 Aim to do it in ~August, along with other upgrades. On hold (9/7)</br>
ECDF https://ggus.eu/ws/ticket_info.php?ticket=95303 On hold. (1/7)</br>
DURHAM https://ggus.eu/ws/ticket_info.php?ticket=95302 Some progress made, but things stalled. Should be on held if things don't pick up again. In progress (8/8)</br>
SHEFFIELD https://ggus.eu/ws/ticket_info.php?ticket=95301 On hold (10/7)</br>
MANCHESTER https://ggus.eu/ws/ticket_info.php?ticket=95300 Will do it in October upgrade. On hold (1/7)</br>
LANCASTER https://ggus.eu/ws/ticket_info.php?ticket=95299 Trying to get it to work on the tarball. Not having much luck. On hold (17/7)</br>
UCL https://ggus.eu/ws/ticket_info.php?ticket=95298 Won't start until end of August. On Hold (29/7)</br>
RHUL https://ggus.eu/ws/ticket_info.php?ticket=95297 Another for the end of August. On hold (16/7)</br>
QMUL https://ggus.eu/ws/ticket_info.php?ticket=95296 Almost there, just need to roll out SL6 to all their nodes. On hold (12/8)</br>
EFDA-JET https://ggus.eu/ws/ticket_info.php?ticket=95295 Some confusion over Jet's status was had. Otherwise waiting until later to deploy this. On Hold (19/7)
SHA-2 (22/7)</br>
ECDF https://ggus.eu/ws/ticket_info.php?ticket=96002 On hold (23/7)</br>
DURHAM https://ggus.eu/ws/ticket_info.php?ticket=96001 Will upgrade in September. On hold (31/7)</br>
MANCHESTER https://ggus.eu/ws/ticket_info.php?ticket=96081 Again in the October upgrade. On hold (23/7)</br>
LANCASTER https://ggus.eu/ws/ticket_info.php?ticket=95999 Will do this week. On hold (12/8)</br>
TIER 1 https://ggus.eu/ws/ticket_info.php?ticket=95996 In Progress, but not much news. Maybe should be On Held? In progress (22/7)</br>
NEW 12/8 RALPP https://ggus.eu/ws/ticket_info.php?ticket=96588 Just assigned yesterday (12/8)
Common or Garden tickets:
SUSSEX</br>
https://ggus.eu/ws/ticket_info.php?ticket=96469 (8/8)</br>
An ops ticket for CREAMCE-JobSubmit failures. Not acknowledged yet. Assigned (8/8) In progress, Emyr reports the BDII config disappeared (auto update accident?).
https://ggus.eu/ws/ticket_info.php?ticket=96470 (8/8)</br>
Another ops ticket, for the SRM-GetSURLs tests. Emyr has posted an explanation for the problems. In progress (9/8)
https://ggus.eu/ws/ticket_info.php?ticket=96556 (10/8)</br>
Another, slightly younger, ops ticket. This test is CREAMCE-CertLifetime. The cert expired 3 days ago (or an old one has snuck back on the server - that's happened to me more then once). Assigned (10/8)
https://ggus.eu/ws/ticket_info.php?ticket=95165 (28/6)</br>
Duncan has asked you to check your perfsonar - which might be being affected by the firewall work mentioned in 96470. But this ticket is looking mighty neglected. On hold - last "proper" update was (1/7)
RALPP</br>
https://ggus.eu/ws/ticket_info.php?ticket=96287 (31/7)
Atlas were seeing timeouts on their deletion service at RALPP. Alaistair noticed correlation between the times for these failures with those at the Tier 1 (96079). Chris asked if the errors were spread evenly or came in bursts - Brian posted some information that to me suggests bursts. In progress (6/8) Update, problem still persists at both T1 and T2
https://ggus.eu/ws/ticket_info.php?ticket=96531 (9/8)</br>
Someone (lhcb? I recognise the submitter's name) has spotted 444444 jobs being advertised at RALPP. No news from the site yet, such is the peril of Friday tickets (especially over the summer). But of course you'll fix that as soon as you read this... Assigned (9/8) Update - not only acknowledge, but solved. lcg-info-dynamic-scheduler-pbs.noarch missing, screwed up dependencies somewhere?
OXFORD</br>
https://ggus.eu/ws/ticket_info.php?ticket=96440 (6/8)</br>
Actually a ticket for the nagios at Oxford. Chris W noticed ops tests making some odd requests, and noted the old ticket 70066 where he spotted atlas doing similar. Kashif is on the case. In progress (7/8)
BRISTOL</br>
https://ggus.eu/ws/ticket_info.php?ticket=96261 (30/7)</br>
A CMS user had trouble writing into a path at Bristol. Lukasz couldn't see anything wrong, and another user has written to the volume without error, so the submitter has been asked if he still sees a problem. No reply yet. Waiting for reply (5/8)
https://ggus.eu/ws/ticket_info.php?ticket=96483 (8/8)</br>
Bristol had some obsolete glue 2 entries in their publishing. The Bristol team are on it. In progress (9/8)
BIRMINGHAM</br>
https://ggus.eu/ws/ticket_info.php?ticket=95418 (4/7)</br>
Alice, what's the matter? They'd like cvmfs installed at Birmingham. Due to the lack of urgency on this change Mark is leaving it until after the other stuff that needs to be done in this Summer of Upgrades. On hold (17/7)
https://ggus.eu/ws/ticket_info.php?ticket=96555 (10/8)</br>
SRM-Put Ops test failures hitting Birmingham. Space has run out, Mark has his shoe horn out to create more but it will take a little while to sort out. In progress (12/8)
https://ggus.eu/ws/ticket_info.php?ticket=96533 (9/8)</br>
LHCB have asked for g++ to be installed at Birmingham. Mark asked if this is urgent, and I think the LHCB reply can be summarised as "yes". In progress (9/8)
GLASGOW</br>
https://ggus.eu/ws/ticket_info.php?ticket=96528 (9/8)</br>
Glasgow also are having 444444 Waiting jobs on some of their shares. Gareth pointed out that the bad CEs are newer EMI ones - cream developers have been involved. In progress (12/8)
https://ggus.eu/ws/ticket_info.php?ticket=96234 (29/7)</br>
Request to support the new HyperK VO on the Glasgow WMS. Glasgow would like to wait until the VO was supported on all the VOMS servers and the Operations Portal. Chris points out that it is supported on all the former. The latter is being a pain (is what I think the implication was). On hold (2/8)
https://ggus.eu/ws/ticket_info.php?ticket=96231 (29/7)</br>
Sno+ have seen a lot of failures from jobs going through one of Glasgow's WMSii. The problem looks to have been ephemeral, but some zombie job clean up was needed. This was the end of July, Sno+ have been asked if they still have a problem. Waiting for reply (8/8)
ECDF</br>
https://ggus.eu/ws/ticket_info.php?ticket=96331 (2/8)</br>
Failing the ApelDN publishing ops tests. Turned out "publishGlobalUserName no" snuck into the new CE configuration. Just waiting for the republishing to soak in. In progress (12/8) Solved now.
DURHAM</br>
https://ggus.eu/ws/ticket_info.php?ticket=96530 (9/8)</br>
Another 444444 waiting jobs ticket. Not acknowledged yet though. Assigned (9/8) Update - In progress
https://ggus.eu/ws/ticket_info.php?ticket=96554 (10/8)</br>
Ops CREAMCE-JobSubmit failures. Assigned (10/8)
MANCHESTER</br>
https://ggus.eu/ws/ticket_info.php?ticket=96582 (12/8)
An atlas user and the UK atlas team have spotted some files that they can't access at Manchester, in ATLASSCRATCHDISK. Assigned (12/8) Update- in progress, machines are back online. Problem with some network kit.
QMUL</br>
https://ggus.eu/ws/ticket_info.php?ticket=94746 (10/6)</br>
Biomed haunting the QM SE's information. Reinstalling the SE didn't kill off the entries, Storm-developers have been called in. On hold (31/7)
BRUNEL</br>
https://ggus.eu/ws/ticket_info.php?ticket=96217 (29/7)</br>
CMS have spotted that the bdii seems to be publishing inconsistent Wall/CPU time (2880 for one, 72 for the other, so one is in minutes,
t'other is in hours). This is a known issue, fixed in EMI-3 (ticket 91859). Reading the ticket Raul doesn't intend to upgrade just to fix this until he's given it a proper testing. As he suggested, it's probably best to On Hold it until then. In progress (6/8)
EFDA-JET</br>
https://ggus.eu/ws/ticket_info.php?ticket=96526 (9/8)</br>
LHCB are seeing some 'certificate verify failed' errors at efda-jet. Not something I've seen before - CA certificate problems maybe? In progress (9/8)
And last but by no means least:
TIER 1</br>
https://ggus.eu/ws/ticket_info.php?ticket=96482 (8/8)</br>
CMS have noticed transfers from Caltech to RAL failing. Problem looks to be transient, Brian asked if retries also fail. Waiting for reply (8/8)
https://ggus.eu/ws/ticket_info.php?ticket=96235 (29/7)</br>
Chris W has asked for a LFC for HyperK. In progress, but slowed by Vacations. In progress (9/8)
https://ggus.eu/ws/ticket_info.php?ticket=86152 (17/9/2012)</br>
The oldest open ticket: "correlated packet-loss on perfsonar host". Last news was that the upgrade to the Tier 1 backbone/uplink was still in the planning stage. But is the original problem still there? On hold (17/6)
https://ggus.eu/ws/ticket_info.php?ticket=96321 (2/8)</br>
The RAL SE is failing Sno+ nagios tests. Looks to be a problem with Kashif being mapped to t2k - problems seem to be authentication based (but what about ops tests - do they pass too? I smell a possible red herring). Waiting for reply (6/8)
https://ggus.eu/ws/ticket_info.php?ticket=96233 (29/7)</br>
Request for HyperK support on the RAL WMS. In progress, but again Summer Vacations are slowing things down. In progress (9/8)
https://ggus.eu/ws/ticket_info.php?ticket=91658 (20/2)</br>
Webdav support on the RAL LFC. Some good progress has been made by the looks of it, but again people going on well deserved, probably much needed holiday is slowing things down over the Summer. In progress (9/8)
|