Monday 11th of June, 13:00 BST</br>
29 open UK tickets this week.
Over the last week there have been a few queries, at the Ops-team meeting and in tickets themselves, concerning the correct procedure for involving other sites/support units. I hope to have something solid on this for next week.
QMUL/NGI</br>
https://ggus.eu/ws/ticket_info.php?ticket=83020</br>
Reliability/availability report ticket for QMUL. Jeremy's already notified the site but worth mentioning here too.
DURHAM/NGI</br>
https://ggus.eu/ws/ticket_info.php?ticket=83006</br>
Same for Durham. Update: Durham replied, citing the infrastructure issues that they've had.
NGI</br>
https://ggus.eu/ws/ticket_info.php?ticket=82491</br>
Jeremy's ticket to the VOMS chaps. Looks like it can be closed.
https://ggus.eu/ws/ticket_info.php?ticket=82492</br>
Chris' ticket on the same subject; it can be closed or reassigned to the VOMS developers as an RFE.
SNO+</br>
https://ggus.eu/ws/ticket_info.php?ticket=82671</br>
https://ggus.eu/ws/ticket_info.php?ticket=82670</br>
Have been seeing problems retrieving output from desdemona.zih.tu-dresden.de on both the Glasgow and Imperial WMSs. Likely a problem at the other end, but these tickets threaten to bounce around.
IC</br>
https://ggus.eu/ws/ticket_info.php?ticket=82946</br>
Interesting ticket: despite having cvmfs installed, IC seem to be missing a release. This ticket is tracking the investigation.
QMUL</br>
https://ggus.eu/ws/ticket_info.php?ticket=82842</br>
https://ggus.eu/ws/ticket_info.php?ticket=82891</br>
Chris has been seeing jobs dying with "Cancelled by CE admin" errors. This prompted his mail to TB-SUPPORT today (where he saw lots of files in /opt/glite/tmp that needed clearing out). Affecting biomed & hone jobs. Daniela suggests a full service restart and cites a recently discovered problem in ticket 82891. Update: Chris has moved non-LHC VOs off the affected CE, which seems to have calmed the problem.
DURHAM</br>
https://ggus.eu/ws/ticket_info.php?ticket=82818</br>
LHCb pilots seem to be dying due to cvmfs problems, although fixing it may be a low priority due to ongoing power troubles at the site. Update: Durham are currently having a bad time of it; this will get looked at in due course.
RALPP</br>
https://ggus.eu/ws/ticket_info.php?ticket=82739</br>
heplnx206.pp.rl.ac.uk not working for biomed; looks like information system errors (or the SE isn't actually for biomed's use). Ticket is looking a little neglected. Update: Chris has found the error to be caused by the "fix" for another ticket (75960); removing the fix cured the problem.
CAMBRIDGE</br>
https://ggus.eu/ws/ticket_info.php?ticket=82296</br>
Still no word from ATLAS on whether the problems have gone away. I suggest closing the ticket after leaving a quick description of your solution.
SOLVED CASE PILE</br>
https://ggus.eu/ws/ticket_info.php?ticket=82749</br>
A ticket from the UK (to NGI_GRNET) concerning the problems seen during the SUSSEX certification (81784); all issues have been solved.
FROM THE UK</br>
Due to the glacial movement of many of these tickets, I'm cataloguing them elsewhere (https://www.gridpp.ac.uk/wiki/Tickets_From_The_UK), only documenting significant changes or new problems here.
https://ggus.eu/ws/ticket_info.php?ticket=83133</br>
The na62 FTS service at CERN doesn't appear to be switched on...
Update: Daniela has two interesting tickets:</br>
https://ggus.eu/ws/ticket_info.php?ticket=82746</br>
Daniela spotted an error in the certificate handling for LB on SL6. The LB developers look like they've found the cause.
https://ggus.eu/tech/ticket_show.php?ticket=82448</br>
The EMI-1 LB seems to have a habit of filling up /var/tmp with notifications when things aren't working as intended; this has been tracked down to glite-lb-notif-interlogd crashing. Investigation continues.