Difference between revisions of "Operations Bulletin Latest"
(→) |
(→) |
||
Line 466: | Line 466: | ||
===== ===== | ===== ===== | ||
<!-- ******************Edit start********************* -----> | <!-- ******************Edit start********************* -----> | ||
+ | '''Monday 3rd December 2018, 14.00 GMT'''<br /> | ||
+ | 39 Open Tickets this month, going site by site: | ||
+ | '''RALPP'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138588 138588] (28/11)<br /> | ||
+ | A CMS ticket about SRM timeouts, Ian has marked it in progress but no update with words in it yet. In progress (29/11) | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131616 131616] (3/11/17)<br /> | ||
+ | RALPP's v6 ticket. After having to rollback the last attempt Chris is having another go today after a router firmware update. Good luck! In Progress (3/12) | ||
+ | |||
+ | '''OXFORD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131615 131615] (3/11/17)<br /> | ||
+ | Just the v6 ticket at Oxford, no recent news but Duncan asked some questions last week, advertising some JISC services that could make life easier. On Hold (29/11) | ||
+ | |||
+ | '''BRISTOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138402 138402] (21/11)<br /> | ||
+ | An LHCB tickets for failed pilots. Initial problems seemed to be with the RAL BDII not giving the right (or any) information for Bristol, but this has been fixed and jobs seem to be failing with connection errors. Winnie and Lukasz are working on it. In Progress (1/12) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138041 138041] (1/11)<br /> | ||
+ | CMS transfers failing from Bristol. Files are on disk but not in the DPM namespace - waiting on a fix to the DPM shell to proceed with this. I think I'm waiting on the same update. In progress (30/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131613 131613] (3/11/17)<br /> | ||
+ | Bristol's v6 ticket. Has there been any recent progress here? In progress (9/10) | ||
+ | |||
+ | '''BIRMINGHAM'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=137801 137801] (17/10)<br /> | ||
+ | The ticket tracking the decommissioning of the old Birmingham DPM. All proceeding as expected, switch off date is the 10th of December. In progress (26/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138244 138244] (12/11)<br /> | ||
+ | Availability ticket, which will continue to be alarming during the decommissioning process. On Hold (12/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131612 131612] (3/11/17)<br /> | ||
+ | Birmingham's v6 ticket. No updates since August, any news Mark? Even a confirmation of there being no news would be useful. On Hold (27/8) | ||
+ | |||
+ | '''GLASGOW'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=134689 134689] (23/4)<br /> | ||
+ | Perfsonar upgrade to CentOS7 ticket. On hold whilst trying to get v6 to work. On Hold (30/10) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131611 131611] (3/11/17)<br /> | ||
+ | Glasgow's v6 ticket. v6 was enabled but the v6 packets aren't flowing. Any updates on diagnosing/fixing this? In progress (22/10) | ||
+ | |||
+ | '''EDINBURGH'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138243 138243] (12/11)<br /> | ||
+ | ROD Availability ticket, caused by a late lcg-CA Package update. Metrics are on the mend, but Andy added his thoughts on the reliability of these reliability metrics. On Hold (19/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131610 131610] (3/11)<br /> | ||
+ | The ECDF v6 ticket. Any news on your next Ipv6 rollout plans? In progress (10/9) | ||
+ | |||
+ | '''DURHAM'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=134687 134687] (23/4)<br /> | ||
+ | Request to update Perfsonar to CentOS7. Adam gave a plan to do this in the site's big C7 rollout, expected at the start of next year. Luckily not long off now! In progress (6/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131609 131609] (3/11/17)<br /> | ||
+ | Durham's v6 ticket. After painting a bleak picture of mid-2019 as the earliest they could expect a full v6 rollout at Durham Duncan has asked some questions to try to help things along. In Progress (should be On Hold?) (29/11) | ||
+ | |||
+ | '''SHEFFIELD'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131608 131608] (3/11/17)<br /> | ||
+ | Just the v6 ticket at Sheffield. A positive update from Elena at the end of October hoping to dual-stack the perfsonar boxes by mid-November. Have you managed to do this yet? Do you need a hand with anything? In progress (30/10) | ||
+ | |||
+ | '''MANCHESTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=137112 137112] (11/9)<br /> | ||
+ | Atlas SRM space reporting broken by a dodgy drain moving data outside of tokens. A repair script has been running for a long time, and it should be just about fixed. Alessandra has asked Tim to check the atlas-eye view of the space reporting to see if this issue can be closed. Waiting for reply (29/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131607 131607] (3/11/17)<br /> | ||
+ | Manchester's v6 ticket. Some good news here, with a new v6 range being put into production and a hope that the storage will be dual-stacked for Christmas. Nice. In progress (3/12) | ||
+ | |||
+ | '''LIVERPOOL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=131606 131606] (3/11/17)<br /> | ||
+ | Only the v6 ticket at Liverpool. No news for a long while on this ticket though. In progress (4/6) | ||
+ | |||
+ | '''LANCASTER'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138365 138365] (19/11)<br /> | ||
+ | Providing storage dumps for the t2k files at Lancaster as a precursor to the move to the DFC. It's proving more difficult then initially thought, in part due to a lot of files not having their checksum information in them. Getting DPM to calculate and store these has been more of a pain then it should have been. In progress (3/12) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=137996 137996] (30/10)<br /> | ||
+ | A ROD ticket for a failed webdav test. Waiting on a new patch for DPM that will fix the dodgy behaviour. No sign of it yet though. On Hold (5/11) | ||
+ | |||
+ | phttps://ggus.eu/?mode=ticket_info&ticket_id=136635 136635] (9/8)<br /> | ||
+ | Availability ROD ticket. Just a few (smooth) days away from being able to close this one. On Hold (5/11) | ||
+ | |||
+ | '''RHUL'''<br /> | ||
+ | phttps://ggus.eu/?mode=ticket_info&ticket_id=131603 131603] (3/11/17)<br /> | ||
+ | Just the v6 ticket for RHUL. Last word is that central IT are outsourcing v6 DNS to JANET. Any news on how this is going? We'd like to hear more on this experience. In Progress (29/10) | ||
+ | |||
+ | '''QMUL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138364 138364] (19/11)<br /> | ||
+ | The QM ticket for the T2K DFC migration. Dan was quick to provide the dump, and has tried to migrate the data (I think mainly successfully?). Just needing to clear up some details. In progress (28/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=134573 134573] (17/4)<br /> | ||
+ | CMS request to install singularity. This is waiting on the move to CentOS7 at QM, which currently has a test setup with a pre-production queue hopefully coming before Christmas. On Hold (5/11) | ||
+ | |||
+ | '''IMPERIAL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138360 138360] (19/11)<br /> | ||
+ | The Imperial ticket for the T2K DFC migration. On Hold after the files have been removed and the file dumps provided. On Hold (3/12) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138359 138359] (19/11)<br /> | ||
+ | Master ticket for all the T2K DFC migrations. Since I started writing this Daniela added child tickets for:<br /> | ||
+ | Oxford: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=138647 138647]<br /> | ||
+ | Liverpool: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=138648 138648]<br /> | ||
+ | RALPP: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=138651 138651] <br /> | ||
+ | Sheffield: [https://ggus.eu/index.php?mode=ticket_info&ticket_id=138649 138649] | ||
+ | |||
+ | '''BRUNEL'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138498 138498] (26/11)<br /> | ||
+ | LHCB not able to access a Brunel ARC CE. This ticket appears to have been missed. Assigned (26/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=133956 133956] (9/3)<br /> | ||
+ | CMS asking to update the Brunel xroot configs. Raul updated DPM last week and hoped to enable DOME (which will enable the xroot changes) later in the week. Any joy? In Progress (26/11) | ||
+ | |||
+ | 100IT have a ticket: [https://ggus.eu/?mode=ticket_info&ticket_id=137306 137306] | ||
+ | |||
+ | '''TIER 1'''<br /> | ||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138361 138361] (19/11)<br /> | ||
+ | The Tier 1's T2K migration to the DFC ticket. Alastair provided the file dump over the weekend. In progress (1/12) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138493 138493] (26/11)<br /> | ||
+ | CMS transfers failing from RAL to T2_CH_CERN. Turned out to be one bad file originally, but now more have appeared. Reopened (3/12) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138500 138500] (26/11)<br /> | ||
+ | A similar ticket, CMS transfers failing from T2_PL_Swierk to RAL. This one has been bounced to RAL, I don't if that was right or fair. In progress (28/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138613 138613] (29/11)<br /> | ||
+ | CMS asked to check a file that was failing to stage from tape. It looks like the file isn't in Castor at all. In Progress (3/12) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138584 138584] (28/11)<br /> | ||
+ | CMS xroot reads timing out for SAM tests. It looked to be an intermittent problem, and might have disappeared over the weekend. In progress (30/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138461 138461] (22/11)<br /> | ||
+ | Winnie's ticket concerning Bristol's old bdii being "stuck" in the RAL Top BDII. Any word on his from the RAL BDII admins? I admit I haven't digested the lcg-rollout thread(s). In progress (26/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=138033 138033] (1/11)<br /> | ||
+ | Atlas singularity jobs failing at RAL. It looks like some progress was made on this last week from both sides. In progress (30/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=137897 137897] (23/10)<br /> | ||
+ | enmr jobs not being accounted at RAL, but it looks to be that's because they never successfully ran. A ticket has been submitted to dirac ([https://ggus.eu/index.php?mode=ticket_info&ticket_id=138414 138414]) to get to the bottom of this. In Progress (28/11) | ||
+ | |||
+ | [https://ggus.eu/?mode=ticket_info&ticket_id=137822 137822] (18/10)<br /> | ||
+ | LHCB ticket regarding the FTS being in a "bad state". Waiting to restart the castor -> echo migration to test to see if the problems can be duplicated, as they appear to happen under heavy load on the RAL FTS. On Hold (22/11) | ||
<!-- ******************Edit stop********************* -----> | <!-- ******************Edit stop********************* -----> |
Revision as of 16:04, 3 December 2018
Week commencing Monday 3rd December 2018 |
Task Areas |
|
|
Meeting Summaries |
|
|