|
|
(19 intermediate revisions by 5 users not shown) |
Line 27: |
Line 27: |
| ====== ====== | | ====== ====== |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
| + | '''Tuesday 18th June''' |
| + | * DPM Workshop last week - come to this week's storage meeting for an in-depth look. |
| + | ** https://indico.cern.ch/event/776832/ |
| + | * DIRAC downtime this week due to the move to Slough - good luck! |
| + | * EGI Ops meeting this week. |
| + | ** https://wiki.egi.eu/wiki/Agenda-2019-06-17 |
| + | ** HTCondorCE commissioning ongoing, SRM decommissioning survey |
| + | ** State of SRM usage at DPM sites? |
| + | |
| + | |
| '''Tuesday 11th June''' | | '''Tuesday 11th June''' |
| | | |
Line 32: |
Line 42: |
| * This week we will get round to looking at the outcome of the Security Day (and HEPSYSMAN). | | * This week we will get round to looking at the outcome of the Security Day (and HEPSYSMAN). |
| * The DPM Workshop is this week: https://indico.cern.ch/event/776832/ There's a Vidyo Room planned for people to listen in. | | * The DPM Workshop is this week: https://indico.cern.ch/event/776832/ There's a Vidyo Room planned for people to listen in. |
− | | + | * CentOS7 Migration https://twiki.cern.ch/twiki/bin/view/AtlasComputing/CentOS7Deployment |
| <!-- ***********************Start General text*********************** ----->''' | | <!-- ***********************Start General text*********************** ----->''' |
| '''Tuesday 4th June 2019''' | | '''Tuesday 4th June 2019''' |
Line 42: |
Line 52: |
| * Anything else? | | * Anything else? |
| | | |
− | '''Tuesday 21st May 2019'''
| |
− | * HEPSYSMAN is this week, folks can still attend remotely: https://indico.cern.ch/event/721692/
| |
− | * There was a WLCG Operations Coordination meeting last week: https://indico.cern.ch/event/820489/
| |
− | * The Cambridge CREAM CE is the next one to go: https://ggus.eu/index.php?mode=ticket_info&ticket_id=141241
| |
− | * Anything else?
| |
− |
| |
− | '''Tuesday 14th May 2019'''
| |
− | * Registration is still open for the HEPSYSMAN + SECURITY DAY meeting next week (booking deadline 5pm tomorrow), the Security Day training is especially important: https://indico.cern.ch/event/721692/
| |
− | * The DIRAC users workshop is at Imperial this week: https://indico.cern.ch/event/756635
| |
− | * Last week there was the GDB at the EGI conference - of particularly note is the CREAM migration talk: https://indico.cern.ch/event/739878/
| |
− | * WLCG Operations Coordination meeting this Thursday 16th: https://indico.cern.ch/event/820489
| |
− |
| |
− | '''Tuesday 30th April 2019'''
| |
− | * GridPP 42 was last week, any thoughts or comments? https://indico.cern.ch/event/780766/
| |
− | * Related, Dave B has sent out a "consulation" for GridPP43
| |
− | * There was a technical meeting on the future of DPM in the UK shortly before Easter https://indico.cern.ch/event/813140/
| |
− | * The summer HEPSYSMAN + SECURITY DAY is scheduled for the 23rd-24th of May: https://indico.cern.ch/event/721692/
| |
− | * Vis-a-vis the thread on TB-SUPPORT started by Simon - how many more of the "old ways of doing stuff" can we drop?
| |
− |
| |
− | ''' Tuesday 9th April 2019 '''
| |
− | * Any comments from [http://event.twgrid.org/isgc2019/program ISGC]?
| |
− | * ISGC also hosted the April [https://indico.cern.ch/event/739877/ GDB]
| |
− | * Last week also had the [https://indico.ph.qmul.ac.uk/indico/conferenceDisplay.py?confId=446 IRIS FTF].
| |
− | * Any other meetings or announcements I missed?
| |
− | * Most importantly - remember to tell Alastair what you want for your tea at the GridPP42 collaboration dinner.
| |
− |
| |
− |
| |
− | ''' Tuesday 02 April 2019 '''
| |
− | * Fermilab VOMS Changes
| |
− | * https://indico.cern.ch/event/762505/overview 9-11 April
| |
− | * CERN AFS phasing out
| |
− | * XCache ready in Birmingham
| |
− | * Site Storage & Satisfaction Survey
| |
− | * Anything Else
| |
− |
| |
− |
| |
− | '''Tuesday 27th March 2019'''
| |
− | * HOW HSF/OSG/WLCG workshop last week: https://indico.cern.ch/event/759388/
| |
− | * Reminder that GridPP42 registration closes next week: https://indico.cern.ch/event/780766/
| |
− | * Do people have anything they'd like to added to the agenda? Although you might need a shoehorn to fit anything in.
| |
− | * Provisional technical meeting this Friday@11am on HTCondor, and one is planned on EOS experiences soon(tm).
| |
− |
| |
− | * Anyone have anything else?
| |
− |
| |
− |
| |
− | ''' Tuesday 12th March 2019'''
| |
− | * https://indico.cern.ch/event/739876/
| |
− | * https://indico.egi.eu/indico/event/4321/
| |
− | * EGI A/R Report http://argo.egi.eu/lavoisier/site_reports?ngi=NGI_UK&report=Critical&accept=html
| |
− | * Any other updates
| |
− |
| |
− |
| |
− |
| |
− | '''Tuesday 5th March 2019'''
| |
− | * Atlas Site Jamboree is currently ongoing: https://indico.cern.ch/event/770307/
| |
− | * Anyone have any other general updates?
| |
− | * GDB Next week: https://indico.cern.ch/event/739876/
| |
− | * WLCG Ops meeting this Thursday: https://indico.cern.ch/event/803145/
| |
− |
| |
− |
| |
− | '''Monday 18th February 2019'''
| |
− | * GTF is about to release a regular update to the trust anchor repository (1.96) on Monday, FEB 25th 2019. See the [https://rt.egi.eu/rt/Ticket/Display.html?id=15335 ticket] for more information.
| |
− | * [https://indico.cern.ch/e/cvm19 CernVM Users Workshop 2019] will take place from 3 June to 5 June 2019 at CERN.
| |
− | * [https://twiki.cern.ch/twiki/bin/view/LCG/WLCGOpsMeetingWeek190218 Notes from the Monday WLCG operations meeting] are available.
| |
− | * Talks from the GDB last week are now on the [https://indico.cern.ch/event/739875/ agenda page].
| |
− |
| |
− |
| |
− | '''Tuesday 12th February'''
| |
− | * DPM release - 1.11
| |
− | * Approved VOs: changes to BIOMED and ENMR
| |
− | * Simon: VAC with VM condor VM to support local batch jobs?
| |
− | * There is a GDB this week - [https://indico.cern.ch/event/739875/ agenda].
| |
− | * NGI Argus/Central Banning config (see DC message).
| |
− | * There was a WLCG ops meeting yesterday. [https://twiki.cern.ch/twiki/bin/view/LCG/WLCGOpsMeetingWeek190211 Minutes] are available.
| |
− | * [https://indico.cern.ch/e/how2019 HOW19 registration] (early bird rate has ended). HSF-OSG-WLCG joint workshop.
| |
− | * A/R explanations for January: QMUL power outage; Bham now on EOS so SE tests fail; ECDF 80% LHCb??
| |
− |
| |
− |
| |
− | '''Tuesday 5th February'''
| |
− | * The EGI RP/RC A/R Report for January 2019 is [http://argo.egi.eu/lavoisier/site_reports?ngi=NGI_UK&report=Critical&accept=html available]. UKI overall fine. Bham and RAL-LCG2 may wish to examine their results.
| |
− | * A reminder.... Please could everyone think about their WLCG (and beyond) engagements and commitments and put anything of note in the [https://www.gridpp.ac.uk/wiki/Engagements_and_commitments wiki table here].
| |
− | * Notes from yesterday's WLCG ops meeting are [https://twiki.cern.ch/twiki/bin/view/LCG/WLCGOpsMeetingWeek190204 available]. Issues at RAL noted by LHCb.
| |
− | * It is CMS week this week.
| |
− | * In case missed here is Elena's ATLAS update:
| |
− | ** https://ggus.eu/index.php?mode=ticket_info&ticket_id=138033: singularity jobs failing at RAL. A job that Alessandra submitted still has a problem with the home directory. Ral is trying to fix it.
| |
− | ** There was a power lost to a rack and the two storage nodes in it in Lancaster over weekend which cause a problem to ATLAS jobs. This was fixed on Monday.
| |
− | ** Raul asked for srmless atlas transfers for Brunel. Elena is changing transfer configuration in AGIS but the jobs are still failing. Peter will look into this.
| |
− | ** There was a discussion on lightweight ATLAS Grid sites @ADC weekly last week. Sheffield, Cambridge, Brunel, Durham, Bham and Sussex should become diskless sites.
| |
− | * There is an ongoing discussion about LSST jobs on GridPP resources that may be of wider interest. Their VOMS was down so the question was raised about using GridPP VOMS or setting up another VOMS in the UK to read its data from the SLAC instance.
| |
− | * The 9th DIRAC Users Workshop will be held in London 14. - 17. May 2019. Here is the [https://indico.cern.ch/event/756635/overview registration link].
| |
− | * December's WLCG A/R final report is available at via [http://wlcg-docs.web.cern.ch/wlcg-docs/?dir=reporting/reliability-availability/2018/12-18 this link].
| |
− | * The figures for January 2019 are now available and updates from 3 sites requested:
| |
− | ** [http://wlcg-sam.cern.ch/reports/2019/201901/wlcg/WLCG_All_Sites_ALICE_Jan2019.pdf ALICE]
| |
− | ** [http://wlcg-sam.cern.ch/reports/2019/201901/wlcg/WLCG_All_Sites_ATLAS_Jan2019.pdf ATLAS]
| |
− | ** [http://wlcg-sam.cern.ch/reports/2019/201901/wlcg/WLCG_All_Sites_CMS_Jan2019.pdf CMS]
| |
− | ** [http://wlcg-sam.cern.ch/reports/2019/201901/wlcg/WLCG_All_Sites_LHCB_Jan2019.pdf LHCb]
| |
− | * Simon G asks: VAC with VM condor VM to support local batch jobs? Has anyone done it??
| |
− | * EGI has started an HTCondor integration process and captures progress here: https://ggus.eu/index.php?mode=ticket_info&ticket_id=139377.
| |
− | * End of Support for CREAM-CE: The CREAM working group has announced that official support for the CREAM-CE component will cease at the end of the EOSC-hub project, i.e. in Dec 2020. The CREAM working group will be providing full support until the end of 2019, including one minor release already scheduled. During 2020 only security updates will be released.
| |
| | | |
| | | |
Line 157: |
Line 68: |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
| <!-- ***********************Start ops coord text*********************** -----> | | <!-- ***********************Start ops coord text*********************** -----> |
| + | '''Tuesday 11th June''' |
| + | |
| + | * I got stuck (figuratively) in my machine room last Thursday afternoon so missed it. [https://indico.cern.ch/event/823800/ Agenda.] [https://twiki.cern.ch/twiki/bin/view/LCG/WLCGOpsMinutes190606 Minutes.] Ste was there - any observations? |
| | | |
| '''Tuesday 14th May''' | | '''Tuesday 14th May''' |
Line 197: |
Line 111: |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
| <!-- ***********************Start T1 text*********************** -----> | | <!-- ***********************Start T1 text*********************** -----> |
| + | |
| + | '''17 June 2019''' Report for the Experiments Liaison Report (17/06/2019) is [https://www.gridpp.ac.uk/wiki/Tier1_Operations_Report_2019-06-17 here]. |
| + | <!-- *********************************************************** -----> |
| + | <!-- **********************End T1 text************************** -----> |
| + | * Ongoing, we are seeing high outbound packet loss over IPv6. Central networking performed a firmware update to the border routers but this didn’t resolve the issue. Plan to move connections to the new border routers in Mid June. Will do this before trying to debug any further. |
| + | |
| '''11 June 2019''' Report for the Experiments Liaison Report (10/06/2019) is [https://www.gridpp.ac.uk/wiki/Tier1_Operations_Report_2019-06-10 here]. | | '''11 June 2019''' Report for the Experiments Liaison Report (10/06/2019) is [https://www.gridpp.ac.uk/wiki/Tier1_Operations_Report_2019-06-10 here]. |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
Line 204: |
Line 124: |
| * LHCb Castor instance has been completely disabled for LHCb and will be decommissioned. | | * LHCb Castor instance has been completely disabled for LHCb and will be decommissioned. |
| * Brian Davies has transferred from his role as GridPP Tier-2 Storage Support Officer and has joined the Tier-1 Production Team. Although this has happened with immediate effect he will still be available for ad-hoc/informal storage support. | | * Brian Davies has transferred from his role as GridPP Tier-2 Storage Support Officer and has joined the Tier-1 Production Team. Although this has happened with immediate effect he will still be available for ad-hoc/informal storage support. |
| + | |
| |} | | |} |
| + | |
| <!-- ****************Start Storage & DM****************** -----> | | <!-- ****************Start Storage & DM****************** -----> |
| {| style="background-color: #ffffff; border: 1px solid silver; border-collapse: collapse; width: 100%; margin: 0 0.5em 1em 0;" | | {| style="background-color: #ffffff; border: 1px solid silver; border-collapse: collapse; width: 100%; margin: 0 0.5em 1em 0;" |
Line 215: |
Line 137: |
| <!-- ******************Edit start********************* -----> | | <!-- ******************Edit start********************* -----> |
| | | |
− | '''[http://storage.esc.rl.ac.uk/weekly/20190410-minutes.txt Wed 10 Apr]''' | + | '''[http://storage.esc.rl.ac.uk/weekly/20191030-minutes.txt Wed 30 Oct]''' |
− | * Summary of mostly storage and data management related stuff at IRIS F2F meeting | + | * DOME upgrade problems at Edinburgh |
| + | * Data management support/development for IRIS users |
| + | |
| + | '''[http://storage.esc.rl.ac.uk/weekly/20191023-minutes.txt Wed 23 Oct]''' |
| + | * Rucio reporting |
| + | |
| + | '''Wed 16 Oct''' |
| + | * CEPH workshop at CERN report |
| + | |
| + | '''[http://storage.esc.rl.ac.uk/weekly/20191002-minutes.txt Wed 02 Oct]''' |
| + | * Safe to upgrade to DPM 1.13 but make sure the BDII is working if you support DIRAC |
| + | * Roadmap for xroot and http TPC for RAL FTS(es) |
| + | |
| + | '''[http://storage.esc.rl.ac.uk/weekly/20190925-minutes.txt Wed 25 Sept]''' |
| + | * Storage support for IRIS VOs? |
| + | |
| + | '''[http://storage.esc.rl.ac.uk/weekly/20190918-minutes.txt Wed 18 Sept]''' |
| + | * Report from yesterday's Rucio Face Meeting at Coseners |
| + | * Suggestions for following up from yesterday's CEPH day hosted by CERN |
| | | |
− | '''Tuesday 09/04/19''' | + | '''[http://storage.esc.rl.ac.uk/weekly/20190911-minutes.txt Wed 11 Sept]''' |
− | * Progress with xCache at Birmingham , ATLAS to use and test before other sites to try | + | * Storage related stuff at the FNAL (pre-)GDBs |
− | ** ATLAS have two modes but "transparent cache" not use case for when a site has no storage, so investigating "Volatile RSE" | + | * DOME upgrade tickets for non-DOME DPM sites |
− | * Site Storage Status & Satisfaction Survey
| + | |
− | ** Please can remaining sites send responses to Brian Davies
| + | |
− | * New gfal-* rpms available
| + | |
− | * Request to discuss DPM at GRIDPP42
| + | |
| | | |
− | '''[http://storage.esc.rl.ac.uk/weekly/20190227-minutes.txt Wed 27 Feb]''' | + | '''[http://storage.esc.rl.ac.uk/weekly/20190904-minutes.txt Wed 04 Sept]''' |
− | * Report from [https://indico.cern.ch/event/765497 HEPiX] | + | * Banning in SSC not entirely successful in non-DOME DPM, and end of support is nigh; tickets to upgrade will go out shortly. |
− | * Update on IPv6 | + | * Storage-and-data-management-wise, GridPP43 was interesting although no-one volunteered to install the next CEPH. |
| | | |
− | '''[http://storage.esc.rl.ac.uk/weekly/20190220-minutes.txt Wed 20 Feb]'''
| |
− | * There's a Rucio workshop next week, in Oslo
| |
− | * Data lakes and analytics facilities.
| |
| | | |
| <!-- ******************Edit stop********************* -----> | | <!-- ******************Edit stop********************* -----> |
Line 302: |
Line 235: |
| <!-- *********************************************************** -----> | | <!-- *********************************************************** -----> |
| <!-- ******************Edit start********************* -----> | | <!-- ******************Edit start********************* -----> |
| + | |
| + | ''' Tue 9th July 2019''' |
| + | |
| + | LHCb has added this to their requirements: |
| + | |
| + | Sites not having an SRM installation must provide: |
| + | |
| + | * disk only storage |
| + | * a GRIDFPT endpoint (a single dns entry) |
| + | * an XROOT endpoint (a single dns entry) |
| + | * a way to do the accounting (preferably following the WLCG TF standard: https://twiki.cern.ch/twiki/bin/view/LCG/StorageSpaceAccounting) |
| + | |
| ''' Tue 16th April 2019''' | | ''' Tue 16th April 2019''' |
| | | |
Line 556: |
Line 501: |
| ===== ===== | | ===== ===== |
| <!-- ******************Edit start********************* -----> | | <!-- ******************Edit start********************* -----> |
− | '''Monday 3rd June 2019, 14.30 BST'''<br />
| |
− | 47 Open UK Tickets this month
| |
− |
| |
− | '''Yearly GOCDB review'''<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141296 141296] (21/5)<br />
| |
− | Has everyone checked their site's GOCDB information to make sure it's all up to date?
| |
− |
| |
− | '''iris.ac.uk VO tickets'''<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141533 141533]<br />
| |
− | A new VO has been created. Will it be added to the operations portal? Is the plan to make this a gridpp approved VO, or only for IRIS sites? ''Andrew spoilered this one for us on TB-SUPPORT!''
| |
− |
| |
− | '''LCGDM "Retirement" tickets'''<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141466 141466] (Glasgow)<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141468 141468] (Liverpool)<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141469 141469] (Manchester)<br />
| |
− | Tickets asking for site's plans now that LCGDM is phased out - all sites have replied.
| |
− |
| |
− | '''BRUNEL'''<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141475 141475] (29/5)<br />
| |
− | This ticket regarding your ARC CEs seems to have been missed. Assigned (29/5)
| |
− |
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141435 141435] (28/5)<br />
| |
− | Similar for this atlas ticket. Assigned (28/5)
| |
− |
| |
− | ''Both tickets handled now.''
| |
− |
| |
− | '''QMUL'''<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141553 141553] (3/6)<br />
| |
− | It appears that (presumably harmless) CERT warnings are generating alarms in the ROD dashboard. Is that new behaviour? In progress (3/6) ''Discussed on TB-SUPPORT''
| |
− |
| |
− | CAMBRIDGE<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141241 141241] (20/5)<br />
| |
− | How goes the decommissioning? Are all broadcasts sent out? In progress (20/5)
| |
− |
| |
− | ECDF<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141098 141098] (9/5)<br />
| |
− | This DUNE ticket has been solved, and can be closed with the blessing of the VO. In progress (29/5)
| |
− |
| |
− | SHEFFIELD<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=138649 138649] (3/12/18)<br />
| |
− | Do you need any extra help fixing the file move Elena? On hold (20/5)
| |
− |
| |
− | DURHAM<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141234 141234] (20/5)<br />
| |
− | Another case of atlas jobs being killed due to using too much memory. In Progress (29/5)
| |
− |
| |
− | TIER 1<br />
| |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=141262 141262] (21/5)<br />
| |
− | Any progress with this LHCB ticket, where jobs failed trying to access a file? In progress (22/5)
| |
| | | |
− | [https://ggus.eu/?mode=ticket_info&ticket_id=140870 140870] (25/4)<br />
| + | 32 Open Tickets this week, which is an in depth a look as I've been able to take. |
− | Has the process of moving these T2K files to where they should be in the CASTOR namespace started? In progress (29/4)
| + | |
| | | |
| <!-- ******************Edit stop********************* -----> | | <!-- ******************Edit stop********************* -----> |