Search results

Jump to: navigation, search

Page title matches

  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Tier 1 CASTOR stop and rebooted for Ghost vulnerability (and CIP)
    3 KB (449 words) - 16:58, 6 February 2015
  • ...ing - need to investigate if a fix is already available, if not discuss at castor face to face * Break in connectivity Monday 8th, it appears that this did not affect castor internally in any way however if transfers were in process they would have
    3 KB (404 words) - 15:14, 12 September 2014
  • [[Category:CASTOR]] == Tier1 Castor at RAL Weekly Operations ==
    31 KB (3,178 words) - 09:34, 2 August 2019
  • * CASTOR 2.1.14 + SL5/6 testing. The change control has gone through today with few * Castor on Call person
    1 KB (181 words) - 13:58, 17 March 2014
  • ...transfers from one non-LHC VO from affecting other due to the use a shared CASTOR instance - Able to set limits for each VO srm endpoint , need to decide and
    1 KB (188 words) - 14:11, 21 December 2016
  • ...313-01 || Medium || ATLAS || Alastair || Make sure ATLAS GGUS ticket about CASTOR problems affecting FTS is up-to-date || Closed || 2013-05-01
    2 KB (219 words) - 09:28, 20 May 2015
  • * CASTOR 2.1.14 + SL5/6 testing. The change control has gone through today with few * Castor on Call person
    1 KB (164 words) - 15:18, 24 March 2014
  • 2. SL5 elimination from CASTOR functional test boxes and tape verification server 3. CASTOR stress test improvement
    2 KB (333 words) - 10:24, 28 July 2017
  • * CASTOR 2.1.14 Upgrade Progress - Reversion to 2.1.13-9 software and databases on p * (Tue 1 Apr) Facilities CASTOR Upgrade. Downtime between 0900-1600
    2 KB (368 words) - 16:46, 28 March 2014
  • * Facilities CASTOR was successfully upgraded to 2.1.14-11 ...rian to discuss with Alastair. Other tier 1s are not keen but RAL tier 1 / castor should be able to cope with this.
    1,019 B (149 words) - 13:21, 4 April 2014
  • * The NN_FILE_STAGERTIME constraint has been removed for the Facilities CASTOR database, completing the 2.1.14 upgrade. This upgrade was thought to be tra * The xrootd timeout in castor.conf is now set to 30s for all nodes.
    1 KB (221 words) - 10:09, 15 April 2014
  • * A new version of CASTOR 2.1.14 (2.1.14-12) has been released. This version makes no changes to the * CASTOR 2.1.14 upgrade for Tier 1.
    1 KB (208 words) - 13:02, 25 April 2014
  • * CASTOR 2.1.14 upgrade for Tier 1. Possible date for first stage of intervention (N * CASTOR 2.1.14 for Tier 1
    1 KB (161 words) - 15:56, 2 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * CASTOR 2.1.14 upgrade for Tier 1. Possible date for first stage of intervention (N
    2 KB (245 words) - 10:07, 13 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * CASTOR 2.1.14 upgrade for Tier 1. First stage of intervention (NS upgrade) is book
    2 KB (294 words) - 15:03, 19 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local ...n our issues was reported/fixed. These servers are now in acceptance test. Castor team will only deploy V13 servers to non prod until further notice.
    2 KB (290 words) - 10:34, 30 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local ...n our issues was reported/fixed. These servers are now in acceptance test. Castor team will only deploy V13 servers to non prod until further notice.
    2 KB (276 words) - 13:46, 28 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local ...have been upgraded need further configurations (James) before releasing to castor team. V13 machines in production should have firmware update, best approach
    2 KB (267 words) - 15:00, 9 June 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * A partitioning alignment issue (3rd CASTOR partition) has been identified, proposal is to resolve this for new machine
    3 KB (412 words) - 13:13, 13 June 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * A partitioning alignment issue (3rd CASTOR partition) has been identified, proposal is to resolve this for new machine
    3 KB (423 words) - 12:45, 20 June 2014
  • .... CERN provided a solution for SL5.9. We need to consider SL6 upgrade post CASTOR 2.1.14-13 upgrades. ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local
    2 KB (366 words) - 16:00, 27 June 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * CMS db locking issue 3/7/14 early hours, resulted in lost CMS test file, castor current shows diskcopy_failed in stager logs. Proposal is to identify if th
    2 KB (362 words) - 15:10, 12 August 2014
  • ...ek with the task of investigating visualisation and querying solutions for CASTOR use. * CASTOR 2.1.14-13 upgrade for Repack - planned for Tuesday or Wednesday this week.
    2 KB (308 words) - 13:48, 14 July 2014
  • ...on with the task of investigating visualisation and querying solutions for CASTOR use. * Incorrect service classes in castor.conf on disk servers, Atlas issues resolved by Rob. Other non production is
    2 KB (318 words) - 09:07, 21 July 2014
  • ...on with the task of investigating visualisation and querying solutions for CASTOR use. * Facilities castor error
    2 KB (262 words) - 15:46, 25 July 2014
  • * We have received word that a 2.1.14-15 version of CASTOR may be forthcoming. * Kashyap's Elasticsearch query script has been rolled out to CASTOR headnodes. Users are encouraged to test it and report any bugs.
    2 KB (279 words) - 16:48, 1 August 2014
  • * Kashyap's Elasticsearch query script has been rolled out to CASTOR headnodes. Users are encouraged to test it and report any bugs. ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (300 words) - 11:01, 15 August 2014
  • * Kashyap's Elasticsearch query script has been rolled out to CASTOR headnodes. Users are encouraged to test it and report any bugs. ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (313 words) - 10:40, 15 August 2014
  • * passive draining produces file duplication - fixed in castor 2.1.14-14 * SL6 castor stalled due to resource limitations
    2 KB (312 words) - 10:55, 22 August 2014
  • * passive draining produces file duplication - fixed in castor 2.1.14-14 * SL6 castor stalled due to resource limitations & A/L
    2 KB (338 words) - 15:06, 29 August 2014
  • * Juan to patch castor dbs beginning of Nov PSU patches – standard change ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (279 words) - 12:37, 5 September 2014
  • ==RAL Tier1 Incident 20130628 Atlas Castor Outage======Description:=== The ATLAS CASTOR instance encountered a problem where large numbers of invalid subrequests g
    10 KB (1,594 words) - 10:56, 1 May 2015
  • * Juan to patch castor dbs beginning of Nov PSU patches – standard change ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (297 words) - 16:23, 19 September 2014
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * gdss659 is still but will be decommissioned out of CASTOR.
    2 KB (383 words) - 10:53, 4 February 2015
  • * useful breakout sessions at Castor face to face - deadlock analysis & bugs confirmed, discussions to simplify * Juan to patch castor dbs starting next week (PSU patches) – standard change
    2 KB (274 words) - 15:25, 26 September 2014
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * SL6 name server upgrade postponed due to castor team resource - likely to be this week
    2 KB (366 words) - 15:20, 16 January 2015
  • * useful breakout sessions at Castor face to face - deadlock analysis & bugs confirmed, discussions to simplify ...nt on gdss720. Server currently in read only and will revisit post current castor issues.
    3 KB (479 words) - 09:41, 7 October 2014
  • * SL6 Headnode work progressing well - hoping for test in castor vcert next week ...h due to emc failure. Action Add CIP into instructions for castor failover.Castor team decided to wait until dbs rolled back.
    3 KB (479 words) - 16:36, 10 October 2014
  • * SL6 Headnode work progressing well - tested in vcert2, hoping for test in castor vcert next week and production end of Nov. * Successfully moved Castor atlas/gen stager/srm back to primary db following EMC cache battery replace
    2 KB (378 words) - 12:09, 17 October 2014
  • * 2-1-14-14 castor upgrade priority dropped as we have a draining workaround. Revisit once SL6 ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (270 words) - 14:50, 27 October 2014
  • ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future. * Possible future upgrade to CASTOR 2.1.14-15 post christmas
    2 KB (355 words) - 10:19, 3 November 2014
  • ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future. * Possible future upgrade to CASTOR 2.1.14-15 post Christmas
    1 KB (226 words) - 13:51, 12 November 2014
  • ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future. * Possible future upgrade to CASTOR 2.1.14-15 post Christmas
    2 KB (267 words) - 14:41, 14 November 2014
  • ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future. * Possible future upgrade to CASTOR 2.1.14-15 post Christmas
    2 KB (265 words) - 14:14, 21 November 2014
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * gdss659 is still but will be decommissioned out of CASTOR.
    2 KB (364 words) - 15:03, 2 December 2014
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * gdss659 is still but will be decommissioned out of CASTOR.
    2 KB (331 words) - 10:53, 4 February 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Kernel and errata upgrade on Castor SL6 headnodes (including reboot) - Tues 23rd 10:00 - 12:00
    3 KB (386 words) - 11:33, 19 December 2014
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Testing CASTOR rebalancer on preproduction.
    4 KB (574 words) - 15:27, 11 May 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * SL6 name server upgrade postponed due to castor team resource - needs to be rescheduled
    2 KB (368 words) - 13:40, 9 January 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Redundant atlasHotdisk service class and disk pool from CASTOR
    2 KB (358 words) - 14:00, 23 January 2015

Page text matches

  • ** [[Using Castor At RAL]]
    8 KB (1,130 words) - 17:31, 17 April 2024
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Tier 1 CASTOR stop and rebooted for Ghost vulnerability (and CIP)
    3 KB (449 words) - 16:58, 6 February 2015
  • ...ing - need to investigate if a fix is already available, if not discuss at castor face to face * Break in connectivity Monday 8th, it appears that this did not affect castor internally in any way however if transfers were in process they would have
    3 KB (404 words) - 15:14, 12 September 2014
  • [[Category:CASTOR]] == Tier1 Castor at RAL Weekly Operations ==
    31 KB (3,178 words) - 09:34, 2 August 2019
  • * CASTOR 2.1.14 + SL5/6 testing. The change control has gone through today with few * Castor on Call person
    1 KB (181 words) - 13:58, 17 March 2014
  • ...; margin-bottom: 0; padding-top: 0.1em; padding-bottom: 0.1em;" | Resolved Castor Disk Server Issues ...0; margin-bottom: 0; padding-top: 0.1em; padding-bottom: 0.1em;" | Ongoing Castor Disk Server Issues
    14 KB (1,386 words) - 09:37, 19 June 2019
  • ...en many zero-sized files created for Alice in Castor. This appears to be a Castor timeout affecting files that are written over a period of more than two hou * A start has been made on updating the Castor tape servers to SL6. (One server for each of the 'C' and 'D' drives was upd
    16 KB (1,794 words) - 12:58, 6 May 2015
  • * LHCb Castor instance has been completely disabled for LHCb and will be decommissioned.
    41 KB (5,018 words) - 14:09, 30 October 2019
  • ...ind Castor. In the meantime we will carry out the (separate) update of the Castor SRMs to version 2.14. * "GEN Scratch" storage in Castor will be decommissioned.
    40 KB (4,974 words) - 12:18, 11 April 2016
  • * We have uncovered a problem where draining of Castor disk servers is now going very slowly. We need to drain a few old servers t ...w but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when these (failed)
    13 KB (1,356 words) - 09:59, 16 March 2016
  • ...w but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when these (failed) * The Castor repack instance was updated from version 2.1.14.13 to 2.1.14.15.
    11 KB (1,098 words) - 09:42, 17 February 2016
  • ...w but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when these (failed) ** This morning the link to the Castor headnodes was moved.
    13 KB (1,411 words) - 08:55, 10 December 2015
  • ...roblems with the special xroot configuration for Alice following since the Castor 2.1.14 update on Tuesday (24th). These were resolved this morning (25th). ...e xroot settings were tuned. Significant improvement were made and the CMS Castor instance is now working OK but being closely monitored.
    13 KB (1,342 words) - 16:20, 25 June 2014
  • * Problems with "CMSDisk" in Castor reported last week have been resolved. CMS deleted files freeing up space a | All Castor (SRM) endpoints
    13 KB (1,357 words) - 12:47, 9 May 2014
  • ...ier 1 concerning not being able to get the gfal commands to work accessing Castor. Duncan has posted to the ticket that things are working for him now, along ... discussed recently in the Ops meeting, a conversation is ongoing with the Castor devs about this, but there wasn't much noise from them at last check. The t
    184 KB (30,332 words) - 17:18, 16 December 2014
  • * A high rate of Atlas file access failures into/from Castor was seen during the day yesterday (9th Sep). A number of measures were take ... files. These are likely to be the results of partly failed transfers into Castor in the past. These are being checked and will be followed up with the appro
    13 KB (1,367 words) - 13:30, 10 September 2014
  • ...; margin-bottom: 0; padding-top: 0.1em; padding-bottom: 0.1em;" | Resolved Castor Disk Server Issues ...0; margin-bottom: 0; padding-top: 0.1em; padding-bottom: 0.1em;" | Ongoing Castor Disk Server Issues
    17 KB (1,646 words) - 09:31, 11 July 2018
  • * We have had load issues on the CMS Castor instance throughout the holiday period which has led to repeated SAM test f ...w but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when these (failed)
    14 KB (1,476 words) - 14:02, 4 January 2017
  • ...transfers from one non-LHC VO from affecting other due to the use a shared CASTOR instance - Able to set limits for each VO srm endpoint , need to decide and
    1 KB (188 words) - 14:11, 21 December 2016
  • * Castor xroot performance problems seen by CMS - particularly in very long file ope * The Castor tape servers are being updated to SL6.
    13 KB (1,442 words) - 11:25, 20 May 2015
  • ...313-01 || Medium || ATLAS || Alastair || Make sure ATLAS GGUS ticket about CASTOR problems affecting FTS is up-to-date || Closed || 2013-05-01
    2 KB (219 words) - 09:28, 20 May 2015
  • * There have been problems with the CMS Castor instance through the last week. These are triggered by high load on CMS_Tap ...is significantly adcanced and further investigations are ongoing using the Castor Preprod instance. Ideas for a workaround are being developed.
    14 KB (1,553 words) - 11:36, 19 March 2014
  • ...w but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when these (failed) * Castor:
    13 KB (1,364 words) - 12:54, 20 January 2016
  • * CASTOR 2.1.14 + SL5/6 testing. The change control has gone through today with few * Castor on Call person
    1 KB (164 words) - 15:18, 24 March 2014
  • 2. SL5 elimination from CASTOR functional test boxes and tape verification server 3. CASTOR stress test improvement
    2 KB (333 words) - 10:24, 28 July 2017
  • * There have been problems with the Atlas Castor instance that appear to be within the SRM. AtlasScratch shows some high loa * Since yesterday morning there has been a problem with Castor file transfers for those transfers initiated by the CERN FTS3 service. This
    18 KB (1,971 words) - 14:03, 26 July 2017
  • * Last Tuesday we moved a number of Castor disk servers physically within the machine room. This was required to make * The Castor Team plan to upgrade to version 2.1.14-15 ahead of migrating to the next ve
    46 KB (5,846 words) - 07:57, 9 March 2015
  • * CASTOR 2.1.14 Upgrade Progress - Reversion to 2.1.13-9 software and databases on p * (Tue 1 Apr) Facilities CASTOR Upgrade. Downtime between 0900-1600
    2 KB (368 words) - 16:46, 28 March 2014
  • ...the CMS Castor instance at the end of last week and the start of this. The Castor /Database teams have some ideas for the cause of this which looks to be loa * There have been problems with the CMS Castor instance caused by load issues through the disk cache in front of CMS_Tape.
    48 KB (6,293 words) - 07:35, 31 March 2014
  • ...the CMS Castor instance at the end of last week and the start of this. The Castor /Database teams have some ideas for the cause of this which looks to be loa * There have been problems with the CMS Castor instance caused by load issues through the disk cache in front of CMS_Tape.
    48 KB (6,293 words) - 07:36, 31 March 2014
  • * There was a failover of an Atlas Castor Database early evening on Tuesday 25th March. The failover triggered a call * There have been problems with the CMS Castor instance in recent weeks. These are triggered by high load. Work is underwa
    16 KB (1,769 words) - 14:16, 2 April 2014
  • * Facilities CASTOR was successfully upgraded to 2.1.14-11 ...rian to discuss with Alastair. Other tier 1s are not keen but RAL tier 1 / castor should be able to cope with this.
    1,019 B (149 words) - 13:21, 4 April 2014
  • * Load related problems with the CMS Castor instance have been ongoing. Plans to mitigate this are in place.
    45 KB (5,701 words) - 09:21, 7 April 2014
  • * The load related problems reported for the CMS Castor instance have not been seen this last fortnight. However, work is underway ...is significantly adcanced and further investigations are ongoing using the Castor Preprod instance. Ideas for a workaround are being developed.
    13 KB (1,469 words) - 10:34, 16 April 2014
  • * The load related problems reported for the CMS Castor instance havenot been seen this last week. However, work is underway to tac ...is significantly adcanced and further investigations are ongoing using the Castor Preprod instance. Ideas for a workaround are being developed.
    14 KB (1,599 words) - 11:33, 14 April 2014
  • * The NN_FILE_STAGERTIME constraint has been removed for the Facilities CASTOR database, completing the 2.1.14 upgrade. This upgrade was thought to be tra * The xrootd timeout in castor.conf is now set to 30s for all nodes.
    1 KB (221 words) - 10:09, 15 April 2014
  • * The load related problems reported for the CMS Castor instance have not been seen for a few weeks. However, work is underway to t ...is significantly advanced and further investigations are ongoing using the Castor Preprod instance. Ideas for a workaround are being developed.
    13 KB (1,411 words) - 10:57, 23 April 2014
  • * A new version of CASTOR 2.1.14 (2.1.14-12) has been released. This version makes no changes to the * CASTOR 2.1.14 upgrade for Tier 1.
    1 KB (208 words) - 13:02, 25 April 2014
  • ... continued through until the night of Thursday/Friday (4/5 December). With Castor very full there were very few disk servers available with any space on to r * CMS Castor headnodes were updated to SL6 on Tuesday 9th December and the Atlas ones th
    14 KB (1,492 words) - 13:08, 10 December 2014
  • * There have been problems with "CMSDisk" in Castor caused by it becoming very full. * The load related problems reported for the CMS Castor instance have not been seen for a few weeks. However, work is underway to t
    14 KB (1,557 words) - 13:24, 30 April 2014
  • * CASTOR 2.1.14 upgrade for Tier 1. Possible date for first stage of intervention (N * CASTOR 2.1.14 for Tier 1
    1 KB (161 words) - 15:56, 2 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * CASTOR 2.1.14 upgrade for Tier 1. Possible date for first stage of intervention (N
    2 KB (245 words) - 10:07, 13 May 2014
  • * In process of scheduling Castor 2.1.14 upgrade. * In process of scheduling Castor 2.1.14 upgrade. Proposed date for Nameserver upgrade: Wednesday 28th May.
    37 KB (4,615 words) - 08:50, 12 May 2014
  • * Provisional dates for the Castor 2.1.14 upgrade delayed to: Nameserver: Tuesday 10th June; Stagers to follow * Castor:
    13 KB (1,393 words) - 10:46, 14 May 2014
  • * Castor disk server were physically moved to make room for new procurements. This * We have had two Castor disk server crashes since the move gdss776 and gdss783 both lhcbDst disk se
    17 KB (1,612 words) - 11:29, 27 February 2019
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * CASTOR 2.1.14 upgrade for Tier 1. First stage of intervention (NS upgrade) is book
    2 KB (294 words) - 15:03, 19 May 2014
  • * LHCb: Incremental stripping campaign finished, all productions closed. CASTOR->EOS migration of LHCb user data finished. * In process of scheduling Castor 2.1.14 upgrade. (Now likely to be 10th June).
    46 KB (6,091 words) - 11:47, 19 May 2014
  • * The checksum checker found a corrupt LHCb file in Castor which has been declared lost. * Provisional dates for the Castor 2.1.14 upgrade: Nameserver: Tuesday 10th June; Stagers: CMS- Tue 17th June;
    14 KB (1,427 words) - 13:22, 21 May 2014
  • ...rs were the OCF'12 batch, which are in AtlasDataDisk, CMSDisk and LHCbDst. Castor recovered OK from this. The network change itself was carried out to comple ...May) two network switches that provide connectivity to the 2012 batches of Castor disk servers were moved to the mesh network.
    14 KB (1,452 words) - 12:18, 29 May 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local ...n our issues was reported/fixed. These servers are now in acceptance test. Castor team will only deploy V13 servers to non prod until further notice.
    2 KB (290 words) - 10:34, 30 May 2014

View (previous 50 | next 50) (20 | 50 | 100 | 250 | 500)