Search results

Jump to: navigation, search

Page title matches

  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Facilities CASTOR patched for kernel/errata (not Ghost)
    3 KB (502 words) - 14:28, 30 January 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * storageD retrieval from castor problems - investigation ongoing
    3 KB (429 words) - 15:30, 16 February 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * storageD retrieval from castor problems - investigation ongoing
    3 KB (491 words) - 09:48, 25 February 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] ...while draining (had difficulties previously) - now back and draining final castor partition
    3 KB (550 words) - 14:59, 9 March 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] ...it current version - never seen by RAL. CERN have a workaround in place on castor 2.1.15
    4 KB (574 words) - 12:14, 13 March 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * storageD retrieval from castor problems - investigation ongoing
    3 KB (537 words) - 17:53, 20 March 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Upgrade of CASTOR DBs to Oracle version DB 11.2.04 complete.
    3 KB (514 words) - 16:13, 1 May 2015
  • ==RAL-LCG2 Incident 20150408 network intervention preceding Castor upgrade== ... to resolve (and was not finally cleared until the following morning.) The Castor update had to be backed out and there were some problems in doing this.
    15 KB (2,406 words) - 16:43, 17 August 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * testing CASTOR rebalancer (new version in 2.1.14-15)
    3 KB (520 words) - 09:25, 1 May 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Tier 1 CASTOR 2.1.14-15 upgrade completed successfully
    3 KB (542 words) - 13:38, 20 April 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Testing CASTOR rebalancer on preproduction, and developing associated tools.
    4 KB (566 words) - 14:12, 15 May 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] ...e are examining options for running this in a slow-and-steady fashion with CASTOR up.
    4 KB (657 words) - 12:54, 22 May 2015
  • ... || Medium || || Andrew S || Discuss strategy for funding LSF in 2012 with CASTOR team || No longer necessary, since an LSF license has been purchased for th | 20120321-01 || Medium || ALICE || Lee, Shaun || Find out about the load on CASTOR from Japan || Closed. No longer relevant. || 2012-04-25
    4 KB (566 words) - 09:26, 20 May 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Mice (Castor Gen) will be operating overnight and able to call pri oncall
    5 KB (830 words) - 15:06, 29 May 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * CASTOR rebalancing from Monday
    6 KB (919 words) - 14:23, 5 June 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * CASTOR Gen rebalancing underway
    5 KB (750 words) - 11:09, 12 June 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Facilities CASTOR - change to time to write to tape from 30 mins to 5 mins now
    5 KB (799 words) - 09:33, 22 June 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Juno (CASTOR Facilities) Oracle update to 11.2.0.4
    6 KB (974 words) - 16:10, 3 July 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Change to improve file open times on CASTOR (central db, subrequest todo procedure) - has now been deployed to LHCb and
    6 KB (938 words) - 12:34, 1 July 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Proposed CASTOR face to face W/C Oct 5th or 12th
    6 KB (1,039 words) - 08:28, 14 July 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Proposed CASTOR face to face W/C Oct 5th or 12th
    3 KB (509 words) - 11:07, 24 July 2015
  • [https://www.gridpp.ac.uk/wiki/RAL_Tier1_weekly_operations_castor List of CASTOR meetings] * Proposed CASTOR face to face W/C Oct 5th or 12th
    3 KB (535 words) - 15:22, 27 July 2015
  • ** all VOs / all castor disks * Upgrade CASTOR disk servers to SL6
    3 KB (488 words) - 11:29, 21 August 2015
  • * Proposed CASTOR face to face W/C Oct 5th or 12th * Upgrade CASTOR disk servers to SL6
    3 KB (569 words) - 15:00, 3 August 2015
  • * Upgrade CASTOR disk servers to SL6 * Proposed CASTOR face to face W/C Oct 5th or 12th
    3 KB (539 words) - 14:09, 7 August 2015
  • * Upgrade CASTOR disk servers to SL6 * Proposed CASTOR face to face W/C Oct 5th or 12th
    2 KB (336 words) - 13:26, 14 August 2015
  • ** all VOs / all castor disks * Upgrade CASTOR disk servers to SL6
    4 KB (596 words) - 10:39, 28 August 2015
  • ** all VOs / all castor disks * Upgrade CASTOR disk servers to SL6
    4 KB (617 words) - 10:35, 4 September 2015
  • ** all VOs / all castor disks * Upgrade CASTOR disk servers to SL6
    4 KB (651 words) - 10:23, 18 September 2015
  • ...d from castor and back to fabric to gather spares cv11 spec – no further castor action. ** all VOs / all castor disks
    5 KB (886 words) - 10:45, 2 October 2015
  • * CASTOR 2.1.15 * Proposed CASTOR face to face W/C Oct 5th or 12th
    4 KB (637 words) - 12:47, 9 October 2015
  • * The checksum issue/tickets still present. These are thought to be due to a CASTOR bug fixed in 2.1.15. * CASTOR 2.1.15
    2 KB (401 words) - 13:06, 16 October 2015
  • * RA, SdW, GTF and AS have been to CERN for a CASTOR face-to-face meeting * Disk servers name lookup issue (CV11's) - more system than CASTOR. Currently holding CV11 upgrades until understood.
    3 KB (478 words) - 16:01, 16 November 2015
  • * CASTOR 2.1.15 == Issues to bring up at CASTOR F2F ==
    2 KB (345 words) - 15:23, 27 October 2015
  • Castor ops 23/10/15 11-2-04 client updates – 2.1.15 prerequisite … has to go on castor headnodes
    795 B (124 words) - 09:53, 9 November 2015
  • * RA, SdW, GTF and AS have been to CERN for a CASTOR face-to-face meeting * CASTOR 2.1.15
    2 KB (374 words) - 17:37, 6 November 2015
  • * LHCb batch jobs failing to copy results into castor - changes made seems to have improved the situation but not fix (Raja). Inc * RA, SdW, GTF and AS have been to CERN for a CASTOR face-to-face meeting
    5 KB (850 words) - 11:33, 27 November 2015
  • • GS/RA to revisit the CASTOR decommissioning process in light of the production team updates to their de • JJ – Glue 2 for CASTOR, something to do with publishing information??? Not sure there was a speci
    2 KB (306 words) - 16:22, 24 November 2015
  • * LHCb batch jobs failing to copy results into castor - changes made seems to have improved the situation but not fix (Raja). Inc * RA, SdW, GTF and AS have been to CERN for a CASTOR face-to-face meeting
    6 KB (1,018 words) - 12:25, 4 December 2015
  • * LHCb batch jobs failing to copy results into castor - changes made seems to have improved the situation but not fix (Raja). Inc * RA, SdW, GTF and AS have been to CERN for a CASTOR face-to-face meeting
    7 KB (1,141 words) - 15:00, 11 December 2015
  • * Gfal-cat command failing for atlas reading of nsdumps form castor: https://ggus.eu/index.php?mode=ticket_info&ticket_id=117846. Developers lo * LHCb batch jobs failing to copy results into castor - changes made seems to have improved the situation but not fix (Raja). Inc
    7 KB (1,085 words) - 16:11, 18 January 2016
  • #REDIRECT [[RAL Tier1 weekly operations castor 18/01/2019]]
    59 B (6 words) - 12:58, 8 February 2019
  • 2. SL5 elimination from CASTOR functional test boxes and tape verification server 3. CASTOR stress test improvement
    2 KB (313 words) - 08:05, 14 July 2017
  • ...S tape no longer an issue, following disk server failure and test files in castor cache * Gfal-cat command failing for atlas reading of nsdumps form castor:
    7 KB (1,203 words) - 17:47, 23 January 2016
  • * Gfal-cat command failing for atlas reading of nsdumps form castor: https://ggus.eu/index.php?mode=ticket_info&ticket_id=117846. Developers lo * LHCb batch jobs failing to copy results into castor - changes made seems to have improved the situation but not fix (Raja). Inc
    4 KB (583 words) - 17:45, 29 January 2016
  • * Gfal-cat command failing for atlas reading of nsdumps form castor: https://ggus.eu/index.php?mode=ticket_info&ticket_id=117846. Developers lo * LHCb batch jobs failing to copy results into castor - changes made seems to have improved the situation but not fix (Raja). Inc
    4 KB (565 words) - 10:29, 12 February 2016
  • * castor 2.1.16 coming soon - SRM integration into CASTOR code base * Gfal-cat command failing for atlas reading of nsdumps form castor: https://ggus.eu/index.php?mode=ticket_info&ticket_id=117846. Developers lo
    4 KB (640 words) - 13:34, 17 February 2016
  • * castor 2.1.15 update * castor 2.1.16 coming soon - SRM integration into CASTOR code base
    4 KB (703 words) - 14:52, 19 February 2016
  • * glibc updates applied, all CASTOR systems rebooted. initial issues with head nodes, 7 failed to reboot due t * CASTOR facilities patching scheduled for next week - detailed schedule to be agree
    3 KB (557 words) - 11:32, 26 February 2016
  • * glibc updates applied, all CASTOR systems rebooted. initial issues with head nodes, 7 failed to reboot due t * CASTOR facilities patching scheduled for next week - detailed schedule to be agree
    4 KB (643 words) - 11:40, 4 March 2016

Page text matches

  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local ...n our issues was reported/fixed. These servers are now in acceptance test. Castor team will only deploy V13 servers to non prod until further notice.
    2 KB (276 words) - 13:46, 28 May 2014
  • * Castor 2.1.14 upgrade. Firming update of 10th June for nameserver with stagers CMS * Castor Nameserver 2.1.14 update on 10th June announced in GOC DB. Stager dates to
    41 KB (5,148 words) - 09:38, 2 June 2014
  • ...w but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when these (failed) | All Castor (All SRM endpoints)
    13 KB (1,425 words) - 11:34, 2 December 2015
  • * Castor 2.1.14 upgrade. Firming update of 10th June for nameserver with stagers CMS * Castor Nameserver 2.1.14 update on 10th June announced in GOC DB. Stager dates to
    41 KB (5,148 words) - 07:10, 9 June 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local ...have been upgraded need further configurations (James) before releasing to castor team. V13 machines in production should have firmware update, best approach
    2 KB (267 words) - 15:00, 9 June 2014
  • ...k - D1T0) failed to restart after kernel/errata updates applied during the Castor update on 10th June. It was returned to production just befor this meeting ...e firmware in some network switches and apply kernel/errata updates to the Castor disk servers.
    15 KB (1,592 words) - 12:26, 11 June 2014
  • * A Castor namesever box has been set-up to enable queries against Castor metadata to be made without affecting the throughput of production work. * A system has been set-up to provide Atlas with Castor information that is not supplied by the SRM.
    42 KB (5,185 words) - 11:36, 2 March 2015
  • * The CMS Castor stager update to version 2.1.14-13 took place yesterday (Tuesday) as planne * Yesterday (17th June) the CMS Castor stager was updated to version 2.1.14-13.
    12 KB (1,236 words) - 13:13, 18 June 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * A partitioning alignment issue (3rd CASTOR partition) has been identified, proposal is to resolve this for new machine
    3 KB (412 words) - 13:13, 13 June 2014
  • * Castor and batch services currently down for Castor Namserver Upgrade (to version 2.1.14). If all goes well plan to upgrade sta * Castor Nameserver 2.1.14-13 updated successfully yesterday (10th June). Stager dat
    39 KB (4,952 words) - 19:40, 13 June 2014
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * A partitioning alignment issue (3rd CASTOR partition) has been identified, proposal is to resolve this for new machine
    3 KB (423 words) - 12:45, 20 June 2014
  • * Castor Namserver Upgrade (to version 2.1.14) successful last week. CMS Stager upda * Castor CMS Stager 2.1.14-13 updated yesterday (17th June) although there were some
    37 KB (4,591 words) - 09:54, 23 June 2014
  • .... CERN provided a solution for SL5.9. We need to consider SL6 upgrade post CASTOR 2.1.14-13 upgrades. ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local
    2 KB (366 words) - 16:00, 27 June 2014
  • * Castor Stager Upgrade was carried out last week. 'GEN' stager update this morning. * Castor GEN Stager 2.1.14-13 updated yesterday (24th June). Some problems with xroo
    35 KB (4,220 words) - 09:11, 30 June 2014
  • ... SRM test failures during last week which were traced to load on the Atlas Castor system during searches for dark data. Less intrusive ways of carrying out t * We continue to monitor closely the performance of xroot access to CMS Castor following the upgrade on the 17th June. Performance is generally good altho
    12 KB (1,212 words) - 10:15, 2 July 2014
  • |Castor |<span style="color:red">Test CASTOR WebDAV developed. Not production ready.</span>
    5 KB (692 words) - 08:28, 29 April 2016
  • ...een identified that may have contributed to the deletion problems on their CASTOR instance. However, the key test of running the ATLAS deletion scripts local * CMS db locking issue 3/7/14 early hours, resulted in lost CMS test file, castor current shows diskcopy_failed in stager logs. Proposal is to identify if th
    2 KB (362 words) - 15:10, 12 August 2014
  • * There were problems with the SRM (not Castor) for the GEN instance on Thursday and Friday of last week (3/4 July). It wa * We are still investigating xroot access to CMS Castor following the upgrade on the 17th June.
    11 KB (1,140 words) - 13:17, 9 July 2014
  • ... was carried out successfully last Thursday. The final update is the Atlas Castor instance stager which is planned for the Atlas - Tue 1st July. The information publishing police have pointed out that the RAL Castor isn't publishing a sane version. Brian suspects an rogue ":" causing the pr
    43 KB (5,584 words) - 12:52, 7 July 2014
  • ...ek with the task of investigating visualisation and querying solutions for CASTOR use. * CASTOR 2.1.14-13 upgrade for Repack - planned for Tuesday or Wednesday this week.
    2 KB (308 words) - 13:48, 14 July 2014
  • * There have been recurring problems with the SRM processes for the castor GEN instance crashing since Friday (11th). This appears to be linked to a p * We are still investigating xroot access to CMS Castor following the upgrade on the 17th June.
    13 KB (1,422 words) - 13:41, 16 July 2014
  • * Yesterday (Tuesday) there was an outage of part of Castor as some racks containing disk servers (the 2011 batches) were shutdown whil ...a single file was reportd lost to CMS. This file had been picked up by the Castor checksum checker.
    12 KB (1,241 words) - 14:07, 25 February 2015
  • ...e was carried out successfully last Tuesday (8th July). This completes the Castor 2.1.14 upgrades apart from some internal changes (E.g. the 'repack' instanc * All Castor instances have been updated to version 2.1.14-13. Some issues remain and ar
    39 KB (4,936 words) - 09:03, 21 July 2014
  • ...on with the task of investigating visualisation and querying solutions for CASTOR use. * Incorrect service classes in castor.conf on disk servers, Atlas issues resolved by Rob. Other non production is
    2 KB (318 words) - 09:07, 21 July 2014
  • * All Castor instances have been upgraded to version 2.1.14. The upgrade is complete apa
    39 KB (4,833 words) - 10:09, 28 July 2014
  • * The recurring problems with the SRM processes for the castor GEN instance crashing has been solved. The problem started on Friday 11th J * On Thursday (17th) the Castor disk cache for AtlasTape filled up. This was traced to the garbage collecto
    13 KB (1,382 words) - 13:24, 23 July 2014
  • ...on with the task of investigating visualisation and querying solutions for CASTOR use. * Facilities castor error
    2 KB (262 words) - 15:46, 25 July 2014
  • ...und (code to trap and fixup the mal-formed filename) was inserted into the Castor GEN instance. * There have been some problems with the Atlas SRM/Castor instance in the last couple of days that are under investigation.
    13 KB (1,402 words) - 13:04, 30 July 2014
  • * We have received word that a 2.1.14-15 version of CASTOR may be forthcoming. * Kashyap's Elasticsearch query script has been rolled out to CASTOR headnodes. Users are encouraged to test it and report any bugs.
    2 KB (279 words) - 16:48, 1 August 2014
  • * All Castor instances have been upgraded to version 2.1.14. The upgrade is complete inc ..._id=106655 GGUS 106655]. Cross-contamination of information due to the GEN-CASTOR SRMs sharing a database, and some VOs sharing service classes. In progress.
    42 KB (5,191 words) - 14:37, 2 August 2014
  • ...epancies were found in some of the Castor database tables and columns. The Castor team are considering options with regard to fixing these. The issue has no * There are problems with disk server draining for Atlas in Castor 2.1.4. This is under investigation.
    12 KB (1,257 words) - 12:25, 6 August 2014
  • * Kashyap's Elasticsearch query script has been rolled out to CASTOR headnodes. Users are encouraged to test it and report any bugs. ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (300 words) - 11:01, 15 August 2014
  • ...t triggered by attempts to use the disk server re-balancing feature now in Castor. ...oblems with disk server draining in Castor (and specifically for Atlas) in Castor 2.1.4. This is under investigation.
    11 KB (1,152 words) - 10:26, 13 August 2014
  • * Kashyap's Elasticsearch query script has been rolled out to CASTOR headnodes. Users are encouraged to test it and report any bugs. ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (313 words) - 10:40, 15 August 2014
  • ...not part of RAL-LCG2. RAL APEL accounting of course includes both Echo and CASTOR jobs.
    33 KB (5,297 words) - 10:13, 15 November 2017
  • * Ongoing investigations into problems with draining disk servers in Castor 2.1.14.
    41 KB (5,255 words) - 08:52, 18 August 2014
  • | Decommission RAL's Castor Disk endpoint for ALICE
    14 KB (1,558 words) - 11:37, 12 December 2019
  • * passive draining produces file duplication - fixed in castor 2.1.14-14 * SL6 castor stalled due to resource limitations
    2 KB (312 words) - 10:55, 22 August 2014
  • * Ongoing investigations into problems with draining disk servers in Castor 2.1.14.
    42 KB (5,304 words) - 10:39, 25 August 2014
  • * Port opened up to allow external Castor WebDav access (requested by LHCb). * Castor:
    12 KB (1,238 words) - 08:50, 11 November 2014
  • * Following some problems with disk server draining in Castor 2.1.14 a modified procedure has been tested on one disk server and been suc ...epancies were found in some of the Castor database tables and columns. The Castor team are considering options with regard to fixing these. The issue has no
    14 KB (1,421 words) - 13:42, 27 August 2014
  • * passive draining produces file duplication - fixed in castor 2.1.14-14 * SL6 castor stalled due to resource limitations & A/L
    2 KB (338 words) - 15:06, 29 August 2014
  • * We have resumed draining disk servers after the Castor 2.1.14 upgrade. There were some problems with this that are now resolved.
    42 KB (5,358 words) - 10:48, 1 September 2014
  • ...t took some time to fix. Not all services were affected - the site (except Castor) was declared down for around 6 hours on Saturday. ...epancies were found in some of the Castor database tables and columns. The Castor team are considering options with regard to fixing these. The issue has no
    13 KB (1,342 words) - 11:16, 3 September 2014
  • ...ver. Storage (Castor) services were unaffected. The Tier1 site (apart from Castor storage) was declared down in the GOC DB for 5.5 hours from 09:00 on Saturd | Start of unscheduled 'outage' in the GCDB for the whole Tier1 apart from Castor.
    10 KB (1,628 words) - 09:31, 14 November 2014
  • * Juan to patch castor dbs beginning of Nov PSU patches – standard change ...inate a number of excess tables and other entities left over from previous CASTOR versions. This will be change-controlled in the near future.
    2 KB (279 words) - 12:37, 5 September 2014
  • BackendType = castor Path = /castor/cern.ch/grid/dteam/
    8 KB (1,180 words) - 12:27, 1 February 2016
  • * There were problems with the Atlas Castor instance over the weekend which was linked to the draining of a disk server * Oracle patches (PSU) applied to the standby Neptune database (Castor Atlas & GEN) yesterday (Tuesday 30th Sep).
    13 KB (1,429 words) - 10:06, 8 October 2014
  • ==RAL Tier1 Incident 20130628 Atlas Castor Outage======Description:=== The ATLAS CASTOR instance encountered a problem where large numbers of invalid subrequests g
    10 KB (1,594 words) - 10:56, 1 May 2015
  • * On Saturday (13th Sep) there was a problem with the Atlas Castor instance that persisted into the beginning of Sunday. A number of measures ** Apply latest Oracle patches (PSU) to the production database systems (Castor, LFC).
    12 KB (1,195 words) - 14:07, 17 September 2014

View (previous 50 | next 50) (20 | 50 | 100 | 250 | 500)