Search results

Jump to: navigation, search

Page title matches

  • * [https://twiki.cern.ch/twiki/bin/view/LCG/BatchSystemComparison Batch System Comparison Table] == Sites batch system status ==
    11 KB (1,661 words) - 12:47, 21 June 2019
  • ...ent/uploads/2008/12/batchsystemconfig-nov08.pdf Configuration of the batch system at November 2008] [[Category:Batch Systems]]
    325 B (40 words) - 12:23, 18 March 2014

Page text matches

  • * [[New Information System]] * [[:Category:Batch_Systems|Batch Systems]]
    8 KB (1,130 words) - 17:31, 17 April 2024
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    14 KB (1,386 words) - 09:37, 19 June 2019
  • * Technical Meeting last week about the New JSON based Information System: https://indico.cern.ch/event/821105/ ...on Thursday. All SAM tests failed until this was fixed the next morning. Batch farm also did not start any new jobs during this time. We used this accide
    41 KB (5,018 words) - 14:09, 30 October 2019
  • ...y LHCb of a low but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when * GDSS620 (GenTape - D0T1) Reported a read-only file system yesterday (Tuesday) morning and was taken out of production. Two T2K files
    13 KB (1,356 words) - 09:59, 16 March 2016
  • ...y LHCb of a low but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when ... hours of yesterday morning (8th Dec). This also reported a read-only file system.
    13 KB (1,411 words) - 08:55, 10 December 2015
  • * There was a problem on Thurdsay with the batch farm caused by a particular (biomed) user running very large jobs. This led | Outage of tape system for update of library controller.
    13 KB (1,357 words) - 12:47, 9 May 2014
  • ...n F ticketed the CA concerning a possible problem with the ticket reminder system. JK has responded with a reply, and asked that similar tickets in the futur LHCB having cvmfs trouble at IC, which was likely caused by a batch of naughty CMS jobs ruining it for everyone else. LHCB re-enabled IC to see
    184 KB (30,332 words) - 17:18, 16 December 2014
  • ...is week there is a [http://indico.cern.ch/event/272785/ pre-GDB meeting on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard ...re will be a [https://www.gridpp.ac.uk/wiki/Batch_system_status pre-GDB on batch systems] next Tuesday, and a [https://indico.cern.ch/event/272619/timetable
    42 KB (5,176 words) - 11:12, 17 March 2014
  • * CERN batch capacity migrated to SLC6 was at 65% last week. * The APEL accounting system has been undergoing database maintenance to improve performance and reliabi
    46 KB (5,930 words) - 18:40, 28 April 2014
  • * [https://twiki.cern.ch/twiki/bin/view/LCG/BatchSystemComparison Batch System Comparison Table] == Sites batch system status ==
    11 KB (1,661 words) - 12:47, 21 June 2019
  • ...ent/uploads/2008/12/batchsystemconfig-nov08.pdf Configuration of the batch system at November 2008] [[Category:Batch Systems]]
    325 B (40 words) - 12:23, 18 March 2014
  • ...ve released a new [https://ggus.eu/pages/didyouknow.php page on using] the system. * Investigations are ongoing into problems at batch job set-up.
    43 KB (5,533 words) - 08:50, 18 August 2014
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    17 KB (1,646 words) - 09:31, 11 July 2018
  • ...y LHCb of a low but persistent rate of failure when copying the results of batch jobs to Castor. There is also a further problem that sometimes occurs when | Outage of Castor Storage System for patching
    14 KB (1,476 words) - 14:02, 4 January 2017
  • * GDSS649 (LHCbUser - D1T0) failed on Saturday 16th May when the system hung up. Following tests a faulty drive was replaced. It was returned to se ...ew configuration of a batch of new worker nodes was reported. Most of this batch have now been re-set to have the usual worker node configuration.
    13 KB (1,442 words) - 11:25, 20 May 2015
  • ...r the hypervisor hosting this virtual machine rebooted and this particular system was not configured to re-start. This was resolved by the primary on-call. ...ed during the change. The batch system was also reconfigured such that new batch jobs world not startt during this period. The change was successful. There
    14 KB (1,553 words) - 11:36, 19 March 2014
  • * As reported last week the CMSTape system has been busy - and throughput was compromised by two out of its five disk .... Following the first rebuild a another problematic disk was found and the system was returned to service on Monday (18th Jan) once that too had been resolve
    13 KB (1,364 words) - 12:54, 20 January 2016
  • |MAGIC is a system of two imaging atmospheric Cherenkov telescopes (or IACTs). MAGIC-I started * high priority in the batch system for the atlassgm user;
    78 KB (13,056 words) - 13:44, 23 April 2024
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    18 KB (1,971 words) - 14:03, 26 July 2017
  • * Last week there was a [http://indico.cern.ch/event/272785/ pre-GDB on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard ...is week there is a [http://indico.cern.ch/event/272785/ pre-GDB meeting on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard
    48 KB (6,293 words) - 07:35, 31 March 2014
  • * Last week there was a [http://indico.cern.ch/event/272785/ pre-GDB on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard ...is week there is a [http://indico.cern.ch/event/272785/ pre-GDB meeting on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard
    48 KB (6,293 words) - 07:36, 31 March 2014
  • * Last week there was a [http://indico.cern.ch/event/272785/ pre-GDB on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard ...reviewed are capable of supporting multicore jobs however a tuning of each system is required to be able to absorb them (draining/reservation of resources) w
    45 KB (5,701 words) - 09:21, 7 April 2014
  • * Last week there was a [http://indico.cern.ch/event/272785/ pre-GDB on batch systems] and a [http://indico.cern.ch/event/272619/other-view?view=standard * CERN batch capacity migrated to SLC6 was at 65% last week.
    52 KB (6,980 words) - 08:19, 15 April 2014
  • ...ed. Multiple disk failures were being reported by the disk controller. The system was returned to production yesterday evening (8th April) and is being drain * The EMI3 Argus server is now in use everywehere in the batch farm.
    14 KB (1,599 words) - 11:33, 14 April 2014
  • * CERN batch capacity migrated to SLC6 was at 65% last week. * The APEL accounting system has been undergoing database maintenance to improve performance and reliabi
    45 KB (5,796 words) - 22:44, 21 April 2014
  • ... CMS deleting files to make space and a reduction in the number of running batch jobs relieved thd strain. ... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA datase system, although this was also changed to update from CERN using Oracle Golden Gat
    14 KB (1,492 words) - 13:08, 10 December 2014
  • ...bers of jobs (from T2K) submitted to the batch system by the WMSs. A batch system parameter (max number of gridftp connections on ARC CEs) has been increased | System be decommissioned. (Replaced my myproxy.gridpp.rl.ac.uk).
    14 KB (1,557 words) - 13:24, 30 April 2014
  • * CERN batch capacity migrated to SLC6 was at 65% last week. * The APEL accounting system has been undergoing database maintenance to improve performance and reliabi
    41 KB (5,106 words) - 19:52, 5 May 2014
  • * Testing CVMFS Client version 2.1.19 ongoing. This is now rolled out to one batch of worker nodes. So far so good. | Outage of tape system for update of tape library controller. (Postponed from 13th May).
    13 KB (1,393 words) - 10:46, 14 May 2014
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    17 KB (1,612 words) - 11:29, 27 February 2019
  • ...onday covered some site reports and OS related updates. Tuesday's focus is batch systems. Wednesday covers IPv6, security and benchmarking. Thursday storage ...naged services from Quattor to a new Puppet based Configuration Management system.
    41 KB (5,148 words) - 09:38, 2 June 2014
  • ...onday covered some site reports and OS related updates. Tuesday's focus is batch systems. Wednesday covers IPv6, security and benchmarking. Thursday storage ...naged services from Quattor to a new Puppet based Configuration Management system.
    41 KB (5,148 words) - 07:10, 9 June 2014
  • * Today (11th June) a new tape controller system (ACSLS) is being installed. There have been some problems with the new serv | Castor (all SRM endpoints) and batch (all CEs)
    15 KB (1,592 words) - 12:26, 11 June 2014
  • 0) Find and read the "ARC Computing Element System Administrator Guide". <br> 3) Ensure the machine can submit to the batch system & has all of the users. <br>
    11 KB (1,578 words) - 15:50, 12 June 2014
  • ...ieve this by using the (Condor) Submit module of a glideinWMS as the batch system and then channeling the jobs via the glideinWMS to the gridpp cloud. <br>
    925 B (154 words) - 11:11, 23 August 2019
  • ...r their resources into a ‘pool’ via the [https://e-grant.egi.eu eGrant system]. [https://wiki.egi.eu/wiki/Resource_Allocation_Process More information] i * Castor and batch services currently down for Castor Namserver Upgrade (to version 2.1.14). I
    39 KB (4,952 words) - 19:40, 13 June 2014
  • ...n there is contention between other processes for physical memory will the system force physical memory into swap and push the physical memory used towards t
    1 KB (241 words) - 10:28, 11 February 2015
  • ...] needs updating and a consensus! Could the SEs implement some reservation system internally? Is there merit in the suggestion to make use of [https://www.gr * KeyDocs are going to be reviewed (in next 4 weeks) as the system is not working (or not adding anything) in some areas.
    43 KB (5,584 words) - 12:52, 7 July 2014
  • '''UKI-NORTHGRID-MAN-HEP''': Multicore and passing parameters to the batch system testing requested by the experiments through the WLCG Task Force Alessandra
    8 KB (1,155 words) - 11:09, 13 March 2015
  • ...egi-trustanchors.repo Finally, for historical reasons related to our build system, we also installed these two repos from the glite 3.2 instructions - jpacka ...wever you do it, make a munge key using /usr/sbin/create-munge-key on some system that has munge installed on it (this one?) and use the resulting key on all
    15 KB (2,429 words) - 10:18, 31 July 2015
  • | Email everyone on how to hack the publishing system to avoid publishing incorrect GlueSubClusterWNTmpDir. | Plan out the future of CE/Batch System integration. Torque/maui are not supported by EGI. Layout an agenda with pr
    33 KB (5,297 words) - 10:13, 15 November 2017
  • ...lable, called HTCondor (or CONDOR for short). We also decided to front the system with an ARC CE. You'll need a copy of the ARC System Admin Manual.
    121 KB (17,569 words) - 08:26, 28 November 2019
  • ...or allocation. It is a brokering service only. There is one request in the system for cloud resources. * News: CERN-IT to terminate the SLC5-based interactive and batch services (lxplus5 and lxbatch5) soon. The current target date is 30 Septemb
    42 KB (5,358 words) - 10:48, 1 September 2014
  • ... jobs at CCIN2P3 and of the method to passing job requirement arguments to batch systems via CE. ([https://indico.cern.ch/event/339461/ Agenda]) * OSG following up on how to discover HTCondor CEs in the information system.
    46 KB (6,062 words) - 10:07, 15 September 2014
  • ...ring Saturday evening. It was restarted and tested but no fault found. The system was returned to service yesterday (30th Sep). * One batch of worker nodes (64 machines) have had Linux cgroups configured to enforce
    13 KB (1,429 words) - 10:06, 8 October 2014
  • ==RAL Tier1 Incident 20130626 Failure of RAL CVMFS Stratum1 Triggered Batch Farm Problems=====Description:=== ...s over to use other replicas. However this did not happen across the Tier1 batch farm where many nodes were running a version of the CVMFS client in which t
    12 KB (1,968 words) - 15:13, 16 September 2014
  • ...ordinating/publicising local site-admin tools (Nagios plugins, local batch system dashboards)
    906 B (116 words) - 08:35, 5 June 2018
  • ...of the systems affected was the argus server and this caused a problem for batch job submissions for an hour or so. * The Atlas Frontier service will be switched to use the new database system that updates from CERN using Oracle "GoldenGate" on 24th Sep.
    12 KB (1,195 words) - 14:07, 17 September 2014
  • ... jobs at CCIN2P3 and of the method to passing job requirement arguments to batch systems via CE. ([https://indico.cern.ch/event/339461/ Agenda]) * OSG following up on how to discover HTCondor CEs in the information system.
    48 KB (6,422 words) - 08:45, 23 September 2014
  • *** Durham: Batch system upgrade led to one outage and a University wide internet connection loss le * Ongoing tests ongoing with some batch jobs for the LHC VOs running in SL6 containers on worker nodes running SL7.
    42 KB (5,079 words) - 18:37, 19 March 2017
  • ...h hosts the Atlas and GEN SRM databases) was moved to the standby database system. This required an outage of the Castor Atlas and GEN instances which lasted ...day morning (5th Oct). It was restarted and tested but no fault found. The system was returned to service this morning (8th Oct).
    15 KB (1,740 words) - 10:50, 15 October 2014
  • ...completed in October 2008. The first is to provide information about batch system memory limits. The second is to give an update on networking issues that ca [[Background to batch system memory request for details:]]
    17 KB (2,669 words) - 11:14, 1 March 2016
  • ...ol for use with both local batch systems and the DIRAC workload management system. It's maintained by Ulrik Egede (ulrik<AT>monash.edu) - please email if you ...want to use it to submit jobs to the grid rather than just for local batch system submission), there are a few steps you need to go through:
    15 KB (2,621 words) - 14:40, 27 May 2020
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    16 KB (1,535 words) - 13:37, 20 June 2018
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    16 KB (1,476 words) - 07:41, 16 May 2018
  • ...y maximise throughput. Experiments show that, in order to fully utilize a system, it is often necessary to choose a number of slots that is higher than th Sites have to transmit (via the BDII and the accounting system) a couple more things; the power of the site and the amount of work done.
    8 KB (1,284 words) - 14:03, 2 October 2017
  • ...tlasDataDisk - D1T0) had failed for the third time on around a month. This system has been completely drained and is undergoing further investigations. ...regular "PSU" patches will be applied to the Pluto Castor standby database system on Monday (27th Oct) and to the Pluto production database on Wednesday (29t
    14 KB (1,569 words) - 13:13, 29 October 2014
  • * Machine/Job features: Concluded on a single architecture for cloud and batch implementations. * The OGMA database system (Atlas3D/Frontier) has been updated and switched to using Oracle GoldenGate
    40 KB (4,976 words) - 10:25, 27 October 2014
  • * Machine/Job features: Concluded on a single architecture for cloud and batch implementations. * The OGMA database system (Atlas3D/Frontier) has been updated and switched to using Oracle GoldenGate
    42 KB (5,228 words) - 10:37, 4 November 2014
  • * Machine/Job features: Concluded on a single architecture for cloud and batch implementations. LHCB having cvmfs trouble at IC, which was likely caused by a batch of naughty CMS jobs ruining it for everyone else. LHCB re-enabled IC to see
    48 KB (6,138 words) - 09:19, 10 November 2014
  • * Multicore: Passing parameters to batch system discussion started. Limited tests. ATLAS 40% resources now MC. Still 37 sit
    39 KB (4,698 words) - 18:46, 16 November 2014
  • ...caster test cluster runs using torque, interfacing with a DPM SE, so other batch/storage combinations are not as well tested. ''This assumes that the workernode has been setup to work within the batch system, and the users and groups have been set up. It would technically be possibl
    25 KB (4,174 words) - 09:57, 23 July 2015
  • The tarball versions listed may look convoluted, but there is a system to them! The first part denotes what middleware was used to build the tarba ... vomses, CA and CRLs. For a WN you will have to set up the users and batch system yourself.
    11 KB (1,832 words) - 10:02, 23 January 2018
  • * Multicore: Passing parameters to batch systems [https://indico.cern.ch/event/272779/session/0/contribution/8/mater ...n F ticketed the CA concerning a possible problem with the ticket reminder system. JK has responded with a reply, and asked that similar tickets in the futur
    50 KB (6,536 words) - 00:08, 7 December 2014
  • ...on (but as he also notes - what's getting loaded and causing the problem - Batch, CE or WNs?). Kashif reckons the argus server, and suggests a handy glexec Sno+ spotted malloc errors at Lancaster. The problems seemed to survive one batch of fixes, but I asked again if they still see problems after running a good
    117 KB (18,736 words) - 11:05, 4 January 2016
  • * Following a restriction on numbers of CMS batch jobs imposed during problems a week or so ago the CMS jobs limits on the fa ... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA datase system, although this was also changed to update from CERN using Oracle Golden Gat
    14 KB (1,504 words) - 14:50, 17 December 2014
  • ... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA datase system, although this was also changed to update from CERN using Oracle Golden Gat | Due to Kernel patching of EGI ADV 20141217, the RAL tier1 batch farm worker nodes will need to be rebooted.
    17 KB (1,780 words) - 12:56, 7 January 2015
  • WMS - Workload Managment System. The central part of the DIRAC system
    2 KB (306 words) - 12:29, 12 March 2015
  • ...ostic tests were being run on the faulty router – however after that the system restarted and took over as the master router of the pair (which was not ant ...the week. Intermittent timeouts were seen on the tests. The number of LHCb batch jobs has been restricted to try and reduce the problem. In addition, during
    14 KB (1,559 words) - 10:52, 21 January 2015
  • * We are now fully using cgroups to control job memory limits on the batch farm. ... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA datase system, although this was also changed to update from CERN using Oracle Golden Gat
    13 KB (1,290 words) - 11:23, 11 February 2015
  • ...Thursday (26th Feb) there was a problem with our Argus server that stopped batch job submission starting for an hour or so. * Cap on maximum number of ALICE batch jobs raised from 3500 to 6000.
    12 KB (1,175 words) - 14:56, 4 March 2015
  • ...ese sites. There is a complementary page about [[Batch system status|batch system status]].
    3 KB (378 words) - 09:57, 27 June 2017
  • ...f this system is that there is no gate keeper service, head node, or batch system accepting and then directing jobs to particular worker nodes, avoiding seve
    4 KB (628 words) - 12:52, 13 March 2015
  • ...fter the action refer to the Tier1 internal (Footprints) incident tracking system. * Investigate, and implement, an alternative method of connecting to the system to allow for a reconnection in the event of a network break.
    8 KB (1,074 words) - 09:36, 18 September 2018
  • ... network changes were made at the start of a planned upgrade to the Castor system. A network problem was triggered that took most of the day to resolve (and | In response to a ticket from t2k, the non-LHC VOs were re-enabled on the batch farm.
    15 KB (2,406 words) - 16:43, 17 August 2015
  • * A [https://indico.cern.ch/event/319821/ pre-GDB on batch systems] will take place in May. ...tops the HTC solution until after CHEP. Is there interest in testing other batch systems? Raul mentioned SLURM. There is also SGE and Torque.
    43 KB (5,339 words) - 06:42, 27 April 2015
  • |Enable the OSG VO on RAL CEs and batch system. |Test of the Transformation System
    19 KB (3,141 words) - 12:14, 27 April 2020
  • ...addition data is exported to a central database. For sites running a batch system with cgroups enabled, cAdvisor can provide information about running jobs o
    4 KB (584 words) - 20:09, 12 May 2015
  • * CMS CASTOR file open time issues affecting batch farm efficiency ...t dataset that is located almost entirely on one node. Shaun has devised a system to redistribute this dataset across the rest of the cmsDisk pool.
    4 KB (566 words) - 14:12, 15 May 2015
  • * There is a [https://indico.cern.ch/event/319821/ pre-GDB on batch systems at CERN this week]. Tier-2 participation encouraged. ** "Consider open science as a production and dissemination system that needs integrated, easy and fair access to several types of shared reso
    46 KB (5,803 words) - 11:48, 16 May 2015
  • | 20120425-01 || Medium || || Gareth || Review batch system limits || Done. Limits have been removed or increased. || 2012-05-23
    4 KB (566 words) - 09:26, 20 May 2015
  • ...ch/twiki/bin/view/LCG/GDBMeetingNotes20150512 summary of the pre-GDB about batch systems] is available. * There is a [https://indico.cern.ch/event/319821/ pre-GDB on batch systems at CERN this week]. Tier-2 participation encouraged.
    46 KB (5,732 words) - 18:32, 23 May 2015
  • * Tier-1problems with secondary database system for Castor - resolved quickly. ...ch/twiki/bin/view/LCG/GDBMeetingNotes20150512 summary of the pre-GDB about batch systems] is available.
    43 KB (5,271 words) - 22:18, 31 May 2015
  • * A problem with the Argus server affected batch job submissions for a while during the early evening of Friday 5th June. Th * The second batch of 2014 CPU purchases has been brought online.
    16 KB (1,741 words) - 13:24, 10 June 2015
  • * Tier-1problems with secondary database system for Castor - resolved quickly. ...ch/twiki/bin/view/LCG/GDBMeetingNotes20150512 summary of the pre-GDB about batch systems] is available.
    43 KB (5,271 words) - 13:02, 6 June 2015
  • * Tier-1problems with secondary database system for Castor - resolved quickly. ...ch/twiki/bin/view/LCG/GDBMeetingNotes20150512 summary of the pre-GDB about batch systems] is available.
    43 KB (5,391 words) - 15:50, 14 June 2015
  • * Tier-1problems with secondary database system for Castor - resolved quickly. ...9/ Agenda]. There will be presentations and discussions on the Information System.
    45 KB (5,632 words) - 13:32, 21 June 2015
  • * The batch job limit for Alice has been completely removed. (It was set at 6000). ...this 15-minute period all services will be unavailable. The Castor storage system will be stopped at 12:45 UTC before the network break, and restarted once t
    15 KB (1,738 words) - 13:30, 24 June 2015
  • * Highlights: Information System discussion started. Use cases and dependencies will be built up and reviewe * T2 feedback: UK response on Information System: Useful for service discovery; minor VO usage; contains too much informatio
    45 KB (5,792 words) - 21:55, 28 June 2015
  • ...setup for the discussion of batch and CE matters in WLCG: project-lcg-gdb-batch at cern.ch. * Highlights: Information System discussion started. Use cases and dependencies will be built up and reviewe
    45 KB (5,742 words) - 21:15, 5 July 2015
  • ...s down. Forcing the FTS system to use more typical settings unblocked the system. The backlog had cleared by the following morning. ...on (with grid middleware delivered via CVMFS) has been extended to a whole batch of WNs.
    12 KB (1,261 words) - 11:39, 15 July 2015
  • ...ntation proposing a new Task Force] studying the future of the Information System. ...tops the HTC solution until after CHEP. Is there interest in testing other batch systems? Raul mentioned SLURM. There is also SGE and Torque.
    46 KB (5,777 words) - 09:41, 20 July 2015
  • ...tops the HTC solution until after CHEP. Is there interest in testing other batch systems? Raul mentioned SLURM. There is also SGE and Torque. ...implement it involves additional complexity and possibly cost. The current system works fine and we therefore see no overriding reason to remove T1-T1 transi
    47 KB (5,972 words) - 08:49, 27 July 2015
  • ...iday morning (24th July) there was a warning level in the fire suppression system in the machine room. The cause seems to have been the failure of a PDU feed ...) redirection accessing Castor; Slow file open times using Xroot; and poor batch job efficiencies.
    13 KB (1,341 words) - 12:17, 29 July 2015
  • ...) redirection accessing Castor; Slow file open times using Xroot; and poor batch job efficiencies. * Deployed changes to remove glite-CLUSTER node from information system and shutdown cream-ce01 and cream-ce02.
    13 KB (1,380 words) - 13:20, 12 August 2015
  • ...ek triggered by the updates to the information provided to the information system by the ARC CEs. This was fixed on Thursday (13th). ...eferred to last week has improved data access rates for the worst cases of batch work (pile-up jobs).
    14 KB (1,580 words) - 13:55, 19 August 2015
  • * Info system: Implementing [https://twiki.cern.ch/twiki/bin/view/EGEE/AllAboutREBUS#REBU ...tops the HTC solution until after CHEP. Is there interest in testing other batch systems? Raul mentioned SLURM. There is also SGE and Torque.
    52 KB (6,730 words) - 22:58, 9 August 2015
  • * Info system: Implementing [https://twiki.cern.ch/twiki/bin/view/EGEE/AllAboutREBUS#REBU ...tops the HTC solution until after CHEP. Is there interest in testing other batch systems? Raul mentioned SLURM. There is also SGE and Torque.
    52 KB (6,730 words) - 23:00, 9 August 2015
  • * Info system: Implementing [https://twiki.cern.ch/twiki/bin/view/EGEE/AllAboutREBUS#REBU ...tops the HTC solution until after CHEP. Is there interest in testing other batch systems? Raul mentioned SLURM. There is also SGE and Torque.
    48 KB (6,103 words) - 23:03, 16 August 2015
  • * Info system: Implementing [https://twiki.cern.ch/twiki/bin/view/EGEE/AllAboutREBUS#REBU * Lydia's document - Setup a system to do data archiving using FTS3
    45 KB (5,578 words) - 19:59, 22 August 2015

View (previous 100 | next 100) (20 | 50 | 100 | 250 | 500)