Search results


Tier1 Operations Report 2014-10-08

...h hosts the Atlas and GEN SRM databases) was moved to the standby database system. This required an outage of the Castor Atlas and GEN instances which lasted ...day morning (5th Oct). It was restarted and tested but no fault found. The system was returned to service this morning (8th Oct).

15 KB (1,740 words) - 10:50, 15 October 2014
Site information

...completed in October 2008. The first is to provide information about batch system memory limits. The second is to give an update on networking issues that ca [[Background to batch system memory request for details:]]

17 KB (2,669 words) - 11:14, 1 March 2016
Guide to Ganga

...ol for use with both local batch systems and the DIRAC workload management system. It's maintained by Ulrik Egede (ulrik<AT>monash.edu) - please email if you ...want to use it to submit jobs to the grid rather than just for local batch system submission), there are a few steps you need to go through:

15 KB (2,621 words) - 14:40, 27 May 2020
Tier1 Operations Report 2018-06-18

...Limits on concurrent batch system jobs.

16 KB (1,535 words) - 13:37, 20 June 2018
Tier1 Operations Report 2018-05-14

...Limits on concurrent batch system jobs.

16 KB (1,476 words) - 07:41, 16 May 2018
Publishing tutorial

...y maximise throughput. Experiments show that, in order to fully utilize a system, it is often necessary to choose a number of slots that is higher than th Sites have to transmit (via the BDII and the accounting system) a couple more things; the power of the site and the amount of work done.

8 KB (1,284 words) - 14:03, 2 October 2017
Tier1 Operations Report 2014-10-29

...tlasDataDisk - D1T0) had failed for the third time in around a month. This system has been completely drained and is undergoing further investigations. ...regular "PSU" patches will be applied to the Pluto Castor standby database system on Monday (27th Oct) and to the Pluto production database on Wednesday (29t

14 KB (1,569 words) - 13:13, 29 October 2014
Operations Bulletin 271014

* Machine/Job features: Concluded on a single architecture for cloud and batch implementations. * The OGMA database system (Atlas3D/Frontier) has been updated and switched to using Oracle GoldenGate

40 KB (4,976 words) - 10:25, 27 October 2014
Operations Bulletin 031114

* Machine/Job features: Concluded on a single architecture for cloud and batch implementations. * The OGMA database system (Atlas3D/Frontier) has been updated and switched to using Oracle GoldenGate

42 KB (5,228 words) - 10:37, 4 November 2014
Operations Bulletin 101114

* Machine/Job features: Concluded on a single architecture for cloud and batch implementations. LHCB having cvmfs trouble at IC, which was likely caused by a batch of naughty CMS jobs ruining it for everyone else. LHCB re-enabled IC to see

48 KB (6,138 words) - 09:19, 10 November 2014
Operations Bulletin 171114

* Multicore: Passing parameters to batch system discussion started. Limited tests. ATLAS 40% resources now MC. Still 37 sit

39 KB (4,698 words) - 18:46, 16 November 2014
OldEMITarball

...caster test cluster runs using torque, interfacing with a DPM SE, so other batch/storage combinations are not as well tested. ''This assumes that the workernode has been setup to work within the batch system, and the users and groups have been set up. It would technically be possibl

25 KB (4,174 words) - 09:57, 23 July 2015
EMITarball

The tarball versions listed may look convoluted, but there is a system to them! The first part denotes what middleware was used to build the tarba ... vomses, CA and CRLs. For a WN you will have to set up the users and batch system yourself.

11 KB (1,832 words) - 10:02, 23 January 2018
Operations Bulletin 081214

* Multicore: Passing parameters to batch systems [https://indico.cern.ch/event/272779/session/0/contribution/8/mater ...n F ticketed the CA concerning a possible problem with the ticket reminder system. JK has responded with a reply, and asked that similar tickets in the futur

50 KB (6,536 words) - 00:08, 7 December 2014
Past Ticket Bulletins 2015

...on (but as he also notes - what's getting loaded and causing the problem - Batch, CE or WNs?). Kashif reckons the argus server, and suggests a handy glexec Sno+ spotted malloc errors at Lancaster. The problems seemed to survive one batch of fixes, but I asked again if they still see problems after running a good

117 KB (18,736 words) - 11:05, 4 January 2016
Tier1 Operations Report 2014-12-17

* Following a restriction on numbers of CMS batch jobs imposed during problems a week or so ago the CMS jobs limits on the fa ... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA database system, although this was also changed to update from CERN using Oracle Golden Gat

14 KB (1,504 words) - 14:50, 17 December 2014
Tier1 Operations Report 2015-01-07

... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA database system, although this was also changed to update from CERN using Oracle Golden Gat | Due to Kernel patching of EGI ADV 20141217, the RAL tier1 batch farm worker nodes will need to be rebooted.

17 KB (1,780 words) - 12:56, 7 January 2015
Dirac Dictionary

WMS - Workload Management System. The central part of the DIRAC system

2 KB (306 words) - 12:29, 12 March 2015
Tier1 Operations Report 2015-01-14

...ostic tests were being run on the faulty router – however after that the system restarted and took over as the master router of the pair (which was not ant ...the week. Intermittent timeouts were seen on the tests. The number of LHCb batch jobs has been restricted to try and reduce the problem. In addition, during

14 KB (1,559 words) - 10:52, 21 January 2015
Tier1 Operations Report 2015-02-11

* We are now fully using cgroups to control job memory limits on the batch farm. ... brought into use. (Currently Atlas 3D/Frontier still uses the OGMA database system, although this was also changed to update from CERN using Oracle Golden Gat

13 KB (1,290 words) - 11:23, 11 February 2015
