Search results

Jump to: navigation, search
  • | Email everyone on how to hack the publishing system to avoid publishing incorrect GlueSubClusterWNTmpDir. | Plan out the future of CE/Batch System integration. Torque/maui are not supported by EGI. Layout an agenda with pr
    33 KB (5,297 words) - 10:13, 15 November 2017
  • ...lable, called HTCondor (or CONDOR for short). We also decided to front the system with an ARC CE. You'll need a copy of the ARC System Admin Manual.
    121 KB (17,569 words) - 08:26, 28 November 2019
  • ...or allocation. It is a brokering service only. There is one request in the system for cloud resources. * News: CERN-IT to terminate the SLC5-based interactive and batch services (lxplus5 and lxbatch5) soon. The current target date is 30 Septemb
    42 KB (5,358 words) - 10:48, 1 September 2014
  • ... jobs at CCIN2P3 and of the method to passing job requirement arguments to batch systems via CE. ([https://indico.cern.ch/event/339461/ Agenda]) * OSG following up on how to discover HTCondor CEs in the information system.
    46 KB (6,062 words) - 10:07, 15 September 2014
  • ...ring Saturday evening. It was restarted and tested but no fault found. The system was returned to service yesterday (30th Sep). * One batch of worker nodes (64 machines) have had Linux cgroups configured to enforce
    13 KB (1,429 words) - 10:06, 8 October 2014
  • ==RAL Tier1 Incident 20130626 Failure of RAL CVMFS Stratum1 Triggered Batch Farm Problems=====Description:=== ...s over to use other replicas. However this did not happen across the Tier1 batch farm where many nodes were running a version of the CVMFS client in which t
    12 KB (1,968 words) - 15:13, 16 September 2014
  • ...ordinating/publicising local site-admin tools (Nagios plugins, local batch system dashboards)
    906 B (116 words) - 08:35, 5 June 2018
  • ...of the systems affected was the argus server and this caused a problem for batch job submissions for an hour or so. * The Atlas Frontier service will be switched to use the new database system that updates from CERN using Oracle "GoldenGate" on 24th Sep.
    12 KB (1,195 words) - 14:07, 17 September 2014
  • ... jobs at CCIN2P3 and of the method to passing job requirement arguments to batch systems via CE. ([https://indico.cern.ch/event/339461/ Agenda]) * OSG following up on how to discover HTCondor CEs in the information system.
    48 KB (6,422 words) - 08:45, 23 September 2014
  • *** Durham: Batch system upgrade led to one outage and a University wide internet connection loss le * Ongoing tests ongoing with some batch jobs for the LHC VOs running in SL6 containers on worker nodes running SL7.
    42 KB (5,079 words) - 18:37, 19 March 2017
  • ...h hosts the Atlas and GEN SRM databases) was moved to the standby database system. This required an outage of the Castor Atlas and GEN instances which lasted ...day morning (5th Oct). It was restarted and tested but no fault found. The system was returned to service this morning (8th Oct).
    15 KB (1,740 words) - 10:50, 15 October 2014
  • ...completed in October 2008. The first is to provide information about batch system memory limits. The second is to give an update on networking issues that ca [[Background to batch system memory request for details:]]
    17 KB (2,669 words) - 11:14, 1 March 2016
  • ...ol for use with both local batch systems and the DIRAC workload management system. It's maintained by Ulrik Egede (ulrik<AT>monash.edu) - please email if you ...want to use it to submit jobs to the grid rather than just for local batch system submission), there are a few steps you need to go through:
    15 KB (2,621 words) - 14:40, 27 May 2020
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    16 KB (1,535 words) - 13:37, 20 June 2018
  • <!-- ******************Start Limits On Batch System Jobs***************** -----> ...; padding-top: 0.1em; padding-bottom: 0.1em;" | Limits on concurrent batch system jobs.
    16 KB (1,476 words) - 07:41, 16 May 2018
  • ...y maximise throughput. Experiments show that, in order to fully utilize a system, it is often necessary to choose a number of slots that is higher than th Sites have to transmit (via the BDII and the accounting system) a couple more things; the power of the site and the amount of work done.
    8 KB (1,284 words) - 14:03, 2 October 2017
  • ...tlasDataDisk - D1T0) had failed for the third time on around a month. This system has been completely drained and is undergoing further investigations. ...regular "PSU" patches will be applied to the Pluto Castor standby database system on Monday (27th Oct) and to the Pluto production database on Wednesday (29t
    14 KB (1,569 words) - 13:13, 29 October 2014
  • * Machine/Job features: Concluded on a single architecture for cloud and batch implementations. * The OGMA database system (Atlas3D/Frontier) has been updated and switched to using Oracle GoldenGate
    40 KB (4,976 words) - 10:25, 27 October 2014
  • * Machine/Job features: Concluded on a single architecture for cloud and batch implementations. * The OGMA database system (Atlas3D/Frontier) has been updated and switched to using Oracle GoldenGate
    42 KB (5,228 words) - 10:37, 4 November 2014
  • * Machine/Job features: Concluded on a single architecture for cloud and batch implementations. LHCB having cvmfs trouble at IC, which was likely caused by a batch of naughty CMS jobs ruining it for everyone else. LHCB re-enabled IC to see
    48 KB (6,138 words) - 09:19, 10 November 2014

View (previous 20 | next 20) (20 | 50 | 100 | 250 | 500)