Difference between revisions of "Batch system status"

From GridPP Wiki
Jump to: navigation, search
Line 31: Line 31:
 
|Interest/Investigating/Testing
 
|Interest/Investigating/Testing
 
|CE type(s) & plans at site
 
|CE type(s) & plans at site
 +
|Multicore Atlas/CMS
 
|Cloud interface available/plans
 
|Cloud interface available/plans
 
|Notes
 
|Notes
Line 40: Line 41:
 
|<span style="color:green">No reason to change</span>
 
|<span style="color:green">No reason to change</span>
 
|<span style="color:green">ARC & CREAM CEs, but would like to decommission CREAM CEs eventually</span>
 
|<span style="color:green">ARC & CREAM CEs, but would like to decommission CREAM CEs eventually</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 49: Line 51:
 
|<span style="color:green">Slurm and HTCondor in test</span>
 
|<span style="color:green">Slurm and HTCondor in test</span>
 
|<span style="color:green">Arc in test</span>
 
|<span style="color:green">Arc in test</span>
 +
|
 
|<span style="color:green">OpenVZ in production, Docker in test</span>
 
|<span style="color:green">OpenVZ in production, Docker in test</span>
 
|
 
|
Line 54: Line 57:
 
|-
 
|-
 
|UKI-LT2-IC-HEP
 
|UKI-LT2-IC-HEP
|<span style="color:green">Gridengine</span>
+
|<span style="color:green">Gridengine (local)</span>
|<span style="color:green">None</span>
+
|<span style="color:green"> - </span>
|<span style="color:green">None</span>
+
|<span style="color:green"> - </span>
 
|<span style="color:green">CREAM, ARC</span>
 
|<span style="color:green">CREAM, ARC</span>
 +
|
 
|<span style="color:green">GridPP Cloud Tests</span>
 
|<span style="color:green">GridPP Cloud Tests</span>
 
|
 
|
Line 65: Line 69:
 
|UKI-LT2-QMUL
 
|UKI-LT2-QMUL
 
|<span style="color:green">Gridengine (local)</span>
 
|<span style="color:green">Gridengine (local)</span>
|<span style="color:green">None</span>
+
|<span style="color:green"> - </span>
|<span style="color:green">son of gridengine</span>
+
|<span style="color:green">Son of Gridengine</span>
|<span style="color:green">cream</span>
+
|<span style="color:green">CREAM</span>
 +
|Yes
 
|<span style="color:green">Deploy cloudstack, find scalable solution to get our storage usable in the cloud</span>
 
|<span style="color:green">Deploy cloudstack, find scalable solution to get our storage usable in the cloud</span>
 
|
 
|
Line 76: Line 81:
 
|<span style="color:green">Torque/Maui support non-existent</span>
 
|<span style="color:green">Torque/Maui support non-existent</span>
 
|<span style="color:green">Will follow the consensus</span>
 
|<span style="color:green">Will follow the consensus</span>
|<span style="color:green">Cream</span>
+
|<span style="color:green">CREAM</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 87: Line 93:
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 96: Line 103:
 
|<span style="color:green">Disillusioned with torque/maui.</span>
 
|<span style="color:green">Disillusioned with torque/maui.</span>
 
|<span style="color:green">Slurm or HTCondor.</span>
 
|<span style="color:green">Slurm or HTCondor.</span>
|<span style="color:green">Cream, interested in ARC</span>
+
|<span style="color:green">CREAM, interested in ARC</span>
 +
|Yes
 
|<span style="color:green">VMWare testing.</span>
 
|<span style="color:green">VMWare testing.</span>
 
|
 
|
Line 102: Line 110:
 
|-
 
|-
 
|UKI-NORTHGRID-LIV-HEP
 
|UKI-NORTHGRID-LIV-HEP
|<span style="color:green">Torque Maui</span>
+
|<span style="color:green">Torque Maui (local)</span>
 
|<span style="color:green">Poor Support, Maui intrinsically broken</span>
 
|<span style="color:green">Poor Support, Maui intrinsically broken</span>
|<span style="color:green">ARC/Condor </span>
+
|<span style="color:green">ARC/HTCondor </span>
 
|<span style="color:green">Cream (ARC)</span>
 
|<span style="color:green">Cream (ARC)</span>
 +
|Testing
 
|<span style="color:green">None</span>
 
|<span style="color:green">None</span>
 
|
 
|
Line 113: Line 122:
 
|<span style="color:green">Torque/Maui (local)</span>
 
|<span style="color:green">Torque/Maui (local)</span>
 
|<span style="color:green">Maui is unsupported. It had memory leaks. Robert wrote a patch and there was nowhere to feed it back into.</span>
 
|<span style="color:green">Maui is unsupported. It had memory leaks. Robert wrote a patch and there was nowhere to feed it back into.</span>
|<span style="color:green">htcondor</span>
+
|<span style="color:green">HTCondor</span>
|<span style="color:green">Currently CreamCE, investigating ARC-CE</span>
+
|<span style="color:green">Currently CREAM, investigating ARC-CE</span>
 +
|<span style="color:green">Yes</span>
 
|<span style="color:green">Vac in production on testbed</span>
 
|<span style="color:green">Vac in production on testbed</span>
 
|
 
|
Line 125: Line 135:
 
|<span style="color:green">Will follow the consensus</span>
 
|<span style="color:green">Will follow the consensus</span>
 
|<span style="color:green">CREAM CE</span>
 
|<span style="color:green">CREAM CE</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 131: Line 142:
 
|-
 
|-
 
|UKI-SCOTGRID-DURHAM
 
|UKI-SCOTGRID-DURHAM
|<span style="color:green">Torque/Maui - Local</span>
+
|<span style="color:green">Torque/Maui (local)</span>
 
|<span style="color:green">Becomes unresponsive and unstable. Doesn't behave particularly well if it looses nodes.</span>
 
|<span style="color:green">Becomes unresponsive and unstable. Doesn't behave particularly well if it looses nodes.</span>
 
|<span style="color:green">SLURM</span>
 
|<span style="color:green">SLURM</span>
 
|<span style="color:green">Currently CreamCE, would like to use ARC as a replacement</span>
 
|<span style="color:green">Currently CreamCE, would like to use ARC as a replacement</span>
 +
|
 
|<span style="color:green">N/A</span>
 
|<span style="color:green">N/A</span>
 
|
 
|
Line 142: Line 154:
 
|UKI-SCOTGRID-ECDF
 
|UKI-SCOTGRID-ECDF
 
|<span style="color:green">Gridengine</span>
 
|<span style="color:green">Gridengine</span>
|<span style="color:green">None</span>
+
|<span style="color:green"> - </span>
 
|<span style="color:green">No plans to change</span>
 
|<span style="color:green">No plans to change</span>
 
|<span style="color:green">Cream CE for standard production, ARC CE for exploratory HPC work</span>
 
|<span style="color:green">Cream CE for standard production, ARC CE for exploratory HPC work</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 155: Line 168:
 
|<span style="color:green">Investigating HTCondor/SoGE/SLURM as a replacement.</span>
 
|<span style="color:green">Investigating HTCondor/SoGE/SLURM as a replacement.</span>
 
|<span style="color:green">Currently CreamCE, investigating ARC CE as replacement.</span>
 
|<span style="color:green">Currently CreamCE, investigating ARC CE as replacement.</span>
 +
|
 
|<span style="color:green">N/A</span>
 
|<span style="color:green">N/A</span>
 
|
 
|
Line 164: Line 178:
 
|<span style="color:green">HTCondor</span>
 
|<span style="color:green">HTCondor</span>
 
|<span style="color:green">CREAM</span>
 
|<span style="color:green">CREAM</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 171: Line 186:
 
|UKI-SOUTHGRID-BRIS
 
|UKI-SOUTHGRID-BRIS
 
|<span style="color:green">HTCondor (shared), torque + maui (local)</span>
 
|<span style="color:green">HTCondor (shared), torque + maui (local)</span>
|<span style="color:green">None</span>
+
|<span style="color:green"> - </span>
|<span style="color:green">No reason to change</span>
+
|<span style="color:green"> - </span>
 
|<span style="color:green">ARC & CREAM CEs</span>
 
|<span style="color:green">ARC & CREAM CEs</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 184: Line 200:
 
|<span style="color:green">Will follow the consensus</span>
 
|<span style="color:green">Will follow the consensus</span>
 
|<span style="color:green">CREAM CE</span>
 
|<span style="color:green">CREAM CE</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 189: Line 206:
 
|-
 
|-
 
|UKI-SOUTHGRID-OX-HEP
 
|UKI-SOUTHGRID-OX-HEP
|<span style="color:green">Torque/Maui</span>
+
|<span style="color:green">HTCondor (local)</span>
|<span style="color:green">Becomes unresponsive and unstable.</span>
+
|<span style="color:green"> - </span>
|<span style="color:green">Moved 1/3 WN's to HTCondor</span>
+
|<span style="color:green"> - </span>
|<span style="color:green">CREAMCE,  ARC CE in production</span>
+
|<span style="color:green">ARC CE in production</span>
 +
|<span style="color:green">Yes/Not yet</span>
 
|<span style="color:green">OpenStack in production. Testing VAC</span>
 
|<span style="color:green">OpenStack in production. Testing VAC</span>
 
|
 
|
Line 203: Line 221:
 
|<span style="color:green">None, just migrated from torque/maui</span>
 
|<span style="color:green">None, just migrated from torque/maui</span>
 
|<span style="color:green">ArcCE (Legacy CreamCEs will be switched off soon</span>
 
|<span style="color:green">ArcCE (Legacy CreamCEs will be switched off soon</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|
Line 210: Line 229:
 
|UKI-SOUTHGRID-SUSX
 
|UKI-SOUTHGRID-SUSX
 
|<span style="color:green">(Shared) Gridengine - (Univa Grid Engine)</span>
 
|<span style="color:green">(Shared) Gridengine - (Univa Grid Engine)</span>
|<span style="color:green">None</span>
+
|<span style="color:green"> - </span>
|<span style="color:green">No reason to change</span>
+
|<span style="color:green"> - </span>
 
|<span style="color:green">CREAMCE</span>
 
|<span style="color:green">CREAMCE</span>
 +
|
 
|<span style="color:green"></span>
 
|<span style="color:green"></span>
 
|
 
|

Revision as of 14:18, 29 July 2014

Other links

Sites batch system status

This page has been setup to collect information from GridPP sites regarding their batch systems in February 2014. The information will help with wider considerations and strategy. The table seeks the following:

1) Current product (local/shared) - what is the current batch system at the site. Is it locally managed or shared with other groups?

2) Concerns - has your site experienced any problems with the batch system in operation?

3) Interest/Investigating/Testing - Does your site already have plans to change and if so to what. If not are you actively investigating or testing any alternatives?

4) CE type(s) - What CE type (gLite, ARC...) do you currently run and do you plan to change this, perhaps in conjunction with a batch system move?

5) Cloud interface(s)? - Does your site offer access to resources in ways other than via a CE?

6) Notes - Any other information you wish to share on this topic.



Site Current product (local/shared) Concerns and observations Interest/Investigating/Testing CE type(s) & plans at site Multicore Atlas/CMS Cloud interface available/plans Notes
RAL Tier-1 HTCondor (local) None No reason to change ARC & CREAM CEs, but would like to decommission CREAM CEs eventually
UKI-LT2-Brunel Torque/Maui, Arc/Condor No support for Torque/Maui Slurm and HTCondor in test Arc in test OpenVZ in production, Docker in test
UKI-LT2-IC-HEP Gridengine (local) - - CREAM, ARC GridPP Cloud Tests


UKI-LT2-QMUL Gridengine (local) - Son of Gridengine CREAM Yes Deploy cloudstack, find scalable solution to get our storage usable in the cloud
UKI-LT2-RHUL Torque/Maui (local) Torque/Maui support non-existent Will follow the consensus CREAM


UKI-LT2-UCL-HEP


UKI-NORTHGRID-LANCS-HEP Son of Gridengine (HEC), torque/maui (local) Disillusioned with torque/maui. Slurm or HTCondor. CREAM, interested in ARC Yes VMWare testing.
UKI-NORTHGRID-LIV-HEP Torque Maui (local) Poor Support, Maui intrinsically broken ARC/HTCondor Cream (ARC) Testing None
UKI-NORTHGRID-MAN-HEP Torque/Maui (local) Maui is unsupported. It had memory leaks. Robert wrote a patch and there was nowhere to feed it back into. HTCondor Currently CREAM, investigating ARC-CE Yes Vac in production on testbed


UKI-NORTHGRID-SHEF-HEP Torque/Maui (local) Torque/Maui support non-existent Will follow the consensus CREAM CE


UKI-SCOTGRID-DURHAM Torque/Maui (local) Becomes unresponsive and unstable. Doesn't behave particularly well if it looses nodes. SLURM Currently CreamCE, would like to use ARC as a replacement N/A


UKI-SCOTGRID-ECDF Gridengine - No plans to change Cream CE for standard production, ARC CE for exploratory HPC work


UKI-SCOTGRID-GLASGOW Torque/Maui - Local Becomes unresponsive at times of high load or nodes being un-contactable. Investigating HTCondor/SoGE/SLURM as a replacement. Currently CreamCE, investigating ARC CE as replacement. N/A
UKI-SOUTHGRID-BHAM-HEP Torque/Maui Maui sometimes fails to see new jobs and so nothing is scheduled HTCondor CREAM


UKI-SOUTHGRID-BRIS HTCondor (shared), torque + maui (local) - - ARC & CREAM CEs


UKI-SOUTHGRID-CAM-HEP Torque/Maui (local) Torque/Maui support non-existent Will follow the consensus CREAM CE
UKI-SOUTHGRID-OX-HEP HTCondor (local) - - ARC CE in production Yes/Not yet OpenStack in production. Testing VAC


UKI-SOUTHGRID-RALPP HTCondor (Legacy Torque/Maui will be switched off soon) None None, just migrated from torque/maui ArcCE (Legacy CreamCEs will be switched off soon


UKI-SOUTHGRID-SUSX (Shared) Gridengine - (Univa Grid Engine) - - CREAMCE