RAL Tier1 weekly operations castor 29/04/2016

From GridPP Wiki
Revision as of 09:43, 8 December 2016 by Rob Appleyard 7f7797b74a (Talk | contribs)


AD discussion >>>> batch farm

  • > 10k job slots on farm (15 > 26k)
  • Most throughput from FTS
  • Batch farm & FTS about equal number of files
  • CMS throttled - 1200 job slots (direct IO) - 1200 as it works, MoU should be 3k
  • Atlas - fair share 7-8k jobs ... possibly previous max load + 20%?
  • LHCb - OK at the moment
  • Alice use farm (quite significant) but don't really use CASTOR
  • Non-LHC VOs - mostly going to tape as 'archive' >>> will probably go to Echo
  • 2 definitions of efficiency: success/total or CPU time/wall time - CMS raising issues with both
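
The two efficiency definitions mentioned above measure different things, so a rough sketch may help when comparing numbers. The function names and the sample figures below are illustrative assumptions, not taken from the meeting or from any Tier1 monitoring tool.

```python
def success_efficiency(successful_jobs: int, total_jobs: int) -> float:
    """Job success rate: successful jobs / total jobs."""
    if total_jobs <= 0:
        raise ValueError("total_jobs must be positive")
    return successful_jobs / total_jobs


def cpu_efficiency(cpu_seconds: float, wall_seconds: float) -> float:
    """CPU efficiency: CPU time / wall-clock time."""
    if wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    return cpu_seconds / wall_seconds


# Made-up numbers, for illustration only.
print(success_efficiency(90, 100))     # 90 of 100 jobs succeeded
print(cpu_efficiency(1800.0, 3600.0))  # 30 min CPU in a 60 min job
```

A job can score well on one measure and badly on the other: for example, a job that succeeds but spends much of its wall time waiting on IO counts as a success yet has low CPU efficiency.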


AP - come up with all potential solutions even if ££££

  • 2014 disk servers can be put into CASTOR - possibly CMS .. for IO throughput

  • Reduce the number of drives used (CASTOR partitions) on above machines
  • Atlas log files could be put onto Echo
  • Find problem workflows
  • Get Echo working
  • Second RAID in hardware