Difference between revisions of "Links Monitoring pages"
From GridPP Wiki
(→Alice) |
(→Network Monitoring) |
||
(86 intermediate revisions by 12 users not shown) | |||
Line 3: | Line 3: | ||
== Grid Monitoring == | == Grid Monitoring == | ||
− | + | * [http://gstat-wlcg.cern.ch/apps/capacities/sites WLCG REBUS Capacities] | |
− | * [http://gstat-wlcg.cern.ch/apps/capacities/sites WLCG | + | |
− | === UKI Nagios/myegi monitoring === | + | === UKI Nagios/myegi/argo monitoring === |
− | + | ||
− | + | ||
− | + | ||
− | + | * [http://argo.egi.eu/lavoisier/status_report-site?ngi=NGI_UK&report=Critical&accept=html ARGO UK sites] | |
− | * [ | + | |
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
=== Security monitoring === | === Security monitoring === | ||
Line 26: | Line 13: | ||
* [https://operations-portal.egi.eu/csiDashboard/ngiDetails/ngi/NGI_UK/view_type/monitoring/colspan/unknown EGI Security Dashboard UK view] | * [https://operations-portal.egi.eu/csiDashboard/ngiDetails/ngi/NGI_UK/view_type/monitoring/colspan/unknown EGI Security Dashboard UK view] | ||
− | == Transfers | + | == Transfers Monitoring == |
− | * [https:// | + | * [https://lcgfts3.gridpp.rl.ac.uk:8449/fts3/ftsmon/#/ RAL FTS3 (production instance) monitoring web app] |
* [http://ganglia.gridpp.rl.ac.uk/cgi-bin/ganglia-fts/fts3-sites.pl RAL FTS3 (production instance) ganglia plots] | * [http://ganglia.gridpp.rl.ac.uk/cgi-bin/ganglia-fts/fts3-sites.pl RAL FTS3 (production instance) ganglia plots] | ||
− | * [https:// | + | * [https://fts3-test.gridpp.rl.ac.uk:8449/fts3/ftsmon/#/ RAL FTS3 (test instance) monitoring web app] |
− | + | ||
− | + | ||
− | + | ||
− | + | ||
− | + | ||
* [http://dashb-wlcg-transfers.cern.ch/ui/ WLCG transfers dashboard] | * [http://dashb-wlcg-transfers.cern.ch/ui/ WLCG transfers dashboard] | ||
* [http://dashb-fts-transfers.cern.ch/ui/ FTS dashboard] | * [http://dashb-fts-transfers.cern.ch/ui/ FTS dashboard] | ||
− | |||
− | == | + | == Network Monitoring with perfSONAR== |
− | * [http:// | + | * [https://psmad.opensciencegrid.org/maddash-webui/index.cgi?dashboard=UK%20Mesh%20Config WLCG perfSONAR dashboard] |
+ | * [https://ps-dash.dev.ja.net/maddash-webui/index.cgi?dashboard=UK%20Mesh%20Config Jisc perfSONAR dashboard] | ||
+ | * [https://psetf.opensciencegrid.org/etf/check_mk/index.py?start_url=%2Fetf%2Fcheck_mk%2Fview.py%3Fhostgroup%3DUK%26opthost_group%3DUK%26view_name%3Dhostgroup WLCG/OSG perfsonar Check_MK ] | ||
+ | * [http://ps-dashboard.es.net/maddash-webui/index.cgi?dashboard=6%3A%20ESnet%20to%20International ESnet International perfSONAR dashboard] | ||
− | == | + | == Accounting == |
− | * [ | + | |
+ | * [https://twiki.cern.ch/twiki/bin/view/LCG/AccountingFAQ WLCG Accounting FAQ] | ||
+ | * [https://accounting-next.egi.eu/egi/country/United%20Kingdom New EGI accounting portal] | ||
+ | * [http://tinyurl.com/hevnfz5 Experiments APEL comparison UK] | ||
+ | * [http://tinyurl.com/zdtco8j ATLAS APEL comparison UK last 3 months] | ||
+ | * [http://tinyurl.com/zksowcl CMS APEL comparison UK last 3 months] | ||
== Experiment Monitoring == | == Experiment Monitoring == | ||
− | === | + | === ATLAS === |
* [http://adc-monitoring.cern.ch ADC Monitoring] | * [http://adc-monitoring.cern.ch ADC Monitoring] | ||
− | * ''' | + | * '''Blacklist specific pages''' |
− | ** [ | + | ** [https://bigpanda.cern.ch/sites/?cloud=UK BigPanda Queues status] |
− | + | ** [http://atlas-agis.cern.ch/agis/pandablacklisting/list Panda Queue Blacklist page] | |
− | + | ** [http://atlas-agis.cern.ch/agis/ddmblacklisting/list/ DDM blacklist page] | |
− | + | '''Job Monitoring''' | |
+ | *'''Historical''' | ||
** [http://panglia.triumf.ca Panglia] | ** [http://panglia.triumf.ca Panglia] | ||
− | ** [http:// | + | ** [http://tinyurl.com/ounn5od Atlas Historical Dashboard UK view] (Soon obsolete) |
− | *'''Production''' | + | ** [https://monit-grafana.cern.ch/d/a62E4PgWk/job-accounting-uk-cloud?orgId=17 New grafana monitoring UK view] |
− | ** [ | + | *'''Big Panda Production''' |
− | *'''Analysis''' | + | ** [https://bigpanda.cern.ch/dash/production/?cloudview=region#cloud_UK Big Panda production dashboard] |
− | ** [ | + | *'''Big Panda Analysis''' |
− | + | ** [https://bigpanda.cern.ch/dash/analysis/#cloud_UK Big Panda Analysis dashboard] | |
** [http://hammercloud.cern.ch/hc/app/atlas HammerCloud tests (V4)] | ** [http://hammercloud.cern.ch/hc/app/atlas HammerCloud tests (V4)] | ||
− | ** [http://apfmon.lancs.ac.uk/ Pilot | + | *'''Pilot/Harvester worker''' |
− | *'''AGIS''' | + | ** [http://apfmon.lancs.ac.uk/ Pilot wrapper job monitoring] |
− | + | ** [https://tinyurl.com/y63jvapb Harvester monitoring] | |
− | + | '''AGIS''' | |
− | + | * [http://atlas-agis.cern.ch/agis/atlassite/table_view/ Site Configuration] | |
− | + | * [http://atlas-agis.cern.ch/agis/panda_queue/table_view/ Queue Configuration] | |
− | ** [ | + | '''DDM and transfers''' |
− | + | * [https://tinyurl.com/y2a2mjrn DDM grafana dashboard] | |
− | + | ** [https://tinyurl.com/yynnsc8a UK as a source] | |
− | + | ** [https://tinyurl.com/yy2arjqk UK as a destination] | |
− | + | '''Storage''' | |
− | + | * [http://tinyurl.com/kqqx2vd ATLAS UK storage accounting] | |
− | + | * [http://www.hep.lancs.ac.uk/~love/ukdata/ Peter Love's pledge monitoring ] | |
− | + | * [http://adc-ddm-mon.cern.ch/ddmusr01/plots Rucio Space Tokens plots] | |
− | + | * [https://rucio-ui.cern.ch/bad_replicas?state=SUSPICIOUS List of suspicious files] | |
− | + | '''SUM and Lloyds''' | |
− | + | * [http://tinyurl.com/jhlczrh ATLAS ETF (nagios)] | |
− | + | * [http://tinyurl.com/nejr8r4 ATLAS ETF(dashboard)] | |
− | + | * [http://wlcg-squid-monitor.cern.ch/snmpstats/mrtgatlas2/indexatlas2siteUKI-NORTHGRID-MAN-HEP.html ATLAS Squid monitoring] | |
− | + | '''Other services''' | |
− | + | * [https://atlas-logbook.cern.ch/elog/ATLAS+Computer+Operations+Logbook/?Cloud=^UK%24 Atlas shifters elog] | |
− | + | '''UK Support Mailing Lists''' | |
− | + | * <span style="color: #0000ff">atlas-support-cloud-uk@NOSPAMcern.ch</span> Atlas UK Cloud Support (use for help solving problems and in GGUS tickets) | |
− | + | * <span style="color: #0000ff">atlas-uk-comp-operations@NOSPAMcern.ch</span> Atlas UK Computing Operations (use for general discussion) | |
− | + | * [https://indico.cern.ch/categoryDisplay.py?categId=4620 Atlas UK meeting (Thursday 10am)] | |
− | + | ||
− | + | ||
=== CMS === | === CMS === | ||
− | |||
− | |||
− | |||
* [http://dashboard.cern.ch/cms Dashboard] | * [http://dashboard.cern.ch/cms Dashboard] | ||
− | * [http:// | + | * [http://tinyurl.com/h467yn9 CMS ETF (nagios)] |
− | * [ | + | * [http://tinyurl.com/kyzmghs CMS ETF (dashboard) T2] |
− | * [http:// | + | * [http://wlcg-sam-cms.cern.ch/templates/ember/#/plot?group=Tier3s&profile=CMS_CRITICAL&sites=T3_UK_London_QMUL%2CT3_UK_London_RHUL%2CT3_UK_London_UCL%2CT3_UK_ScotGrid_GLA%2CT3_UK_SGrid_Oxford CMS ETF (dashboard) T3] |
− | + | * [http://dashb-ssb.cern.ch/dashboard/request.py/siteview? Site status Board (T2 and T3)]. Click on 'Analysis' and 'Production' for further details. | |
− | * [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?view=global PhEDEx transfer rate plots] | + | |
+ | * [https://cms-site-readiness.web.cern.ch/cms-site-readiness/SiteReadiness/HTML/SiteReadinessReport.html#T2_UK_London_Brunel Site Readiness status (T2s only starts at Brunel, scroll down for other UK sites)] | ||
+ | * [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?view=global PhEDEx data transfer rate plots] | ||
** [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?graph=quantity_rates&entity=src&src_filter=UK&dest_filter=&no_mss=true&period=l96h&upto= From UK sites] | ** [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?graph=quantity_rates&entity=src&src_filter=UK&dest_filter=&no_mss=true&period=l96h&upto= From UK sites] | ||
** [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?graph=quantity_rates&entity=dest&src_filter=&dest_filter=UK&no_mss=true&period=l96h&upto= To UK sites] | ** [http://cmsdoc.cern.ch/cms/aprom/phedex/prod/Activity::RatePlots?graph=quantity_rates&entity=dest&src_filter=&dest_filter=UK&no_mss=true&period=l96h&upto= To UK sites] | ||
+ | * Debug transfers from UK sites: [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_London_IC&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_London_IC],[https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_London_Brunel&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_London_Brunel],[https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_SGrid_RALPP&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_SGrid_RALPP], [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T2_UK_SGrid_Bristol&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T2_UK_SGrid_Bristol], | ||
+ | [https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T3_UK_London_QMUL&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T3_UK_London_QMUL],[https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T3_UK_London_RHUL&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T3_UK_London_RHUL],[https://cmsweb.cern.ch/phedex/debug/Activity::QualityPlots?graph=quality_all&entity=dest&src_filter=T3_UK_ScotGrid_GLA&dest_filter=.*&no_mss=true&period=l96h&upto=&.submit=Update T3_UK_ScotGrid_GLA],(no debug transfers for Oxford) | ||
=== LHCb === | === LHCb === | ||
− | * [ | + | * [http://tinyurl.com/zcttfdf LHCB ETF (nagios)] |
− | * [http:// | + | * [http://wlcg-sam-lhcb.cern.ch/templates/ember/#/historicalsmry/heatMap?group=Tier%200%2F1&profile=LHCb_CRITICAL&site=LCG.SARA.nl%2CLCG.RRCKI.ru%2CLCG.RAL.uk%2CLCG.PIC.es%2CLCG.NIKHEF.nl%2CLCG.IN2P3.fr%2CLCG.GRIDKA.de%2CLCG.CNAF.it%2CLCG.CERN.ch&time=Last%2024%20Hours LHCb ETF (dashboard)] |
− | * [ | + | * [https://lhcb-portal-dirac.cern.ch/DIRAC/?view=tabs&theme=Grey&url_state=1|*DIRAC.SiteSummary.classes.SiteSummary:, Site Summary] |
− | * [ | + | * [https://lhcb-portal-dirac.cern.ch/DIRAC/?view=tabs&theme=Grey&url_state=1|*LHCbDIRAC.Accounting.classes.Accounting:, LHCb Accounting] Please choose what you want to look at in the first drop-down box called "category". Hints : "Data operation" = Transfer operations, "Job" = Completed jobs, "WMS history" = Pilot jobs, "Pilot" = Pilot statuses. |
− | * | + | * [http://pprc.qmul.ac.uk/~walker/votable.html Steve Lloyd User Monitoring (lhcb)] |
− | * [http://pprc.qmul.ac.uk/~ | + | |
− | === | + | === ALICE === |
− | * [http://wlcg-sam-alice.cern.ch/templates/ember/ | + | * [http://tinyurl.com/huwrqs3 ALICE ETF (nagios)] |
+ | * [http://wlcg-sam-alice.cern.ch/templates/ember/ ALICE ETF (dashboard)] | ||
* [http://alimonitor.cern.ch/stats?page=SE/table SE tests] | * [http://alimonitor.cern.ch/stats?page=SE/table SE tests] | ||
* [http://alimonitor.cern.ch/siteinfo/?site=RAL Site Overview] | * [http://alimonitor.cern.ch/siteinfo/?site=RAL Site Overview] | ||
Line 121: | Line 110: | ||
* [http://alimonitor.cern.ch/display?page=jobResUsageSum_time_cpu CPU Accounting] | * [http://alimonitor.cern.ch/display?page=jobResUsageSum_time_cpu CPU Accounting] | ||
* [http://alimonitor.cern.ch/display?page=FTD/SE RAW data repication speed] | * [http://alimonitor.cern.ch/display?page=FTD/SE RAW data repication speed] | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
== Tickets == | == Tickets == | ||
Line 151: | Line 131: | ||
* Argus | * Argus | ||
** https://www.gridpp.ac.uk/wiki/ARGUS_deployment | ** https://www.gridpp.ac.uk/wiki/ARGUS_deployment | ||
− | {{KeyDocs|responsible=Alessandra Forti|reviewdate= | + | {{KeyDocs|responsible=Alessandra Forti|reviewdate=2015-11-15|accuratedate=2015-11-15|percentage=90}} |
Latest revision as of 08:41, 7 August 2019
Contents
Grid Monitoring
UKI Nagios/myegi/argo monitoring
Security monitoring
- EGI Pakiti - only if you are registered as security officer in GOCDB
- EGI Security Dashboard UK view
Transfers Monitoring
- RAL FTS3 (production instance) monitoring web app
- RAL FTS3 (production instance) ganglia plots
- RAL FTS3 (test instance) monitoring web app
- WLCG transfers dashboard
- FTS dashboard
Network Monitoring with perfSONAR
- WLCG perfSONAR dashboard
- Jisc perfSONAR dashboard
- WLCG/OSG perfsonar Check_MK
- ESnet International perfSONAR dashboard
Accounting
- WLCG Accounting FAQ
- New EGI accounting portal
- Experiments APEL comparison UK
- ATLAS APEL comparison UK last 3 months
- CMS APEL comparison UK last 3 months
Experiment Monitoring
ATLAS
- ADC Monitoring
- Blacklist specific pages
Job Monitoring
- Historical
- Panglia
- Atlas Historical Dashboard UK view (Soon obsolete)
- New grafana monitoring UK view
- Big Panda Production
- Big Panda Analysis
- Pilot/Harvester worker
AGIS
DDM and transfers
Storage
- ATLAS UK storage accounting
- Peter Love's pledge monitoring
- Rucio Space Tokens plots
- List of suspicious files
SUM and Lloyds
Other services
UK Support Mailing Lists
- atlas-support-cloud-uk@NOSPAMcern.ch Atlas UK Cloud Support (use for help solving problems and in GGUS tickets)
- atlas-uk-comp-operations@NOSPAMcern.ch Atlas UK Computing Operations (use for general discussion)
- Atlas UK meeting (Thursday 10am)
CMS
- Dashboard
- CMS ETF (nagios)
- CMS ETF (dashboard) T2
- CMS ETF (dashboard) T3
- Site status Board (T2 and T3). Click on 'Analysis' and 'Production' for further details.
- Site Readiness status (T2s only starts at Brunel, scroll down for other UK sites)
- PhEDEx data transfer rate plots
- Debug transfers from UK sites: T2_UK_London_IC,T2_UK_London_Brunel,T2_UK_SGrid_RALPP, T2_UK_SGrid_Bristol,
T3_UK_London_QMUL,T3_UK_London_RHUL,T3_UK_ScotGrid_GLA,(no debug transfers for Oxford)
LHCb
- LHCB ETF (nagios)
- LHCb ETF (dashboard)
- Site Summary
- LHCb Accounting Please choose what you want to look at in the first drop-down box called "category". Hints : "Data operation" = Transfer operations, "Job" = Completed jobs, "WMS history" = Pilot jobs, "Pilot" = Pilot statuses.
- Steve Lloyd User Monitoring (lhcb)
ALICE
- ALICE ETF (nagios)
- ALICE ETF (dashboard)
- SE tests
- Site Overview
- Active jobs per site
- CPU Accounting
- RAW data repication speed
Tickets
Status tracking links
Links for tracking the deployment status at sites.
- SL6
- WebDAV
- https://www.gridpp.ac.uk/wiki/WebDAV WebDAV/xrootd
- IPv6
- perfSONAR
- Backup VOMS servers
- Argus
This page is a Key Document, and is the responsibility of Alessandra Forti. It was last reviewed on 2015-11-15 when it was considered to be 90% complete. It was last judged to be accurate on 2015-11-15.