Monitoring

From GridPP Wiki

This is a core team task.

The task includes:

1. Develop local monitoring techniques and best practice in discussion with the community

2. Curate sample monitoring scripts and solutions for the community

3. Remain aware of developments in grid operations tools such as the COD/ROD dashboard: http://operations-portal.egi.eu/

4. Brief the team on functionality changes and present monthly updates on experiment tool functionality and changes

4a. Look at wide area monitoring (e.g. regional dashboard/security dashboard) from a site-admin perspective

5. Subscribe to informational lists such as GOCDB discuss

6. Co-ordinate and publicise local site-admin tools (Nagios plugins, local batch system dashboards)

7. Co-ordinate liaison with experiment dashboard teams?
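
As an illustration of the kind of sample script covered by items 2 and 6, the following is a minimal sketch of a Nagios-style check written in Python. It is not taken from any GridPP site. The function names (`classify`, `check_disk`, `main`), the choice of disk usage as the metric, and the 80%/90% thresholds are all assumptions made for the example. The only conventions relied on are the standard Nagios plugin ones: exit codes 0, 1, 2 and 3 for OK, WARNING, CRITICAL and UNKNOWN, and a single output line with optional performance data after a `|`.

```python
"""Hypothetical Nagios-style check: disk usage of a path (illustrative only)."""
import shutil
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def classify(used_pct, warn=80.0, crit=90.0):
    """Map a usage percentage to a status code (thresholds are assumptions)."""
    if used_pct >= crit:
        return CRITICAL
    if used_pct >= warn:
        return WARNING
    return OK


def check_disk(path="/", warn=80.0, crit=90.0):
    """Return (status, message) for the filesystem containing `path`."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        return UNKNOWN, f"DISK UNKNOWN - cannot stat {path}: {exc}"
    if usage.total == 0:
        return UNKNOWN, f"DISK UNKNOWN - {path} reports zero total size"
    used_pct = 100.0 * usage.used / usage.total
    status = classify(used_pct, warn, crit)
    # Text before '|' is the human-readable summary; text after it is perfdata.
    message = (
        f"DISK {LABELS[status]} - {path} {used_pct:.1f}% used"
        f" | used_pct={used_pct:.1f}%;{warn};{crit}"
    )
    return status, message


def main(argv):
    """Print the status line and return the exit code for the given arguments."""
    target = argv[1] if len(argv) > 1 else "/"
    status, message = check_disk(target)
    print(message)
    return status

# When installed as a plugin, finish the script with: sys.exit(main(sys.argv))
```

Keeping the threshold logic in `classify` separate from the measurement in `check_disk` means the same skeleton could be reused for other local metrics by swapping only the measurement.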

This page is a Key Document, and is the responsibility of Federico Melaccio & David Crooks. It was last reviewed on 2015-07-07 when it was considered to be 90% complete. It was last judged to be accurate on 2014-09-17.