David Moreau-Simard
David is an open source enthusiast and contributor to projects like OpenStack, Ansible, Fedora and CentOS.
iWeb, Ubisoft and RedHat alumni doing bare metal and clouds, Dev/Ops, SRE, CI/CD and everything in between.
He likes simplicity and makes things work.
Sessions
With thousands of available plugins, Ansible automates and orchestrates configuration management, application deployment as well as cloud, network, security and server infrastructure.
Beyond these typical scenarios, it can be a great abstraction layer to interface or glue different tools and systems together.
Given this wide range of use cases and the many ways they can all go wrong differently dozens or thousands of times a day, it would be interesting and useful to have detailed and granular metrics about individual playbooks, hosts and tasks.
We could spot improvements, regressions, spikes and bottlenecks in Grafana to make playbooks run better and faster.
If unexpected changes or failures happen, we could notify someone or something about it with Alertmanager.
In this talk we'll explain and show "why not" using an implementation that puts Ansible metrics in Prometheus using ARA Records Ansible.
At time of writing, it kind of works and puts many pieces of the puzzle together but doesn't quite use the right approach. It turns out putting historical metrics in Prometheus is not that simple.
We might just find out how to do it together if you are interested in the use case !