1. 11 Feb, 2021 1 commit
      Add correct After targets for prometheus_exporters · 042ff9cf
      Jelle van der Waa authored
      arch-audit hung because it was started before a working internet
      connection was available. To work around this, add proper Wants/After
      dependencies on network-online.target, and let the rebuilderd
      textcollector start after rebuilderd.service is "online".
  2. 26 Jan, 2021 1 commit
      Add a btrfs prometheus exporter · 8ea35153
      Jelle van der Waa authored
      Collect btrfs errors for Prometheus using the btrfs command from
      btrfs-progs, which has supported JSON output for device stats since
      5.10. In the future, the collected errors will trigger an alert when
      they reach a certain threshold.
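A minimal sketch of how such a collector could turn the JSON device stats into Prometheus text-format lines. This is not the script from commit 8ea35153: the metric name `btrfs_device_errors`, the function name, the label names and the exact JSON layout in `SAMPLE` (a `device-stats` array with per-device error fields) are assumptions made for illustration.

```python
import json

# Hypothetical sample of what `btrfs --format json device stats <mountpoint>`
# might print; the exact layout is an assumption, not taken from the commit.
SAMPLE = """
{
  "__header": {"version": "1"},
  "device-stats": [
    {"device": "/dev/sda1", "devid": "1",
     "write_io_errs": "0", "read_io_errs": "2", "flush_io_errs": "0",
     "corruption_errs": "0", "generation_errs": "0"}
  ]
}
"""

# Error fields assumed to be reported per device.
ERROR_KEYS = (
    "write_io_errs",
    "read_io_errs",
    "flush_io_errs",
    "corruption_errs",
    "generation_errs",
)


def render_btrfs_metrics(raw_json):
    """Render device error counts in the Prometheus text exposition format."""
    stats = json.loads(raw_json).get("device-stats", [])
    lines = [
        "# HELP btrfs_device_errors Error count reported by btrfs device stats",
        "# TYPE btrfs_device_errors gauge",
    ]
    for dev in stats:
        for key in ERROR_KEYS:
            # int() accepts both quoted and unquoted numbers in the JSON.
            value = int(dev.get(key, 0))
            lines.append(
                f'btrfs_device_errors{{device="{dev["device"]}",type="{key}"}} {value}'
            )
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_btrfs_metrics(SAMPLE), end="")
```

In a real setup the rendered text would be written to a `.prom` file in the directory that node_exporter's textfile collector reads.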
  3. 25 Jan, 2021 1 commit
  4. 13 Jan, 2021 1 commit
  5. 02 Jan, 2021 1 commit
  6. 18 Dec, 2020 1 commit
  7. 14 Dec, 2020 1 commit
      Add archive specific monitoring · 4658d36d
      Jelle van der Waa authored
      To monitor our archive mirrors and the archive size itself, a new
      textcollector has been added. This will allow us to monitor the archive
      growth and the sync rate to mirrors.
  8. 17 Oct, 2020 1 commit
  9. 06 Oct, 2020 1 commit
      Add rebuilderd_results Prometheus metric · 7abc2500
      Jelle van der Waa authored
      To monitor if reproducible builds are going in the right direction,
      record the good/bad/unknown metrics from rebuilderd with a Prometheus
      textcollector for a Grafana dashboard to display a long term trend.
      
      A Python script is required to handle data collection, as obtaining the
      status with jq/bash is non-trivial and cannot easily collect suites and
      statuses dynamically.
      
      Closes: #146
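The dynamic collection described in commit 7abc2500 can be sketched roughly as follows. This is an illustration only, not the script from the commit: the input shape (a list of dicts with `suite` and `status` keys), the label names and the function name are assumptions, and only the metric name `rebuilderd_results` comes from the commit title.

```python
from collections import Counter

# Hypothetical package list; a real script would obtain this from rebuilderd.
PACKAGES = [
    {"name": "pkg-a", "suite": "core", "status": "good"},
    {"name": "pkg-b", "suite": "core", "status": "bad"},
    {"name": "pkg-c", "suite": "core", "status": "good"},
    {"name": "pkg-d", "suite": "extra", "status": "unknown"},
]


def render_rebuilderd_results(packages):
    """Count packages per (suite, status) pair and render Prometheus text lines.

    Suites and statuses are discovered from the data, so a new suite or
    status shows up without changing the script.
    """
    counts = Counter((pkg["suite"], pkg["status"]) for pkg in packages)
    lines = [
        "# HELP rebuilderd_results Number of packages per suite and status",
        "# TYPE rebuilderd_results gauge",
    ]
    for (suite, status), count in sorted(counts.items()):
        lines.append(
            f'rebuilderd_results{{suite="{suite}",status="{status}"}} {count}'
        )
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_rebuilderd_results(PACKAGES), end="")
```

Grouping with a `Counter` keyed on the (suite, status) pair is what makes this awkward in jq/bash and straightforward in Python.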
  10. 05 Oct, 2020 1 commit
  11. 23 Sep, 2020 1 commit
  12. 21 Sep, 2020 2 commits
  13. 12 Sep, 2020 1 commit
  14. 06 Sep, 2020 3 commits
      Add rebuilderd build queue length textcollector · cd4b2844
      Jelle van der Waa authored
      Record the rebuilderd queue length in Prometheus so we can generate an
      alert when the queue length keeps rising, as this could indicate that
      the rebuilders have builds which are stuck.
      Add blackbox exporter for https status checking · 3fd36ddb
      Jelle van der Waa authored
      Run the blackbox exporter on monitoring.archlinux.org to monitor the
      HTTP status of the public services we provide on other machines. Also
      adds an alert for when a certificate is due to expire within 3 days.
      Introduce prometheus exporters role for collection · 23564b29
      Jelle van der Waa authored
      Add a new role called prometheus_exporters which should be run on every
      machine we have and starts different collectors depending on what group
      the machine is in. Currently supported are the gitlab runner exporter,
      rebuilder textcollector, mysqld-exporter, borg textcollector and a
      node/arch exporter. The arch exporter exposes the security status and a
      gauge of out-of-date pacman packages.