aboutsummaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAge
* Fix promu config (#1119)v0.17.0-rc.0Ben Kochie2018-10-20
| | | | | | Rename promu no-cgo config to default promu name to avoid crossbuild problems. Signed-off-by: Ben Kochie <superq@gmail.com>
* Update for 0.17.0-rc.0 release (#1118)Ben Kochie2018-10-19
| | | | | | * Update VERSION. * Update CHANGELOG. Signed-off-by: Ben Kochie <superq@gmail.com>
* Update cpufreq metrics collector (#1117)Ben Kochie2018-10-18
| | | | | | | * Update Linux cpufreq collector to use new procfs library functions. * Split thermal throttle collection to a separate function. * Add new required fixtures and repack ttar file. Signed-off-by: Ben Kochie <superq@gmail.com>
* Update vendor github.com/prometheus/client_golang/...@v0.9.0 (#1111)Ben Kochie2018-10-15
| | | | | | | * Update vendor github.com/prometheus/client_golang/...@v0.9.0 * Update vendor github.com/prometheus/common/... Signed-off-by: Ben Kochie <superq@gmail.com>
* fix a few minor golint warnings (#1110)Paul Gier2018-10-15
| | | Signed-off-by: Paul Gier <pgier@redhat.com>
* Update diskstats for linux kernel 4.19 (#1109)Paul Gier2018-10-15
| | | | | | | | | | The format of /proc/diskstats is changing in linux-4.19 to include some additional fields. See: https://www.kernel.org/doc/Documentation/iostats.txt * collector/diskstats: use constants for some hard coded strings * collector/diskstats: update diskstats for linux-4.19 * collector/diskstats: remove kernel doc url from individual metrics Signed-off-by: Paul Gier <pgier@redhat.com>
* Ignore additional virtual filesystems (#1104)Ben Kochie2018-10-12
| | | | | | | | | Add more virtual filesystems to the default ignore list * bpf * cgroup2 * selinuxfs * squashfs Signed-off-by: Ben Kochie <superq@gmail.com>
* Update vendoring (#1105)Ben Kochie2018-10-11
| | | | | | | | | | | * Update vendor github.com/sirupsen/logrus@v1.1.1 * Update vendor github.com/coreos/go-systemd/dbus@v17 * Update vendor github.com/golang/protobuf/proto@v1.2.0 * Update vendor github.com/konsorten/go-windows-terminal-sequences@v1.0.1 * Update vendor github.com/mdlayher/... * Update vendor github.com/prometheus/procfs/... * Update vendor golang.org/x/... Signed-off-by: Ben Kochie <superq@gmail.com>
* Update readme (#1107)Bryan Boreham2018-10-11
| | | | | | | | | | * State that wifi collector is disabled by default Signed-off-by: Bryan Boreham <bjboreham@gmail.com> * Add the 'processes' collector to the Readme Signed-off-by: Bryan Boreham <bjboreham@gmail.com>
* launchctl example based on LaunchDaemons (#1102)dbalakirev2018-10-10
| | | | | | LaunchDaemons are the correct way to create services that are restart proof. There is now only a single destination place mentioned in the readme for the plist file. Signed-off-by: Dávid Balakirev <dave00ster@gmail.com>
* Update prometheus/client_golang vendoring (#1099)Björn Rabenstein2018-10-05
| | | | | | This is mostly required to fix a bug with histograms on 32bit platforms. (Which might or might not be used in node_exporter. Just in case...) Signed-off-by: beorn7 <beorn@soundcloud.com>
* Remove continue with label (#1084)Dario Maiocchi2018-10-05
| | | | Instead of continue with label use helper function Signed-off-by: dmaiocchi <dmaiocchi@suse.com>
* Cleanup unused /proc/mounts fixture. (#1097)Ben Kochie2018-10-04
| | | | | | | * Cleanup unused /proc/mounts fixture. * Ignore Uint -> Unit in codespell. Signed-off-by: Ben Kochie <superq@gmail.com>
* infiniband: Handle iWARP* RDMA modules N/A (#974)Mario Trangoni2018-10-04
| | | | | | | | | | | * infiniband: Add not connected i40iw0/ports/1 fixtures * infiniband: Handle issue when iWARP* RDMA modules are not available This is related to #966, and handle this error, Jun 07 13:33:24 hostname node_exporter[81888]: time="2018-06-07T13:33:24+02:00" level=error msg="ERROR: infiniband collector failed after 0.000929s: strconv.ParseUint: parsing \"N/A (no PMA)\": invalid syntax" source="collector.go:132" Signed-off-by: Mario Trangoni <mjtrangoni@gmail.com>
* [continue 912] strip rootfs prefix for run in docker (#1058)Yecheng Fu2018-10-04
| | | | | | | | | | | | * strip rootfs prefix for run in docker * Use `/` as default value of path.rootfs, and parse mounts from `/proc/1/mounts`. * No need to mount `/proc` and `/sys` because we share host's PID namespace, which allows processes within the container to see all of the processes on the system. Closes: #66 Signed-off-by: Ivan Mikheykin <ivan.mikheykin@flant.com> Signed-off-by: Yecheng Fu <cofyc.jackson@gmail.com>
* Add node_exporter script for init.d (#1059)gentlejo2018-10-04
| | | | | * Add node_exporter script for init.d Signed-off-by: gentlejo <josungil@gmail.com>
* Add missing words and update markdown syntax (#1095)Andrew Banchich2018-10-03
| | | Signed-off-by: Andrew Banchich <andrewbanchich@gmail.com>
* Update cpu collector for OpenBSD 6.4 (#1094)Ralf Horstmann2018-10-02
| | | | | | | | | | | | | Starting with (not yet released) OpenBSD 6.4, sysctl KERN_CPTIME2 will return ENODEV for offline CPUs. SMT siblings are reported as offline when hw.smt is disabled, which is the default since one of the later Spectre variants. So this might affect a few systems. For more details see: https://cvsweb.openbsd.org/src/sys/kern/kern_sysctl.c#rev1.348 Signed-off-by: Ralf Horstmann <ralf+github@ackstorm.de>
* Add linux/s390x build (#1092)Ben Kochie2018-09-30
| | | Signed-off-by: Ben Kochie <superq@gmail.com>
* Add ppc64 build. (#1089)Ben Kochie2018-09-30
| | | Add ppc64 build.
* Update dependency mattn/go-xmlrpc (#1091)Fabian Heymann2018-09-30
| | | Signed-off-by: Fabian Heymann <fabian.heymann@finanzcheck.de>
* filesystem: Ignore netns/nsfs mounts (#1047)Daniele Sluijters2018-09-26
| | | | | | | | | | | When starting Docker containers a whole bunch of netns (network namespace) mounts are created that the node exporter can't make any sense of (and can't read either). This ignores all nsfs filesystems. Fixes #875 Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>
* Update build (#1081)Ben Kochie2018-09-25
| | | | | | | | | | * Update build * Only use CGO when building non-Linux. * Update build to Go 1.11 * Use tab indenting consistently. Signed-off-by: Ben Kochie <superq@gmail.com>
* Change systemd unit filtering (#1083)Ben Kochie2018-09-24
| | | | | | | | | | | * Change systemd unit filtering Get all units from systemd and filter in Go. * Improves compatibility with older versions of systemd. * Improve debugging by printing when units pass the filter. * Remove extraneous newlines from log messages. Signed-off-by: Ben Kochie <superq@gmail.com>
* collector/timex: remove cgo dependency (#1079)Luca Bruno2018-09-20
| | | | | | | | This removes the cgo import from timex collector, as it was only used to define two constants. Those are part of the Linux kernel<->userspace interface, thus there is no need to depend on libc to source them: https://github.com/torvalds/linux/blob/v4.18/include/uapi/linux/timex.h Signed-off-by: Luca Bruno <luca.bruno@coreos.com>
* textfile example script rework (#1074)Christopher Blum2018-09-18
| | | | | | | | | | | | | | * textfile smartmon.sh Added functions to also parse megaraid disks. Added parsing to also detect the grown_defects counters. * textfile storcli.py Reworked the example file to export lots more information about megaraid attached controllers, VDs and PDs. Signed-off-by: Christopher Blum <christopher.blum@profitbricks.com>
* Update vendoring for client_golang and friends (#1076)Björn Rabenstein2018-09-17
| | | Signed-off-by: beorn7 <beorn@soundcloud.com>
* Note how to get moreutils on FreeBSD (#1073)Mateusz Piotrowski2018-09-14
| | | Signed-off-by: Mateusz Piotrowski <0mp@FreeBSD.org>
* Correctly cast Darwin memory info (#1060)Ben Kochie2018-09-07
| | | | | | | | | | * Correctly cast Darwin memory info * Cast stats to float64 before doing math on them to avoid integer wrapping. * Remove invalid `_total` suffix from gauge values. * Handle counters in `meminfo.go`. Signed-off-by: Ben Kochie <superq@gmail.com>
* Fix typo on description of read_time_seconds_total (#1057)Marco Tulio R Braga2018-09-02
| | | | | | Fix typo on unit description of metric `*read_time_seconds_total` from milliseconds to seconds. Signed-off-by: Marco Tulio R Braga <marco.tulio@mtulio.eng.br>
* Using the recommended syntax for maintainer label (#1053)Tariq Ibrahim2018-08-28
| | | Signed-off-by: Tariq Ibrahim <tariq.ibrahim@microsoft.com>
* Fix SmartOS build #1017 (#1018)Dan Fredell2018-08-23
| | | Signed-off-by: Dan Fredell <Dan.Fredell@gmail.com>
* Add metrics exposing extended md RAID info (#958)Matt Bostock2018-08-18
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Add metrics that expose more information about MD RAID devices and disks: - the RAID level in use - the RAID set that a disk belongs to This allows for things like alert on unusually high I/O utilisation for a disk compared to other disks in the same RAID set, which usually means the disk is failing, and for comparing write/read latency across RAID sets. Output looks like: node_md_disk_info{disk_device="/dev/dm-0", md_device="md1", md_set="A"} 1 node_md_disk_info{disk_device="/dev/dm-3", md_device="md1", md_set="B"} 1 node_md_disk_info{disk_device="/dev/dm-2", md_device="md1", md_set="A"} 1 node_md_disk_info{disk_device="/dev/dm-1", md_device="md1", md_set="B"} 1 node_md_disk_info{disk_device="/dev/dm-4", md_device="md1", md_set="A"} 1 node_md_disk_info{disk_device="/dev/dm-5", md_device="md1", md_set="B"} 1 node_md_info{md_device="md1", md_name="foo", raid_level="10", md_metadata_version="1.2"} 1 The `node_md_info` metric, which gives additional information about the RAID array, is intentionally separate to avoid adding all of those labels to each disk. If you need to query using the labels contained in `node_md_info`, you can do that using PromQL: https://www.robustperception.io/how-to-have-labels-for-machine-roles/ I looked at adding the array UUID, but there's no sysfs entry for it and I'm not sure there's a strong use case for it. This patch to add a sysfs entry for the UUID was apparently not accepted: https://www.spinics.net/lists/raid/msg40667.html Add these metrics as a textfile script rather than adding them to the Go 'md' module as they're perhaps less commonly useful. If lots of people find them useful, we can later rewrite this in Go. Signed-off-by: Matt Bostock <mbostock@cloudflare.com>
* vendor: bump github.com/mdlayher/wifi and dependencies (#1045)Matt Layher2018-08-14
| | | Signed-off-by: Matt Layher <mdlayher@gmail.com>
* NRestarts or NRefused aren't available on older systemd versions (#1039)James Hartig2018-08-14
| | | | | | * If NRestarts or NRefused are not available, don't ignore the unit itself * Don't report systemd metrics (NRestarts/NRefused) that are not available Signed-off-by: James Hartig <james@getadmiral.com>
* Handle vanishing PIDs (#1043)Ben Kochie2018-08-13
| | | | | | | | | | PIDs can vanish (exit) from /proc/ between gathering the list of PIDs and getting all of their stats. * Ignore file not found errors. * Explicitly count the PIDs we find. * Cleanup some error style issues. Signed-off-by: Ben Kochie <superq@gmail.com>
* Update build (#1041)Ben Kochie2018-08-13
| | | | | | | | | | Update build * Update to Go 1.10. * Enable `ppc64le` build. * Enable MIPS builds. Signed-off-by: Ben Kochie <superq@gmail.com>
* Disable wifi collector by default (#1037)Ben Kochie2018-08-07
| | | | | | | | | | * Disable wifi collector by default Disable the wifi collector by default due to suspected cashing issues and goroutine leaks. * https://github.com/prometheus/node_exporter/issues/870 * https://github.com/prometheus/node_exporter/issues/1008 Signed-off-by: Ben Kochie <superq@gmail.com>
* Fix supervisord collector (#978)Ben Kochie2018-08-06
| | | | | | | | | | | | | * Replace supervisord xmlrpc library * Use `github.com/mattn/go-xmlrpc` that doesn't leak goroutines. * Fix uptime metric * Use Prometheus best practices for uptime metric. * Use "start time" rather than "uptime". * Don't emit a start time if the process is down. * Add changelog entry. * Add example compatibility rules. Signed-off-by: Ben Kochie <superq@gmail.com>
* systemd: Remove unneeded/unhandled error returns (#1035)Julius Volz2018-08-05
| | | Signed-off-by: Julius Volz <julius.volz@gmail.com>
* build: make staticcheck happy by using real regexp patterns #1025 (#1026)Christian Hoffmann2018-07-30
| | | Signed-off-by: Christian Hoffmann <mail@hoffmann-christian.info>
* Fix sample rules for migration (#1022)Rene Treffer2018-07-27
| | | | | | | - add conversion from _ms to _seconds on disk metrics - add missing node_textfile_mtime section - add groups: header to pass promtool check rules Signed-off-by: Rene Treffer <rene.treffer@soundcloud.com>
* Enable nfs protocol (#998)Hannes Körber2018-07-24
| | | | | | | | | | | | | | | | | | | * vendor: Update prometheus/procfs Signed-off-by: Hannes Körber <hannes.koerber@haktec.de> * mountstats: Use new NFS protocol field In https://github.com/prometheus/procfs/pull/100, the NFSTransportStats struct was expanded by a field called protocol that specifies the NFS protocol in use, either "tcp" or "udp". This commit adds the protocol as a label to all NFS metrics exported via the mountstats collector. Signed-off-by: Hannes Körber <hannes.koerber@haktec.de> * Update fixtures for UDP mount Signed-off-by: Hannes Körber <hannes.koerber@haktec.de>
* Exclude only subdirectories of /var/lib/docker (#1003)Johannes Wienke2018-07-23
| | | | | | It is quite common to put /var/lib/docker itself on a separate partition and that should be monitored as well. Signed-off-by: Johannes Wienke <languitar@semipol.de>
* Fix docker build (#1016)Ben Kochie2018-07-23
| | | | | | Fix override of make docker target to include new `DOCKER_REPO` variable pattern. Signed-off-by: Ben Kochie <superq@gmail.com>
* Update build (#1010)Ben Kochie2018-07-23
| | | | | | | * Update from upstream `Makefile.common`. * Update CircleCI with simplifed upstream templating. * Cleanup `Makefile`. Signed-off-by: Ben Kochie <superq@gmail.com>
* Fix ntp collector thread safety (#1014)Ben Kochie2018-07-22
| | | | | | Make the ntp collector thread safe by wrapping a mutex lock around the leapMidnight variable. Signed-off-by: Ben Kochie <superq@gmail.com>
* Filter out uninstalled systemd units when collecting all units (#1011)xginn82018-07-22
| | | | | fixes #567 Signed-off-by: Matthew McGinn <mamcgi@gmail.com>
* Add systemd uptime metric collection (#952)Sven Lange2018-07-18
| | | | | * Add systemd uptime metric collection Signed-off-by: Sven Lange <tdl@hadiko.de>
* Add note about SYS_TIME capability for Docker. (#1001)Ben Kochie2018-07-16
| | | Signed-off-by: Ben Kochie <superq@gmail.com>