aboutsummaryrefslogtreecommitdiff
path: root/README.md
Commit message (Collapse)AuthorAge
* Update the link to prometheus-dcgm胡玮文2020-07-14
| | | | | | The original link is broken (404). Signed-off-by: 胡玮文 <huww98@outlook.com>
* Update windows_exporter referenceCalle Pettersson2020-06-01
| | | | Signed-off-by: Calle Pettersson <calle@cape.nu>
* Add perf tracepoint collection flag (#1664)Daniel Hodges2020-04-17
| | | | | * Add tracepoint collector option for perf collector Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>
* minor README doc fix re. collector.perf.cpusMichael Vorburger ⛑️2020-04-17
| | | | Signed-off-by: Michael Vorburger <mike@vorburger.ch>
* Add gauges for allocated memory for queued UDP and TCP packages (#1503)Peter Bueschel2020-03-31
| | | | | | | | | | | | | | | | * Two new states will be added to the tcpstat collector called rx_queued_bytes and tx_queued_bytes. For UDP datagrams an additional collector 'udp_queues' can be used to expose the total lengths of the tx_queue and rx_queue. @SuperQ and @discordianfish this changes gives us the option to check for overloaded UDP + TCP processing. The names of the new TCP states and the UDP metric can be discussed. The current reasons are just: I don't want to add another collector for the same exposed file, so I just added the new states to the tcpstat collector. I chose the name 'udp_queue' instead of 'udpstat' as UDP has no state. Signed-off-by: Peter Bueschel <peter.bueschel@logmein.com>
* Release 1.0.0-rc.0 (#1614)v1.0.0-rc.0Ben Kochie2020-02-20
| | | | | | | Update CHANGELOG/VERSION for 1.0.0-rc.0 release. * Add a note about new https settings to top-level README. * Mark --web.config flag as experimental. Signed-off-by: Ben Kochie <superq@gmail.com>
* Fix num cpu (#1561)Daniel Hodges2020-02-20
| | | | | | | | | | | | | | | | | | | | * add a map of profilers to CPUids `runtime.NumCPU()` returns the number of CPUs that the process can run on. This number does not necessarily correlate to CPU ids if the affinity mask of the process is set. This change maintains the current behavior as default, but also allows the user to specify a range of CPUids to use instead. The CPU id is stored as the value of a map keyed on the profiler object's address. Signed-off-by: Joe Damato <jdamato@fastly.com> Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com> Signed-off-by: Daniel Hodges <hodges@uber.com> Co-authored-by: jdamato-fsly <55214354+jdamato-fsly@users.noreply.github.com>
* rapl_linux collectorUkri Niemimuukko2020-02-01
| | | | | | | This exposes RAPL statistics from /sys/class/powercap. Co-Authored-By: Ben Kochie <superq@gmail.com> Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
* Add softnet collector (#1576)Peter Nicholson2019-12-30
| | | Signed-off-by: Peter Nicholson <petergoods@hotmail.com>
* Scrape cooling_device stateAlex Schmitz2019-08-30
| | | | Signed-off-by: Alex Schmitz <alex.schmitz@gmail.com>
* Update rootfs syntax in Docker example (#1443)Sandro Jäckel2019-08-07
| | | Signed-off-by: Sandro Jäckel <sandro.jaeckel@gmail.com>
* Extends uname collector to export on Darwin OS (#1433)Philip Gough2019-08-03
| | | | | Adds uname collector support for Darwin and OpenBSD Signed-off-by: Philip Gough <philip.p.gough@gmail.com>
* Scrape CPU latency stats from /proc/schedstat (#1389)Phil Frost2019-07-10
| | | | | | | | | | | | | These are useful as a direct indication of CPU contention and task scheduler latency. Handy references: - https://github.com/torvalds/linux/blob/master/Documentation/scheduler/sched-stats.txt - https://doc.opensuse.org/documentation/leap/tuning/html/book.sle.tuning/cha.tuning.taskscheduler.html procfs is updated to pull in the enabling change: https://github.com/prometheus/procfs/pull/186 Signed-off-by: Phil Frost <phil@postmates.com>
* Add perf exporter (#1274)Daniel Hodges2019-05-07
| | | Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>
* README: Move pressure to enabled table (#1325)Daniele Sluijters2019-04-18
| | | | | Follow-up from #1261. Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>
* Expose /proc/pressure (#1261)Daniele Sluijters2019-04-18
| | | | | | | | | This enables the collection of pressure stall information as exposed by the `/proc/pressure` interface added in the 4.20 release of the Linux kernel. Closes #1174 Signed-off-by: Daniele Sluijters <daenney@users.noreply.github.com>
* Update README: Add note about ts in textfileJohannes 'fish' Ziemke2019-03-19
| | | This closes #1284
* collector/cpu: split cpu freq metrics into separate collector (#1253)Paul Gier2019-02-19
| | | | | | | | | | | The cpu frequency information is not always needed and/or available. This change allows the cpu frequency metrics to be enabled/disabled separately from the other cpu metrics, and also prevents a frequency metric failure (such as a parse error) from failing the main cpu collector. Fixes #1241 Signed-off-by: Paul Gier <pgier@redhat.com>
* Add diskstats collector for OpenBSD (#1250)Ralf Horstmann2019-02-06
| | | | | | | | | | | | | * Add diskstats collector for OpenBSD Tested on i386 and amd64, OpenBSD 6.4 and -current. * Refactor diskstats collectors This moves common descriptors from Linux, Darwin, OpenBSD diskstats collectors into diskstats_common.go Signed-off-by: Ralf Horstmann <ralf+github@ackstorm.de>
* collector: Implement uname collector for FreeBSD (#1239)David O'Rourke2019-02-05
| | | | | * collector: Implement uname collector for FreeBSD Signed-off-by: David O'Rourke <david.orourke@gmail.com>
* Add kstat-based Solaris metrics (#1197)Jon Davies2019-01-12
| | | | | | | | | | | | * collector/loadavg_solaris.go: Use libkstat to gather load averages. * go.mod: Added go-kstat. * boot_time_solaris.go: Added. * cpu_solaris.go: Added. * README.md: Updated entries for Solaris. * collector/zfs_solaris.go: Added. * CHANGELOG.md: Added note about kstat-based Solaris metrics. Signed-off-by: Jonathan Davies <jpds@protonmail.com>
* Add link to prometheus-dcgm (#1164)Johannes 'fish' Ziemke2018-11-19
| | | Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
* Update readme (#1107)Bryan Boreham2018-10-11
| | | | | | | | | | * State that wifi collector is disabled by default Signed-off-by: Bryan Boreham <bjboreham@gmail.com> * Add the 'processes' collector to the Readme Signed-off-by: Bryan Boreham <bjboreham@gmail.com>
* [continue 912] strip rootfs prefix for run in docker (#1058)Yecheng Fu2018-10-04
| | | | | | | | | | | | * strip rootfs prefix for run in docker * Use `/` as default value of path.rootfs, and parse mounts from `/proc/1/mounts`. * No need to mount `/proc` and `/sys` because we share host's PID namespace, which allows processes within the container to see all of the processes on the system. Closes: #66 Signed-off-by: Ivan Mikheykin <ivan.mikheykin@flant.com> Signed-off-by: Yecheng Fu <cofyc.jackson@gmail.com>
* Add missing words and update markdown syntax (#1095)Andrew Banchich2018-10-03
| | | Signed-off-by: Andrew Banchich <andrewbanchich@gmail.com>
* Add note about SYS_TIME capability for Docker. (#1001)Ben Kochie2018-07-16
| | | Signed-off-by: Ben Kochie <superq@gmail.com>
* Add sys/class/net parsing from procfs and expose its metrics (#851)Jan Klat2018-07-16
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * add sys/class/net parsing from procfs and expose its metrics Signed-off-by: Jan Klat <jenik@klatys.cz> * change code to use int pointers per procfs change, move netclass to separate collector, change metric naming Signed-off-by: Jan Klat <jenik@klatys.cz> * bump year in licence, remove redundant newline, correct fixtures Signed-off-by: Jan Klat <jenik@klatys.cz> * fix style Signed-off-by: Jan Klat <jenik@klatys.cz> * change carrier changes to counter type Signed-off-by: Jan Klat <jenik@klatys.cz> * fix e2e output Signed-off-by: Jan Klat <jenik@klatys.cz> * add fixtures Signed-off-by: Jan Klat <jenik@klatys.cz> * update vendor, use fixtures correctly Signed-off-by: Jan Klat <jenik@klatys.cz> * change fixtures (device in /sys/class/net should be symlinked) Signed-off-by: Jan Klat <jenik@klatys.cz> * correct fixtures for 64k page, updated readme Signed-off-by: Jan Klat <jenik@klatys.cz>
* Remove travis build badgeJohannes 'fish' Ziemke2018-05-30
| | | | Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
* Merge pull request #852 from prometheus/remove-gmondBen Kochie2018-04-27
|\ | | | | Remove gmond collector
| * Remove gmond collectorJohannes 'fish' Ziemke2018-04-17
| | | | | | | | Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>
* | update README.md and CHANGELOG.md to reflect new bsd boottime collectormischief2018-04-15
| | | | | | | | Signed-off-by: mischief <mischief@offblast.org>
* | Fix spelling mistakes found by codespellKarsten Weiss2018-04-09
| | | | | | | | Signed-off-by: Karsten Weiss <knweiss@gmail.com>
* | Enable bonding collector by default. (#872)Brian Brazil2018-03-29
|/ | | Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>
* Enable NFS client metrics by default. (#828)Ben Kochie2018-02-16
| | | | | | Enable NFS client metrics by default now that it nolonger prints errors on scrape if there are no metrics to display. Also fixup the nfsd README to match the nfs entry.
* Update README with nfsd collector. (#826)Ben Kochie2018-02-16
| | | Add missing documentation about the `nfsd` collector.
* Remove obsolete megacli collector. (#798)Ben Kochie2018-01-23
| | | | This collector has been replaced by the textfile collector tool `storcli.py`.
* Update collect[] param documentation (#784)Ben Kochie2018-01-10
| | | | | | Improve recommendations and wording around advanced use of the collect[] param. Remove example that causes users to copy-and-paste it.
* Update READMEBen Kochie2018-01-04
| | | | | | Add OpenBSD to supported list for meminfo collector[0]. [0]: https://github.com/prometheus/node_exporter/pull/724
* Use host PID namespace in docker example (#672)david2017-11-02
| | | | | | | | | | * Use host PID namespace in docker example See https://github.com/prometheus/node_exporter/issues/671 * Update readme for readability * Fix comments in readme
* Add `collect[]` parameter (#699)Siavash Safi2017-10-14
| | | | | | | | | | | | | | | | | | * Add `collect[]` parameter * Add TODo comment about staticcheck ignored * Restore promhttp.HandlerOpts * Log a warning and return HTTP error instead of failing * Check collector existence and status, cleanups * Fix warnings and error messages * Don't panic, return error if collector registration failed * Update README
* Add extra notes to Building section (#694)Ben Kochie2017-10-11
| | | | * Add link to Golang * Add note about RHEL/CentOS build dep.
* add rslave for docker example, so node_exporter can receive host ↵Wei Wei2017-10-11
| | | | mount/unmount events (#660)
* Replace --collectors.enabled with per-collector flags (#640)Calle Pettersson2017-09-28
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Move NodeCollector into package collector * Refactor collector enabling * Update README with new collector enabled flags * Fix out-of-date inline flag reference syntax * Use new flags in end-to-end tests * Add flag to disable all default collectors * Track if a flag has been set explicitly * Add --collectors.disable-defaults to README * Revert disable-defaults flag * Shorten flags * Fixup timex collector registration * Fix end-to-end tests * Change procfs and sysfs path flags * Fix review comments
* Add timex collector (#664)Sami Kerola2017-09-19
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This collector is based on adjtimex(2) system call. The collector returns three values, status if time is synchronised, offset to remote reference, and local clock frequency adjustment. Values are taken from kernel time keeping data structures to avoid getting involved how the synchronisation is implemented. By that I mean one should not care if time is update using ntpd, systemd.timesyncd, ptpd, and so on. Since all time sync implementation will always end up telling to kernel what is the status with time one can simply omit the software in between, and look results of the syncing. As a positive side effect this makes collector very quick and conceptually specific, this does not monitor availability of NTP server, or network in between, or dns resolution, and other unrelated but necessary things. Minimum set of values to keep eye on are the following three: The node_timex_sync_status tells if local clock is in sync with a remote clock. Value is set to zero when synchronisation to a reliable server is lost, or a time sync software is misconfigured. The node_timex_offset_seconds tells how much local clock is off when compared to reference. In case of multiple time references this value is outcome of RFC 5905 adjustment algorithm. Ideally offset should be close to zero, and it depends about use case how large value is acceptable. For example a typical web server is probably fine if offset is about 0.1 or less, but that would not be good enough for mobile phone base station operator. The node_timex_freq tells amount of adjustment to local clock tick frequency. For example if offset is one second and growing the local clock will need instruction to tick quicker. Number value itself is not very important, and occasional small adjustments are fine. When frequency is unusually in stable one can assume quality of time stamps will not be accurate to very far in sub second range. Obviously explaining why local clock frequency behaves like a passenger in roller coaster is different matter. Explanations can vary from system load, to environmental issues such as a machine being physically too hot. Rest of the measurements can help when debugging. If you run a clock server do probably want to collect and keep track of everything. Pull-request: https://github.com/prometheus/node_exporter/pull/664
* Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check (#655)Leonid Evdokimov2017-09-19
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | * Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check 1. Checking local clock against remote NTP daemon is bad idea, local ntpd acting as a client should do it better and avoid excessive load on remote NTP server so the collector is refactored to query local NTP server. 2. Checking local clock against remote one does not check local ntpd itself. Local ntpd may be down or out of sync due to network issues, but clock will be OK. 3. Checking NTP server using sanity of it's response is tricky and depends on ntpd implementation, that's why common `node_ntp_sanity` variable is exported. * `govendor add golang.org/x/net/ipv4`, it is dependency of github.com/beevik/ntp * Update github.com/beevik/ntp to include boring SNTP fix * Use variable name from RFC5905 * ntp: move code to make export of raw metrics more explicit * Move NTP math to `github.com/beevik/ntp` * Make `golint` happy * Add some brief docs explaining `ntp` #655 and `timex` #664 modules * ntp: drop XXX comment that got its decision * ntp: add `_seconds` suffix to relevant metrics * Better `node_ntp_leap` comment * s/node_ntp_reftime/node_ntp_reference_timestamp_seconds/ as requested by @discordianfish * Extract subsystem name to const as suggested by @SuperQ
* Add buildkite status badge.Ben Kochie2017-08-24
|
* Clarify Infiniband collector support (#643)Joe Handzik2017-08-16
| | | | | Tested a DL360 Gen9 box with an Omni-Path adapter in it. The existing InfiniBand collector can provide support for the same metrics on Omni-Path cards as well. Signed-Off-By: Joe Handzik <joseph.t.handzik@hpe.com>
* Switch to kingpin flags (#639)Calle Pettersson2017-08-12
| | | | | | | | | | * Switch to kingpin flags * Fix logrus vendoring * Fix flags in main tests * Fix vendoring versions
* Status information in /proc/drbd (#630)Vojtech Galda2017-08-02
| | | in version 8.4 deprecated (but won’t be removed)
* Fix Docker mountpoint prefix docsTeoh Han Hui2017-07-28
|