vzrmond Configuration File
This file (
/etc/vzrmond.conf
) is the configuration file for the
vzrmond
daemon which is running on the Monitor Server and provides the remote monitoring of servers registered in it and the sending of alerts to the specified e-mail addresses. It also allows you to use external applications for sending alerts (e.g. via ICQ or SMS). The file lists a number of parameters some of which have values that should be provided by the user (from
HOSTS
through
CUSTOM_LIST
). These values are included in double quotes and separated by spaces from each other. The remaining parameters have default values that may be altered by the user. They are not included in quotes.
Parameter
|
Description
|
Default value
|
HOSTS
|
The list of hosts to be monitored delimited by spaces. Both hostnames and IP addresses are allowed.
|
“”
|
EMAIL_ADDRESSES
|
E-mail addresses to receive the alerts. Must be separated by spaces.
|
“”
|
EMAIL_NOTIFICATIONS
|
The types of notifications to be sent to the specified e-mail address(es).
|
SYSTEM_UP SYSTEM_DOWN
DISK_OK
DISK_BAD
INODES_NORM INODES_HIGH
HDDBUSY_NORM HDDBUSY_HIGH
SSH_UP
SSH_DOWN
VZSTAT_OK
VZSTAT_BAD
LOADAVG_NORM LOADAVG_HIGH
UNINT_NORM UNINT_HIGH
MEMLATM_NORM MEMLATM_HIGH
MEMLATA_NORM MEMLATA_HIGH
CPULATM_NORM CPULATM_HIGH
CPULATA_NORM CPULATA_HIGH
SWAPIN_NORM SWAPIN_HIGH
SWAPOUT_NORM SWAPOUT_HIGH
|
CUSTOM_ACTION
|
The program to send alerts of a customized type (e.g. via ICQ or SMS).
|
“”
|
CUSTOM_LIST
|
Options passed as the command-line parameters of the program specified by
CUSTOM_ACTION
. Must be separated by spaces.
|
“”
|
POLL_PERIOD
|
Periodicity of checking up the registered servers, in seconds.
|
15
|
CHK_MAX_FAILS
|
After this number of unsuccessful attempts to reach a server, the “Server is dead” alert is sent.
|
4
|
LOAD_AVG
|
The average number of processes on the server. When this value is exceeded, an alert is sent.
|
30
|
PROC_UNINT
|
The number of uninterruptable sleeping processes (in the “D” state). When this value is exceeded, an alert is sent.
|
20
|
CPU_LAT_MAX_ERR
|
The maximal process scheduling latency, in milliseconds. When this value is exceeded, an alert is sent.
|
1000
|
CPU_LAT_AVG_ERR
|
The average process scheduling latency, in milliseconds. When this value is exceeded, an alert is sent.
|
750
|
MEM_LAT_MAX_ERR
|
The maximal memory allocation latency, in milliseconds. When this value is exceeded, an alert is sent.
|
500
|
MEM_LAT_AVG_ERR
|
The average memory allocation latency, in milliseconds. When this value is exceeded, an alert is sent.
|
400
|
SWAP_IN_ERR
|
The swap in activity, in Mb/s. When this value is exceeded, an alert is sent.
|
1.0
|
SWAP_OUT_ERR
|
The swap out activity, in Mb/s. When this value is exceeded, an alert is sent.
|
1.0
|
DISK_FREE_INODES_ERR
|
The percentage of free disk inodes. When the actual value becomes less than this value, an alert is sent.
|
5
|
DISK_FREE_SPACE_ERR
|
The percentage of free disk space. When the actual value becomes less than this value, an alert is sent.
|
5
|
To be able to begin monitoring a server, you should provide the valid values for the
HOSTS
and
EMAIL
parameters. If you wish to use an external program for sending alerts about the server state, you should install in on the Monitor Server and provide its name and options in the
CUSTOM_ACTION
and
CUSTOM_LIST
parameters. The alert message text will be sent as the standard input for the specified program.
You should increase the value of the
POLL_PERIOD
parameter together with the increase in the number of monitored servers not to create an overload on the Monitor Server. The parameters related to the scheduling latency, memory allocation latency, and swap in/out activity serve to have an alert generated if the system’s performance plummets due to the abnormal values of these parameters.
Do not forget to restart the
vzrmond
daemon after you have edited this configuration file.
|