vzrmond Configuration File

This file (/etc/vzrmond.conf) is the configuration file for the vzrmond daemon, which runs on the Monitor Server, remotely monitors the servers registered in it, and sends alerts to the specified e-mail addresses. It also allows you to use external applications for sending alerts (e.g. via ICQ or SMS). The file lists a number of parameters. The values of some of them (from HOSTS through CUSTOM_LIST) should be provided by the user; these values are enclosed in double quotes, and multiple items inside the quotes are separated by spaces. The remaining parameters have default values that may be altered by the user; their values are not enclosed in quotes.

| Parameter | Description | Default value |
|---|---|---|
| HOSTS | The list of hosts to be monitored, delimited by spaces. Both hostnames and IP addresses are allowed. | "" |
| EMAIL_ADDRESSES | E-mail addresses to receive the alerts. Must be separated by spaces. | "" |
| EMAIL_NOTIFICATIONS | The types of notifications to be sent to the specified e-mail address(es). | SYSTEM_UP SYSTEM_DOWN DISK_OK DISK_BAD INODES_NORM INODES_HIGH HDDBUSY_NORM HDDBUSY_HIGH SSH_UP SSH_DOWN VZSTAT_OK VZSTAT_BAD LOADAVG_NORM LOADAVG_HIGH UNINT_NORM UNINT_HIGH MEMLATM_NORM MEMLATM_HIGH MEMLATA_NORM MEMLATA_HIGH CPULATM_NORM CPULATM_HIGH CPULATA_NORM CPULATA_HIGH SWAPIN_NORM SWAPIN_HIGH SWAPOUT_NORM SWAPOUT_HIGH |
| CUSTOM_ACTION | The program used to send alerts of a customized type (e.g. via ICQ or SMS). | "" |
| CUSTOM_LIST | Options passed as command-line parameters to the program specified by CUSTOM_ACTION. Must be separated by spaces. | "" |
| POLL_PERIOD | The interval between checks of the registered servers, in seconds. | 15 |
| CHK_MAX_FAILS | The number of unsuccessful attempts to reach a server after which the “Server is dead” alert is sent. | 4 |
| LOAD_AVG | The average number of processes on the server (the load average). When this value is exceeded, an alert is sent. | 30 |
| PROC_UNINT | The number of processes in uninterruptible sleep (the “D” state). When this value is exceeded, an alert is sent. | 20 |
| CPU_LAT_MAX_ERR | The maximum process scheduling latency, in milliseconds. When this value is exceeded, an alert is sent. | 1000 |
| CPU_LAT_AVG_ERR | The average process scheduling latency, in milliseconds. When this value is exceeded, an alert is sent. | 750 |
| MEM_LAT_MAX_ERR | The maximum memory allocation latency, in milliseconds. When this value is exceeded, an alert is sent. | 500 |
| MEM_LAT_AVG_ERR | The average memory allocation latency, in milliseconds. When this value is exceeded, an alert is sent. | 400 |
| SWAP_IN_ERR | The swap-in activity, in Mb/s. When this value is exceeded, an alert is sent. | 1.0 |
| SWAP_OUT_ERR | The swap-out activity, in Mb/s. When this value is exceeded, an alert is sent. | 1.0 |
| DISK_FREE_INODES_ERR | The percentage of free disk inodes. When the actual value drops below this value, an alert is sent. | 5 |
| DISK_FREE_SPACE_ERR | The percentage of free disk space. When the actual value drops below this value, an alert is sent. | 5 |
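
The quoting rules described above can be illustrated with a small parser. This is only a sketch: the `KEY=value` line layout, the host names, and the e-mail addresses in `SAMPLE_CONF` are assumptions made for the example and are not taken from an actual vzrmond.conf, and `parse_conf` is a hypothetical helper, not part of vzrmond.

```python
# Hypothetical fragment: the KEY=value layout is an assumption for illustration.
SAMPLE_CONF = """\
HOSTS="192.0.2.10 node2.example.com"
EMAIL_ADDRESSES="admin@example.com oncall@example.com"
CUSTOM_ACTION=""
CUSTOM_LIST=""
POLL_PERIOD=15
CHK_MAX_FAILS=4
SWAP_IN_ERR=1.0
"""


def parse_conf(text):
    """Parse KEY=value lines; quoted values become lists of items."""
    params = {}
    for raw in text.splitlines():
        line = raw.strip()
        key, sep, value = line.partition("=")
        if not sep:
            continue  # skip blank lines and lines without an assignment
        value = value.strip()
        if value.startswith('"'):
            # User-provided values: double-quoted, items separated by spaces.
            params[key.strip()] = value.strip('"').split()
        else:
            # Parameters with defaults: unquoted, kept as plain strings.
            params[key.strip()] = value
    return params


conf = parse_conf(SAMPLE_CONF)
print(conf["HOSTS"])
print(conf["POLL_PERIOD"])
```

Keeping the unquoted values as strings avoids guessing whether a given parameter is an integer or a decimal; a caller can convert them as needed.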

To begin monitoring a server, you should provide valid values for the HOSTS and EMAIL_ADDRESSES parameters. If you wish to use an external program for sending alerts about the server state, you should install it on the Monitor Server and specify its name and options in the CUSTOM_ACTION and CUSTOM_LIST parameters. The alert message text is passed to the specified program on its standard input.
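
An external alert program therefore only needs to read its options from the command line and the alert text from standard input. The following is a minimal sketch of such a program, not a tool shipped with vzrmond: the names `format_alert` and `main`, the script name `notify.py`, the option `oncall`, and the single-line output format are all assumptions made for illustration, and the exact wording of real alert messages may differ.

```python
import io
import sys
from datetime import datetime, timezone


def format_alert(options, alert_text, now=None):
    """Build one record from the CUSTOM_LIST options and the alert text."""
    now = now or datetime.now(timezone.utc)
    targets = ",".join(options) if options else "-"
    # Collapse the alert to a single line so that each alert is one record.
    body = " ".join(alert_text.split())
    return f"{now.isoformat()} [{targets}] {body}"


def main(argv, stdin, stdout):
    # argv[1:] carries the options configured in CUSTOM_LIST;
    # the alert message text arrives on standard input.
    stdout.write(format_alert(argv[1:], stdin.read()) + "\n")
    return 0


# Demonstration with in-memory streams instead of a real invocation.
demo_out = io.StringIO()
main(["notify.py", "oncall"], io.StringIO("Server is dead\n"), demo_out)
print(demo_out.getvalue(), end="")

# When saved as an executable script, replace the demonstration with:
#     sys.exit(main(sys.argv, sys.stdin, sys.stdout))
```

Writing the record to standard output keeps the sketch neutral; a real program would hand the text to whatever ICQ or SMS gateway is available on the Monitor Server.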

As the number of monitored servers grows, you should increase the value of the POLL_PERIOD parameter accordingly so as not to overload the Monitor Server. The parameters related to the scheduling latency, memory allocation latency, and swap in/out activity cause an alert to be generated when the system’s performance degrades badly enough for these metrics to cross the configured thresholds.
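
The way the threshold parameters are described in the table (most trigger when a value is exceeded, the two disk parameters trigger when the value falls below the limit) can be sketched as follows. This illustrates the comparison logic only and is not the daemon's actual implementation; the function name `violated` and the shape of the `measured` dictionary are assumptions, while the limits are the default values from the table.

```python
# Default thresholds from the parameter table.
# An alert is sent when the measured value EXCEEDS these limits.
UPPER_LIMITS = {
    "LOAD_AVG": 30,
    "PROC_UNINT": 20,
    "CPU_LAT_MAX_ERR": 1000,  # milliseconds
    "CPU_LAT_AVG_ERR": 750,   # milliseconds
    "MEM_LAT_MAX_ERR": 500,   # milliseconds
    "MEM_LAT_AVG_ERR": 400,   # milliseconds
    "SWAP_IN_ERR": 1.0,
    "SWAP_OUT_ERR": 1.0,
}

# An alert is sent when the measured percentage drops BELOW these limits.
LOWER_LIMITS = {
    "DISK_FREE_INODES_ERR": 5,
    "DISK_FREE_SPACE_ERR": 5,
}


def violated(measured):
    """Return the names of parameters whose threshold is crossed."""
    hits = []
    for name, limit in UPPER_LIMITS.items():
        if name in measured and measured[name] > limit:
            hits.append(name)
    for name, limit in LOWER_LIMITS.items():
        if name in measured and measured[name] < limit:
            hits.append(name)
    return hits


# Hypothetical measurements for one polled server.
sample = {"LOAD_AVG": 31, "PROC_UNINT": 20, "DISK_FREE_SPACE_ERR": 4}
print(violated(sample))
```

A value exactly equal to its limit is not reported here, following the "exceeded" and "less than" wording of the parameter descriptions.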

Do not forget to restart the vzrmond daemon after you have edited this configuration file.