Monitoring partition percentages

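Replication Server operates on 1MB partition segments. Whenever it allocates or deallocates a partition segment, it calculates two statistics: the percentage of total partition segments in use, and the percentage of total partition segments in use by the affected stable queue.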

If the percentage of partition segments in use rises above the percentage specified by sqm_warning_thr1 or sqm_warning_thr2, a message like the following is written to the log file:

WARNING: Stable Storage Use is Above threshold percent

If you see this message often, you may need to add partitions to the Replication Server or correct a recurring failure that causes the queues to fill.

When the percentage of partition segments in use drops below the percentage specified by sqm_warning_thr1 or sqm_warning_thr2, a message like the following is written to the log file to note that the condition that caused the original warning no longer exists:

WARNING CANCEL: Stable Storage Use is Below threshold percent
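The warning and cancel behavior described above can be sketched in a few lines of Python. This is only an illustration of the threshold logic, not Replication Server's implementation: the function name check_thresholds, the strict comparisons, and the way the active warnings are tracked are all assumptions, and the values 75 and 90 merely stand in for sqm_warning_thr1 and sqm_warning_thr2.

```python
def check_thresholds(used_segments, total_segments, thresholds, active):
    """Return log messages for thresholds crossed since the last check.

    `active` is the set of thresholds currently in the warning state.
    It is updated in place so that each warning is reported once and
    cancelled once. (Hypothetical helper, for illustration only.)
    """
    percent_used = 100.0 * used_segments / total_segments
    messages = []
    for thr in sorted(thresholds):
        if percent_used > thr and thr not in active:
            # Usage rose above this threshold: raise the warning.
            active.add(thr)
            messages.append(
                f"WARNING: Stable Storage Use is Above {thr} percent")
        elif percent_used < thr and thr in active:
            # Usage fell back below this threshold: cancel the warning.
            active.discard(thr)
            messages.append(
                f"WARNING CANCEL: Stable Storage Use is Below {thr} percent")
    return messages


# Illustrative values: 75 and 90 stand in for the two configured thresholds.
active = set()
print(check_thresholds(80, 100, (75, 90), active))  # crosses the first threshold
print(check_thresholds(95, 100, (75, 90), active))  # crosses the second threshold
print(check_thresholds(50, 100, (75, 90), active))  # drops below both
```

Keeping the set of active warnings between calls is what prevents the same warning from being repeated every time a segment is allocated while usage stays above a threshold.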

When the percentage of total partition segments used by a single stable queue exceeds the percentage specified by sqm_warning_thr_ind, a message like the following is written to the log file:

WARNING: Stable Storage Use by queue name is Above threshold percent

This warning alerts you to problems that cause a particular stable queue to fill until it is using a disproportionate share of the total partition space. For example, if a route is suspended for an extended period, its stable queue may fill until it occupies enough partition space to trigger a warning.

When the percentage of the total partition space used by a stable queue drops below the sqm_warning_thr_ind percentage, Replication Server writes a cancel message like the following to the log file:

WARNING CANCEL: Stable Storage Use by queue name is Below threshold percent
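Because every warning has a matching cancel message, a log file can be scanned to find the warnings that are still outstanding. The following Python sketch shows one way to do this. It assumes that the logged text follows the message templates shown in this section, with the queue name and an integer threshold substituted for the placeholders. The actual log line layout may differ, and the function name open_warnings and the sample queue name q1 are hypothetical.

```python
import re

# Matches both the server-wide and the per-queue messages.
# Assumption: the threshold is logged as an integer, and a per-queue
# message contains the words "by queue" followed by the queue name.
PATTERN = re.compile(
    r"WARNING(?P<cancel> CANCEL)?: Stable Storage Use "
    r"(?:by queue (?P<queue>.+?) )?"
    r"is (?:Above|Below) (?P<thr>\d+) percent"
)


def open_warnings(log_lines):
    """Return the set of (queue, threshold) warnings not yet cancelled.

    `queue` is None for the server-wide messages.
    """
    still_open = set()
    for line in log_lines:
        match = PATTERN.search(line)
        if match is None:
            continue  # not a stable storage message
        key = (match.group("queue"), int(match.group("thr")))
        if match.group("cancel"):
            still_open.discard(key)  # WARNING CANCEL clears the warning
        else:
            still_open.add(key)      # WARNING opens it
    return still_open


sample = [
    "WARNING: Stable Storage Use is Above 75 percent",
    "WARNING: Stable Storage Use by queue q1 is Above 70 percent",
    "WARNING CANCEL: Stable Storage Use is Below 75 percent",
]
print(open_warnings(sample))
```

A queue that remains in the returned set after the server-wide warnings have been cancelled is a candidate for the kind of single-queue problem described above, such as a suspended route.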