Hi,
My organization recently transitioned to an Asterisk/Queue Metrics solution for phones and call center services. The systems have been in place for about two months now. We've had repeated issues with our Asterisk servers, but our Queue Metrics servers had zero problems until today.
I got a call this morning that agents couldn't log in, pause, unpause, add or remove members, etc. When I attempted to browse to the web UI, I found it non-responsive. The server doesn't return an error; it simply doesn't respond at all. This happened again about twenty minutes ago. In both cases, service was restored with a "service queuemetrics restart". I'm attempting to review the logs in /usr/local/queuemetrics/tomcat/logs, but it's challenging to sift through all of the extra information Tomcat is writing to them. I'm not thoroughly trained on Queue Metrics; I hired an integrator to install it and haven't had time to get up to speed yet. I'm hoping somebody on the forum might be able to point me in the right direction. Here are some details:
This is an HA cluster using heartbeat for resource management, and DRBD for distributed storage.
QM: 1.7.1
OS Platform: PBX In a Flash, Purple, with Queue Metrics installed on top
Heartbeat: 2.1.4
DRBD: 8.3.8
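One way to cut the Tomcat logs down to just the suspicious lines would be a small filter like the sketch below. It is only a rough idea, not something taken from the Queue Metrics documentation: the pattern list (SEVERE, ERROR, Exception, OutOfMemoryError) is my assumption about what trouble usually looks like in Java/Tomcat logs, the function names are made up for illustration, and it assumes Python 3 is available on the box.

```python
import re

# Assumed markers of trouble in Tomcat/Java logs; adjust to taste.
ERROR_PATTERN = re.compile(r"SEVERE|ERROR|Exception|OutOfMemoryError")


def filter_log_lines(lines, pattern=ERROR_PATTERN):
    """Return (line_number, text) pairs for every line matching the pattern."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, start=1)
        if pattern.search(line)
    ]


def scan_file(path, pattern=ERROR_PATTERN):
    """Filter a log file on disk; undecodable bytes are replaced, not fatal."""
    with open(path, errors="replace") as handle:
        return filter_log_lines(handle, pattern)
```

Calling scan_file() on one of the files under /usr/local/queuemetrics/tomcat/logs and printing the returned pairs should give a much shorter list to read, with line numbers to jump back to in the full log around the time of each freeze.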
Can anybody recommend where I might start to determine why it's suddenly freezing? No changes have been made to the Queue Metrics boxes recently, apart from adding agents and similar routine changes. Thanks in advance for any suggestions and help!
Mike.