I am posting to this forum as a last resort. I think I have googled the entire internet in search of a cause for the issues I have with Zoneminder. I will try to describe my issues as fully and accurately as possible; if more information is needed, please let me know and I will post it.
The problem:
While Zoneminder is running, every few hours (on average; sometimes more, sometimes less) I get either a kernel panic or some other total crash that is not even recorded in the system logs. This only happens when Zoneminder is running.
My setup:
Mainboard: MSI H81M-P33
CPU: Celeron G1820T
RAM: 2x2GB
Drive: WD Red Series
System: Arch Linux (Antergos)
Zoneminder: AUR version, most recent (1.29.0-2)
Camera: one HD camera (Ubiquiti airCam); I am using "modect" mode, capturing 1280x720 @ 20 fps (this probably does not matter though)
How installed:
I followed the Arch Linux guide on installing Zoneminder from the AUR.
More detailed view on the problem:
The issue is difficult for me to track down, because I usually do not get any logs, traces or dumps when the system hangs. A lot seems to point to MySQL, but I have not been able to find anything concrete on that or to reproduce it. MySQL has segfaulted a few times, and so has zmdc.pl. I have also seen the CPU being stuck for 20+ seconds, low memory corruption, and various other things. Reading this, you might think it is a hardware issue, but I do not think so; please continue reading.
What I have done to try to find/fix the problem:
Apart from googling the entire internet, I did the following checks:
- ran Memtest for about 10 passes - no errors,
- ran mprime for 24+ hours (more like 30 probably), with 100% CPU load and a lot of memory used - no errors,
- changed the memory mapping to get rid of the "Low memory corruption" issues I had, following this forum post: https://bbs.archlinux.org/viewtopic.php?id=189483 - no difference, still crashing. Arch picked up the new memory map and no longer uses low memory (I even reserved the entire 640k to be sure),
- updated the CPU microcode using this guide: https://wiki.archlinux.org/index.php/Microcode and regenerated the GRUB config - again, no change,
- changed some settings concerning buffering and memory for MySQL - no change,
- checked the disk with fsck for filesystem and surface problems - no change (there were some filesystem errors caused by the system hangs, but they were fixed),
- checked MySQL DB for errors - none found
- changed Zoneminder monitor settings (buffers, sources etc.), no change
- changed "nph-zms" to "zms" just for the sake of checking; I did not expect this to change anything, and it did not (when it works, it works fine with both).
I think that covers most of what I did. Without Zoneminder, the same machine runs for weeks without a problem, so I expect this to be a Zoneminder-related problem; I may be wrong though.
Some observations/comments:
- this is not a free disk space issue; according to Zoneminder, only 4% of the disk is used,
- this is not an insufficient memory issue either; /dev/shm has 1.9G and Zoneminder uses up to 700MB from what I have seen,
- I have noticed that the Zoneminder log contains a lot (and I mean A LOT) of entries like this:
08/16/16 20:53:57.336369 zmf[936].DB1-zmf.cpp/244 [Select timed out]
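To put a number on "a lot", the entries could be tallied with something like the sketch below. It is only a rough illustration: the regular expression is inferred from the single sample line above, so lines in a different format are simply skipped, and `tally_messages` is a made-up helper name, not anything shipped with Zoneminder.

```python
import re
from collections import Counter

# Pattern inferred from the one sample line above (assumption):
# MM/DD/YY HH:MM:SS.micro  process[pid].LEVEL-source/line [message]
LINE_RE = re.compile(
    r"^(?P<stamp>\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d+) "
    r"(?P<proc>\w+)\[(?P<pid>\d+)\]\."
    r"(?P<level>\w+)-(?P<src>[^/\s]+)/(?P<line>\d+) "
    r"\[(?P<msg>.*)\]$"
)


def tally_messages(lines):
    """Count how often each (process, level, message) combination occurs.

    Lines that do not match the assumed format (for example an entry
    that was cut off mid-write) are ignored rather than guessed at.
    """
    counts = Counter()
    for raw in lines:
        match = LINE_RE.match(raw.strip())
        if match:
            counts[(match["proc"], match["level"], match["msg"])] += 1
    return counts
```

Passing an open log file to `tally_messages` and printing `.most_common(10)` on the result would show which messages dominate the log.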
Logs:
I am posting a log from /var/log/zoneminder. The structure of that directory looks very strange to me (many different files ending in what seem to be random numbers), but I picked the most recent and largest file, the one with all those "Select timed out" errors/warnings. I had to cut most of them out, because the log is over 2MB and pastebin does not allow more than 512kB. The part I removed contains nothing apart from this message.
http://pastebin.com/UJjiVqRK
I have kept the last log entry though; it is cut off in the log exactly as you see it pasted. It seems the machine hung while that entry was being written.
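The same trimming could also be done mechanically instead of by hand. The following is a minimal sketch, assuming every entry starts with a timestamp in the format of the sample line; `collapse_repeats` is a hypothetical helper, not part of Zoneminder. It keeps the first line of each run of entries that differ only in their timestamp and replaces the rest with a short note.

```python
import re
from itertools import groupby

# Leading timestamp as seen in the sample entry (assumed format):
# MM/DD/YY HH:MM:SS.micro followed by a space.
STAMP_RE = re.compile(r"^\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d+ ")


def collapse_repeats(lines):
    """Yield the log with consecutive duplicate entries collapsed.

    Two entries count as duplicates when they are identical after the
    leading timestamp is removed. The first entry of each run is kept
    unchanged, followed by a note saying how many more were dropped.
    """
    stripped = (raw.rstrip("\n") for raw in lines)
    for _, run in groupby(stripped, key=lambda l: STAMP_RE.sub("", l)):
        run = list(run)
        yield run[0]
        if len(run) > 1:
            yield f"    ... repeated {len(run) - 1} more time(s)"
```

An entry that was cut off mid-write differs from its neighbours, so it forms its own run and is kept as-is.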
Please let me know what other logs would be helpful if any.
Any help is appreciated. The application is very good, I would just like to have it running...
Regards,
XaFFaX