Major system instability problem with fedora 7/xen
Posted: Thu Aug 23, 2007 9:04 pm
I've been moving zoneminder over to a new machine, and in the process I think I've discovered a potential problem that can cause significant system instability, and not just with zoneminder.
I was beginning to think the new machine had a serious h/w problem, as I was experiencing a wide range of crashes, errors and just plain odd behaviour. Firefox and Thunderbird would only run for at best a couple of minutes before crashing out with a Floating Point Exception; Gnome desktop applets would vanish frequently for no apparent reason; and various system services would stop running abruptly several times a day leaving little or no clues as to how or why.
Mostly I noticed this with zoneminder, slimserver, and a custom process I run that collects data from 1-wire sensors. Some of the zm problems were caused by the mysqld process crashing on a "signal 8", which I've subsequently discovered is another Floating Point Exception.
However, I think I've finally managed to track down the cause. Some web searching I did recently pointed me in the direction of glibc, and then I noticed that the old machine, which has always been relatively stable, was running glibc-2.6-3, where as the new machine had been yum updated to 2.6-4 back in July.
First I tried upgrading again to 2.6-90 from the fedora development repo, but this had no discernible effect. I then tried to downgrade to 2.6-3, inadvertently hosing the machine by pulling the glibc-2.6-90 rug from beneath its feet in a most undignified fashion. Whoops. Composure re-gathered I semi-gracefully recovered the situation with the help of the install cd, and managed to get it up and running again with glibc-2.6-3 in place.
The result? It's been up around 3 hours now, and so far none of the previous problems have resurfaced.
In fact the only thing that's appeared in zmdc.log in that time is an abnormal exit from zmc for a known-flakey remote ip webcam monitor which is connected via both a wi-fi bridge and a Homeplug mains-over-ethernet link.
I'm not sure how I can best report this back to the glibc folks, but if any of you are running fedora-7 or one of its close relations, I'd avoid glibc-2.6-4 if I were you.
Hope that's useful to someone...
Jim[/i]
I was beginning to think the new machine had a serious h/w problem, as I was experiencing a wide range of crashes, errors and just plain odd behaviour. Firefox and Thunderbird would only run for at best a couple of minutes before crashing out with a Floating Point Exception; Gnome desktop applets would vanish frequently for no apparent reason; and various system services would stop running abruptly several times a day leaving little or no clues as to how or why.
Mostly I noticed this with zoneminder, slimserver, and a custom process I run that collects data from 1-wire sensors. Some of the zm problems were caused by the mysqld process crashing on a "signal 8", which I've subsequently discovered is another Floating Point Exception.
However, I think I've finally managed to track down the cause. Some web searching I did recently pointed me in the direction of glibc, and then I noticed that the old machine, which has always been relatively stable, was running glibc-2.6-3, where as the new machine had been yum updated to 2.6-4 back in July.
First I tried upgrading again to 2.6-90 from the fedora development repo, but this had no discernible effect. I then tried to downgrade to 2.6-3, inadvertently hosing the machine by pulling the glibc-2.6-90 rug from beneath its feet in a most undignified fashion. Whoops. Composure re-gathered I semi-gracefully recovered the situation with the help of the install cd, and managed to get it up and running again with glibc-2.6-3 in place.
The result? It's been up around 3 hours now, and so far none of the previous problems have resurfaced.
In fact the only thing that's appeared in zmdc.log in that time is an abnormal exit from zmc for a known-flakey remote ip webcam monitor which is connected via both a wi-fi bridge and a Homeplug mains-over-ethernet link.
I'm not sure how I can best report this back to the glibc folks, but if any of you are running fedora-7 or one of its close relations, I'd avoid glibc-2.6-4 if I were you.
Hope that's useful to someone...
Jim[/i]