* [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
@ 2009-03-23 15:38 Steve
2009-03-23 19:27 ` Steve
0 siblings, 1 reply; 9+ messages in thread
From: Steve @ 2009-03-23 15:38 UTC (permalink / raw
To: gentoo-user
Has anyone any ideas? The syslog-ng is the usually the first line
reported by top:
4097 root 20 0 3120 1060 708 R 48.3 0.1 677:46.38 syslog-ng
The files in /var/log seem to be growing at an expected slow pace and
aren't reporting anything unexpected. I followed a 'howto' and have
sshguard running. This (comments stripped) is what I have in
/etc/syslog-ng/syslog-ng.conf
> options {
> chain_hostnames(off);
> sync(0);
> stats(43200);
> };
>
> source src {
> unix-stream("/dev/log" max-connections(256));
> internal();
> file("/proc/kmsg");
> };
>
> destination messages { file("/var/log/messages"); };
> destination console_all { file("/dev/tty12"); };
> log { source(src); destination(messages); };
> log { source(src); destination(console_all); };
> destination authlog { file("/var/log/auth.log"); };
> destination authlog { file("/var/log/auth.log"); };
> filter f_authpriv { facility(auth, authpriv); };
> log { source(src); filter(f_authpriv); destination(authlog); };
> filter sshlogs { facility(auth, authpriv) and match("sshd"); };
> destination sshguardproc {
> program("/usr/local/sbin/sshguard"
> template("$DATE $FULLHOST $MESSAGE\n"));
> };
> log { source(src); filter(sshlogs); destination(sshguardproc); };
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 15:38 [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) Steve @ 2009-03-23 19:27 ` Steve 2009-03-23 19:31 ` Steve ` (2 more replies) 0 siblings, 3 replies; 9+ messages in thread From: Steve @ 2009-03-23 19:27 UTC (permalink / raw To: gentoo-user Steve wrote: >> destination sshguardproc { >> program("/usr/local/sbin/sshguard" >> template("$DATE $FULLHOST $MESSAGE\n")); >> }; >> The presence of the above line is definitely what triggers the excessive CPU usage - it is almost as-if syslog-ng is 'busy-waiting' for the sshguard process. The sshguard process is running - but using zero CPU. I have this problem with syslog-ng versions 2.1.3 and 2.1.4 (the one with ~x86)... This is very frustrating... having played around, the syslog-ng tends towards using 100% CPU when my server is otherwise quiet - if, and only if, I have the program destination... even if the destination is not used. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:27 ` Steve @ 2009-03-23 19:31 ` Steve 2009-03-23 19:49 ` Steve 2009-03-23 19:43 ` Alan McKinnon 2009-03-23 19:48 ` Sebastian Günther 2 siblings, 1 reply; 9+ messages in thread From: Steve @ 2009-03-23 19:31 UTC (permalink / raw To: gentoo-user Steve wrote: > This is very frustrating... having played around, the syslog-ng tends > towards using 100% CPU when my server is otherwise quiet - if, and > only if, I have the program destination... even if the destination is > not used. Oh, and strace shows syslog-ng frantically polling file-descriptor 3... which, I presume, is the pipe to the sshguard process. > poll([{fd=6, events=0}, {fd=4, events=POLLIN}, {fd=10, events=POLLIN}, > {fd=3, events=POLLIN}, {fd=13, events=POLLIN}, {fd=14, events=POLLIN}, > {fd=7, events=POLLIN}, {fd=8, events=POLLIN}], 8, 1178000) = 1 > ([{fd=6, revents=POLLERR}]) > gettimeofday({1237836567, 385148}, NULL) = 0 > gettimeofday({1237836567, 385178}, NULL) = 0 > poll([{fd=6, events=0}, {fd=4, events=POLLIN}, {fd=10, events=POLLIN}, > {fd=3, events=POLLIN}, {fd=13, events=POLLIN}, {fd=14, events=POLLIN}, > {fd=7, events=POLLIN}, {fd=8, events=POLLIN}], 8, 1178000) = 1 > ([{fd=6, revents=POLLERR}]) > gettimeofday({1237836567, 385506}, NULL) = 0 > gettimeofday({1237836567, 385712}, NULL) = 0 > poll([{fd=6, events=0}, {fd=4, events=POLLIN}, {fd=10, events=POLLIN}, > {fd=3, events=POLLIN}, {fd=13, events=POLLIN}, {fd=14, events=POLLIN}, > {fd=7, events=POLLIN}, {fd=8, events=POLLIN}], 8, 1178000) = 1 > ([{fd=6, revents=POLLERR}]) > gettimeofday({1237836567, 386027}, NULL) = 0 > gettimeofday({1237836567, 386241}, NULL) = 0 Do others get this behaviour - is this a bug in syslog-ng? ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:31 ` Steve @ 2009-03-23 19:49 ` Steve 2009-03-23 21:13 ` Paul Hartman 0 siblings, 1 reply; 9+ messages in thread From: Steve @ 2009-03-23 19:49 UTC (permalink / raw To: gentoo-user Steve wrote: > Do others get this behaviour - is this a bug in syslog-ng? Sorry for the multiple posts... a slight error on my part. The sshguard process wasn't running - a /bin/sh process trying to spawn it was running (there was no link from /usr/local... to the binary) and when the binary failed to execute - syslog-ng got itself into a tiz. Everything seems to work fine when I correct the path to the program. Problem solved - but, I guess, this is a flaw in syslog-ng... I'd have hoped it would generate an error message rather than behave as it did. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:49 ` Steve @ 2009-03-23 21:13 ` Paul Hartman 0 siblings, 0 replies; 9+ messages in thread From: Paul Hartman @ 2009-03-23 21:13 UTC (permalink / raw To: gentoo-user On Mon, Mar 23, 2009 at 2:49 PM, Steve <gentoo_sjh@shic.co.uk> wrote: > Steve wrote: >> >> Do others get this behaviour - is this a bug in syslog-ng? > > Sorry for the multiple posts... a slight error on my part. The sshguard > process wasn't running - a /bin/sh process trying to spawn it was running > (there was no link from /usr/local... to the binary) and when the binary > failed to execute - syslog-ng got itself into a tiz. Everything seems to > work fine when I correct the path to the program. > > Problem solved - but, I guess, this is a flaw in syslog-ng... I'd have hoped > it would generate an error message rather than behave as it did. I had a possibly similar problem a while back with syslog-ng going crazy when a certain daemon would crash (in my case it filled up the log wit about 60 gigabytes of the same thing repeated over and over, in addition to using massive CPU%). I switched to metalog and haven't had any problems since. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:27 ` Steve 2009-03-23 19:31 ` Steve @ 2009-03-23 19:43 ` Alan McKinnon 2009-03-23 19:55 ` Steve 2009-03-23 19:48 ` Sebastian Günther 2 siblings, 1 reply; 9+ messages in thread From: Alan McKinnon @ 2009-03-23 19:43 UTC (permalink / raw To: gentoo-user On Monday 23 March 2009 21:27:15 Steve wrote: > Steve wrote: > >> destination sshguardproc { > >> program("/usr/local/sbin/sshguard" > >> template("$DATE $FULLHOST $MESSAGE\n")); > >> }; > > The presence of the above line is definitely what triggers the excessive > CPU usage - it is almost as-if syslog-ng is 'busy-waiting' for the > sshguard process. The sshguard process is running - but using zero CPU. > > I have this problem with syslog-ng versions 2.1.3 and 2.1.4 (the one > with ~x86)... > > This is very frustrating... having played around, the syslog-ng tends > towards using 100% CPU when my server is otherwise quiet - if, and only > if, I have the program destination... even if the destination is not used. One word: blocking I find this is usually the cause for higher than normal CPU load as reported by top and other tools. If the load is pegged at exactly 100%, it's almost a sure sign that some process is IO blocking on an idle system, and all the process is doing is checking if IO is available, see it isn't, goes to sleep, wakes up, rinse and repeat. In short: top lies, and load does not mean what most people think it means. The correct definition is "average number of processes that are waiting for cpu time within the measurement period." -- alan dot mckinnon at gmail dot com ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:43 ` Alan McKinnon @ 2009-03-23 19:55 ` Steve 0 siblings, 0 replies; 9+ messages in thread From: Steve @ 2009-03-23 19:55 UTC (permalink / raw To: gentoo-user Alan McKinnon wrote: > In short: top lies, > On this occasion, top was telling the truth. ;) ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:27 ` Steve 2009-03-23 19:31 ` Steve 2009-03-23 19:43 ` Alan McKinnon @ 2009-03-23 19:48 ` Sebastian Günther 2009-03-23 19:55 ` Steve 2 siblings, 1 reply; 9+ messages in thread From: Sebastian Günther @ 2009-03-23 19:48 UTC (permalink / raw To: gentoo-user [-- Attachment #1: Type: text/plain, Size: 586 bytes --] * Steve (gentoo_sjh@shic.co.uk) [23.03.09 20:27]: > Steve wrote: > >> destination sshguardproc { > >> program("/usr/local/sbin/sshguard" > >> template("$DATE $FULLHOST $MESSAGE\n")); > >> }; > >> program() only takes 1 argument: the programname. Any thing you want to pass, you have to define via a log statement. BTW: Just curious: you do not use the sshguard from portage, or why is it a /usr/local/sbin? HTH Sebastian -- " Religion ist das Opium des Volkes. " Karl Marx SEB@STI@N GÜNTHER mailto:samson@guenther-roetgen.de [-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --] ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) 2009-03-23 19:48 ` Sebastian Günther @ 2009-03-23 19:55 ` Steve 0 siblings, 0 replies; 9+ messages in thread From: Steve @ 2009-03-23 19:55 UTC (permalink / raw To: gentoo-user Sebastian Günther wrote: > program() only takes 1 argument: the programname. > There aren't two arguments (no comma) - and, yes, the syntax is odd - but it is exactly what is given by the sshguard man page - and seems to be confirmed by the syslog-ng manual, too. > BTW: Just curious: you do not use the sshguard from portage, or why is > it a /usr/local/sbin? > That was my error (a really dumb one!) I'd assumed that the binary from portage was running - whereas my process list showed /bin/sh failing to run a non-existent program. I guess the man page could be improved for gentoo by giving an example using the default install location for sshguard - but that's a very minor issue. I'd expected better error reporting by syslog-ng for a faulty configuration - ho-hum. ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2009-03-23 21:13 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-03-23 15:38 [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) Steve 2009-03-23 19:27 ` Steve 2009-03-23 19:31 ` Steve 2009-03-23 19:49 ` Steve 2009-03-23 21:13 ` Paul Hartman 2009-03-23 19:43 ` Alan McKinnon 2009-03-23 19:55 ` Steve 2009-03-23 19:48 ` Sebastian Günther 2009-03-23 19:55 ` Steve
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox