[gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)

public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed

* [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
@ 2009-03-23 15:38 Steve
  2009-03-23 19:27 ` Steve
  0 siblings, 1 reply; 9+ messages in thread
From: Steve @ 2009-03-23 15:38 UTC (permalink / raw
  To: gentoo-user

Has anyone any ideas?  The syslog-ng is the usually the first line 
reported by top:

 4097 root      20   0  3120 1060  708 R 48.3  0.1 677:46.38 syslog-ng

The files in /var/log seem to be growing at an expected slow pace and 
aren't reporting anything unexpected.  I followed a 'howto' and have 
sshguard running.  This (comments stripped) is what I have in 
/etc/syslog-ng/syslog-ng.conf
> options {
>         chain_hostnames(off);
>         sync(0);
>         stats(43200);
> };
>
> source src {
>     unix-stream("/dev/log" max-connections(256));
>     internal();
>     file("/proc/kmsg");
> };
>
> destination messages { file("/var/log/messages"); };
> destination console_all { file("/dev/tty12"); };
> log { source(src); destination(messages); };
> log { source(src); destination(console_all); };
> destination authlog { file("/var/log/auth.log"); };
> destination authlog { file("/var/log/auth.log"); };
> filter f_authpriv { facility(auth, authpriv); };
> log { source(src); filter(f_authpriv); destination(authlog); };
> filter sshlogs { facility(auth, authpriv) and match("sshd"); };
> destination sshguardproc {
>     program("/usr/local/sbin/sshguard"
>         template("$DATE $FULLHOST $MESSAGE\n"));
>     };
>     log { source(src); filter(sshlogs); destination(sshguardproc); };




^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 15:38 [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) Steve
@ 2009-03-23 19:27 ` Steve
  2009-03-23 19:31   ` Steve
                     ` (2 more replies)
  0 siblings, 3 replies; 9+ messages in thread
From: Steve @ 2009-03-23 19:27 UTC (permalink / raw
  To: gentoo-user

Steve wrote:
>> destination sshguardproc {
>>     program("/usr/local/sbin/sshguard"
>>         template("$DATE $FULLHOST $MESSAGE\n"));
>>     };
>>
The presence of the above line is definitely what triggers the excessive 
CPU usage - it is almost as-if syslog-ng is 'busy-waiting' for the 
sshguard process.  The sshguard process is running - but using zero CPU.

I have this problem with syslog-ng versions 2.1.3 and 2.1.4 (the one 
with ~x86)...

This is very frustrating... having played around, the syslog-ng tends 
towards using 100% CPU when my server is otherwise quiet - if, and only 
if, I have the program destination... even if the destination is not used.





^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 19:27 ` Steve
@ 2009-03-23 19:31   ` Steve
  2009-03-23 19:49     ` Steve
  2009-03-23 19:43   ` Alan McKinnon
  2009-03-23 19:48   ` Sebastian Günther
  2 siblings, 1 reply; 9+ messages in thread
From: Steve @ 2009-03-23 19:31 UTC (permalink / raw
  To: gentoo-user

Steve wrote:
> This is very frustrating... having played around, the syslog-ng tends 
> towards using 100% CPU when my server is otherwise quiet - if, and 
> only if, I have the program destination... even if the destination is 
> not used.

Oh, and strace shows syslog-ng frantically polling file-descriptor 3... 
which, I presume, is the pipe to the sshguard process.

> poll([{fd=6, events=0}, {fd=4, events=POLLIN}, {fd=10, events=POLLIN}, 
> {fd=3, events=POLLIN}, {fd=13, events=POLLIN}, {fd=14, events=POLLIN}, 
> {fd=7, events=POLLIN}, {fd=8, events=POLLIN}], 8, 1178000) = 1 
> ([{fd=6, revents=POLLERR}])
> gettimeofday({1237836567, 385148}, NULL) = 0
> gettimeofday({1237836567, 385178}, NULL) = 0
> poll([{fd=6, events=0}, {fd=4, events=POLLIN}, {fd=10, events=POLLIN}, 
> {fd=3, events=POLLIN}, {fd=13, events=POLLIN}, {fd=14, events=POLLIN}, 
> {fd=7, events=POLLIN}, {fd=8, events=POLLIN}], 8, 1178000) = 1 
> ([{fd=6, revents=POLLERR}])
> gettimeofday({1237836567, 385506}, NULL) = 0
> gettimeofday({1237836567, 385712}, NULL) = 0
> poll([{fd=6, events=0}, {fd=4, events=POLLIN}, {fd=10, events=POLLIN}, 
> {fd=3, events=POLLIN}, {fd=13, events=POLLIN}, {fd=14, events=POLLIN}, 
> {fd=7, events=POLLIN}, {fd=8, events=POLLIN}], 8, 1178000) = 1 
> ([{fd=6, revents=POLLERR}])
> gettimeofday({1237836567, 386027}, NULL) = 0
> gettimeofday({1237836567, 386241}, NULL) = 0
Do others get this behaviour - is this a bug in syslog-ng?





^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 19:31   ` Steve
@ 2009-03-23 19:49     ` Steve
  2009-03-23 21:13       ` Paul Hartman
  0 siblings, 1 reply; 9+ messages in thread
From: Steve @ 2009-03-23 19:49 UTC (permalink / raw
  To: gentoo-user

Steve wrote:
> Do others get this behaviour - is this a bug in syslog-ng?

Sorry for the multiple posts... a slight error on my part.  The sshguard 
process wasn't running - a /bin/sh process trying to spawn it was 
running (there was no link from /usr/local... to the binary) and when 
the binary failed to execute - syslog-ng got itself into a tiz.  
Everything seems to work fine when I correct the path to the program.

Problem solved - but, I guess, this is a flaw in syslog-ng... I'd have 
hoped it would generate an error message rather than behave as it did.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time...  (I'm using sshguard)
  2009-03-23 19:49     ` Steve
@ 2009-03-23 21:13       ` Paul Hartman
  0 siblings, 0 replies; 9+ messages in thread
From: Paul Hartman @ 2009-03-23 21:13 UTC (permalink / raw
  To: gentoo-user

On Mon, Mar 23, 2009 at 2:49 PM, Steve <gentoo_sjh@shic.co.uk> wrote:
> Steve wrote:
>>
>> Do others get this behaviour - is this a bug in syslog-ng?
>
> Sorry for the multiple posts... a slight error on my part.  The sshguard
> process wasn't running - a /bin/sh process trying to spawn it was running
> (there was no link from /usr/local... to the binary) and when the binary
> failed to execute - syslog-ng got itself into a tiz.  Everything seems to
> work fine when I correct the path to the program.
>
> Problem solved - but, I guess, this is a flaw in syslog-ng... I'd have hoped
> it would generate an error message rather than behave as it did.

I had a possibly similar problem a while back with syslog-ng going
crazy when a certain daemon would crash (in my case it filled up the
log wit about 60 gigabytes of the same thing repeated over and over,
in addition to using massive CPU%). I switched to metalog and haven't
had any problems since.



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 19:27 ` Steve
  2009-03-23 19:31   ` Steve
@ 2009-03-23 19:43   ` Alan McKinnon
  2009-03-23 19:55     ` Steve
  2009-03-23 19:48   ` Sebastian Günther
  2 siblings, 1 reply; 9+ messages in thread
From: Alan McKinnon @ 2009-03-23 19:43 UTC (permalink / raw
  To: gentoo-user

On Monday 23 March 2009 21:27:15 Steve wrote:
> Steve wrote:
> >> destination sshguardproc {
> >>     program("/usr/local/sbin/sshguard"
> >>         template("$DATE $FULLHOST $MESSAGE\n"));
> >>     };
>
> The presence of the above line is definitely what triggers the excessive
> CPU usage - it is almost as-if syslog-ng is 'busy-waiting' for the
> sshguard process.  The sshguard process is running - but using zero CPU.
>
> I have this problem with syslog-ng versions 2.1.3 and 2.1.4 (the one
> with ~x86)...
>
> This is very frustrating... having played around, the syslog-ng tends
> towards using 100% CPU when my server is otherwise quiet - if, and only
> if, I have the program destination... even if the destination is not used.

One word:

blocking

I find this is usually the cause for higher than normal CPU load as reported 
by top and other tools. If the load is pegged at exactly 100%, it's almost a 
sure sign that some process is IO blocking on an idle system, and all the 
process is doing is checking if IO is available, see it isn't, goes to sleep, 
wakes up, rinse and repeat.

In short: top lies, and load does not mean what most people think it means. 
The correct definition is "average number of processes that are waiting for 
cpu time within the measurement period." 

-- 
alan dot mckinnon at gmail dot com

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 19:43   ` Alan McKinnon
@ 2009-03-23 19:55     ` Steve
  0 siblings, 0 replies; 9+ messages in thread
From: Steve @ 2009-03-23 19:55 UTC (permalink / raw
  To: gentoo-user

Alan McKinnon wrote:
> In short: top lies, 
>   
On this occasion, top was telling the truth. ;)



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 19:27 ` Steve
  2009-03-23 19:31   ` Steve
  2009-03-23 19:43   ` Alan McKinnon
@ 2009-03-23 19:48   ` Sebastian Günther
  2009-03-23 19:55     ` Steve
  2 siblings, 1 reply; 9+ messages in thread
From: Sebastian Günther @ 2009-03-23 19:48 UTC (permalink / raw
  To: gentoo-user

[-- Attachment #1: Type: text/plain, Size: 586 bytes --]

* Steve (gentoo_sjh@shic.co.uk) [23.03.09 20:27]:
> Steve wrote:
> >> destination sshguardproc {
> >>     program("/usr/local/sbin/sshguard"
> >>         template("$DATE $FULLHOST $MESSAGE\n"));
> >>     };
> >>

program() only takes 1 argument: the programname.

Any thing you want to pass, you have to define via a log statement.

BTW: Just curious: you do not use the sshguard from portage, or why is 
it a /usr/local/sbin?

HTH
Sebastian

-- 
 " Religion ist das Opium des Volkes. "      Karl Marx

 SEB@STI@N GÜNTHER         mailto:samson@guenther-roetgen.de

[-- Attachment #2: Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard)
  2009-03-23 19:48   ` Sebastian Günther
@ 2009-03-23 19:55     ` Steve
  0 siblings, 0 replies; 9+ messages in thread
From: Steve @ 2009-03-23 19:55 UTC (permalink / raw
  To: gentoo-user

Sebastian Günther wrote:
> program() only takes 1 argument: the programname.
>   
There aren't two arguments (no comma) - and, yes, the syntax is odd - 
but it is exactly what is given by the sshguard man page - and seems to 
be confirmed by the syslog-ng manual, too.
> BTW: Just curious: you do not use the sshguard from portage, or why is 
> it a /usr/local/sbin?
>   
That was my error (a really dumb one!) I'd assumed that the binary from 
portage was running - whereas my process list showed /bin/sh failing to 
run a non-existent program.

I guess the man page could be improved for gentoo by giving an example 
using the default install location for sshguard - but that's a very 
minor issue.

I'd expected better error reporting by syslog-ng for a faulty 
configuration - ho-hum.





^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2009-03-23 21:13 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-03-23 15:38 [gentoo-user] Syslog-ng using a spectacular amount of CPU time... (I'm using sshguard) Steve
2009-03-23 19:27 ` Steve
2009-03-23 19:31   ` Steve
2009-03-23 19:49     ` Steve
2009-03-23 21:13       ` Paul Hartman
2009-03-23 19:43   ` Alan McKinnon
2009-03-23 19:55     ` Steve
2009-03-23 19:48   ` Sebastian Günther
2009-03-23 19:55     ` Steve

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox