* [gentoo-amd64] Tcp transfer has a pause every few seconds
@ 2005-10-24 19:34 P.V.Anthony
2005-10-25 12:56 ` [gentoo-amd64] " Duncan
0 siblings, 1 reply; 6+ messages in thread
From: P.V.Anthony @ 2005-10-24 19:34 UTC (permalink / raw
To: gentoo-amd64
Hi,
I have a problem with my nfs transfer using tcp or udp. There is a pause every
few seconds when I am writing to the server. This causes the video that I am
recording to have jerks and lost frames. The tcp stream is not constant.
I will try to explain the best I can.
I have a server for video use. I connect the videoserver with Macs using cross
cable. Using NFS version 3 and using tcp. The filesystem on the server is XFS.
On the macs there is a dv camera connected. Using a video editing software, I am
recording the video to the nfs mount on the mac. After the recording when I play
back the recording there are jerks in the video every 20 seconds or so.
Next I copy a big file to the nfs mount. The file goes through but I notice that
there is pause every 20 seconds or so. DV video only need about 3.5MB per
second. All the ethernet connection is gigabit.
What am I doing wrong? Please do advice.
P.V.Anthony
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 6+ messages in thread
* [gentoo-amd64] Re: Tcp transfer has a pause every few seconds
2005-10-24 19:34 [gentoo-amd64] Tcp transfer has a pause every few seconds P.V.Anthony
@ 2005-10-25 12:56 ` Duncan
2005-10-30 15:27 ` P.V.Anthony
0 siblings, 1 reply; 6+ messages in thread
From: Duncan @ 2005-10-25 12:56 UTC (permalink / raw
To: gentoo-amd64
P.V.Anthony posted <435D3729.4070507@singnet.com.sg>, excerpted below, on
Tue, 25 Oct 2005 03:34:01 +0800:
> I have a problem with my nfs transfer using tcp or udp. There is a pause
> every few seconds when I am writing to the server. This causes the video
> that I am recording to have jerks and lost frames. The tcp stream is not
> constant.
>
> I will try to explain the best I can.
>
> I have a server for video use. I connect the videoserver with Macs using
> cross cable. Using NFS version 3 and using tcp. The filesystem on the
> server is XFS.
>
> On the macs there is a dv camera connected. Using a video editing
> software, I am recording the video to the nfs mount on the mac. After
> the recording when I play back the recording there are jerks in the
> video every 20 seconds or so.
>
> Next I copy a big file to the nfs mount. The file goes through but I
> notice that
> there is pause every 20 seconds or so. DV video only need about 3.5MB
> per
> second. All the ethernet connection is gigabit.
>
> What am I doing wrong? Please do advice.
Pause every few seconds. I've seen two commonly reported solutions to
this -- two different problems resulting in the same symtoms -- and
another one less common in general but likely equally common on amd64.
1) DMA, or rather, the lack thereof, or interrupt sharing. Ensure that
you have the correct chipset drivers installed, both for whatever bus the
NIC is on, and for your hard drive interface (the PATA/SATA/SCSI chipset).
Often, generic or the wrong drivers will work in degraded mode, but DMA
won't be turned on, and performance will suffer. For your hard drives
(yes, wrong drivers there /can/ cause bottlenecks up the chain, including
onto the same bus as the NIC), use hdparm (or sdparm) to verify that both
the drive and the chipset agree that DMA is on.
Also check interrupt distribution and sharing. You may be able to change
these settings in the BIOS, or try switching some cards around
(particularly your NIC if it's not on-board built-in).
2) Do you have an "extra" network interface? I've seen this one on
MSWormOS and am unsure how Linux would even have the problem, unless you
have a spare interface (say a wireless card) that's running but not
connected. Anyway, the problem here is an interface set to DHCP mode that
can't get an IP because it's not connected, so of course can't see a
server. The system would pause every few minutes (every few seconds seems
a bit much, but...) and try to obtain an IP, stalling all I/O in the
process. (Again, the reason it's stalling all I/O is likely rooted in
misconfigured DMA or IRQs, but anyway...)
3) Seen frequently on this list as a problem resulting in one or more of
a number of symtoms, including this one: APIC or ACPI problems. Take a
look at the kernel documentation under $kerneldir/Documentation/x86_64,
the apic and acpi boot options, and idle=poll as well. Note that those
needing these often find they don't after a mature BIOS is available and
flashed. idle=poll is particularly nasty, as it will cause your CPU to
work far harder and run far hotter than it ordinarily would, but some
early amd64 boards required it until an updated BIOS fixed the issue. I'm
not sure if that's still a problem with the newer boards or not.
Also note that for specific net chip, various configuration options may be
available. In particular, there's the Broadcom tigon3/tg3 kernel driver,
and then another (I forget what it's called) for the same net chip. Often
one will work well, the other won't work or will have issues. If that's
your chip, look it up and consider trying the other driver.
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman in
http://www.linuxdevcenter.com/pub/a/linux/2004/12/22/rms_interview.html
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [gentoo-amd64] Re: Tcp transfer has a pause every few seconds
2005-10-25 12:56 ` [gentoo-amd64] " Duncan
@ 2005-10-30 15:27 ` P.V.Anthony
2005-10-30 17:53 ` [gentoo-amd64] " Duncan
0 siblings, 1 reply; 6+ messages in thread
From: P.V.Anthony @ 2005-10-30 15:27 UTC (permalink / raw
To: gentoo-amd64
Duncan wrote:
>
> Pause every few seconds. I've seen two commonly reported solutions to
> this -- two different problems resulting in the same symtoms -- and
> another one less common in general but likely equally common on amd64.
>
> 1) DMA, or rather, the lack thereof, or interrupt sharing. Ensure that
> you have the correct chipset drivers installed, both for whatever bus the
> NIC is on, and for your hard drive interface (the PATA/SATA/SCSI chipset).
> Often, generic or the wrong drivers will work in degraded mode, but DMA
> won't be turned on, and performance will suffer. For your hard drives
> (yes, wrong drivers there /can/ cause bottlenecks up the chain, including
> onto the same bus as the NIC), use hdparm (or sdparm) to verify that both
> the drive and the chipset agree that DMA is on.
I am using the Promise EX8350 raid card and using there driver.
>
> Also check interrupt distribution and sharing. You may be able to change
> these settings in the BIOS, or try switching some cards around
> (particularly your NIC if it's not on-board built-in).
This one I was thinking of trying the last. (Read below)
>
> 2) Do you have an "extra" network interface? I've seen this one on
> MSWormOS and am unsure how Linux would even have the problem, unless you
> have a spare interface (say a wireless card) that's running but not
> connected. Anyway, the problem here is an interface set to DHCP mode that
> can't get an IP because it's not connected, so of course can't see a
> server. The system would pause every few minutes (every few seconds seems
> a bit much, but...) and try to obtain an IP, stalling all I/O in the
> process. (Again, the reason it's stalling all I/O is likely rooted in
> misconfigured DMA or IRQs, but anyway...)
All the network cards are set to static address.
>
> 3) Seen frequently on this list as a problem resulting in one or more of
> a number of symtoms, including this one: APIC or ACPI problems. Take a
> look at the kernel documentation under $kerneldir/Documentation/x86_64,
> the apic and acpi boot options, and idle=poll as well. Note that those
> needing these often find they don't after a mature BIOS is available and
> flashed. idle=poll is particularly nasty, as it will cause your CPU to
> work far harder and run far hotter than it ordinarily would, but some
> early amd64 boards required it until an updated BIOS fixed the issue. I'm
> not sure if that's still a problem with the newer boards or not.
I am using the latest bios. So far I am not using any apic boot options. Will
read up.
>
> Also note that for specific net chip, various configuration options may be
> available. In particular, there's the Broadcom tigon3/tg3 kernel driver,
> and then another (I forget what it's called) for the same net chip. Often
> one will work well, the other won't work or will have issues. If that's
> your chip, look it up and consider trying the other driver.
Will check this.
Thank you very much for replying and helping. Please read my observations below
and if you have time please advice. Let me know if I still have to do the checks
as mention above.
Here are some specs and observations.
Specs.
cpu: AMD 64 3000+
drives: Sata drives
mobo: MSI Neo 4 Platinum
Ram: 2GB
Raid: Promise EX8350
OS: Gentoo
Kernel: 2.6.13 r3
Filesystem: XFS
Observations.
When I was using the xfs I had the problem of the pauses in the tcp transfers.
These pauses, I am guessing is because of the pdflush that occurs around every
20 seconds. There is only one pdflush every 20 seconds. This caused lose of
video frames when we did the video recording. The video would have a jerk or
jump when played back.
I changed the dirty_ratio from the default of 40 to 5. This solved the problem.
During the recording preview there seems to be some jerks but during the
playback it seems to be ok. The pdflush happens more often and there is still
one pdflush. I think the jerk that I see in the record preview coincides with
the pdflush. More jerks in record preview but ok during playback.
Next I formatted the drives to the ext3 filesystem. The dirty_ratio was set back
to the default value of 40. When the video started to record there is no loss of
frames and no jerks in the preview window. During the playback there is no loss.
Everything is good. Check the top for pdflush and noticed that the there is two
pdflush. The pdflushs is happening at every 5 seconds or so. Very fast compared
to when I was using xfs.
So the problem must be my xfs setting. For xfs I have only used the default
setting. Or could there be a problem with xfs and AMD64 kernel?
What must I do to get xfs to work? Or what am I doing wrong?
Please comment and advise.
Once again thank you very much for the advice.
P.V.Anthony
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 6+ messages in thread
* [gentoo-amd64] Re: Re: Tcp transfer has a pause every few seconds
2005-10-30 15:27 ` P.V.Anthony
@ 2005-10-30 17:53 ` Duncan
2005-11-03 3:00 ` P.V.Anthony
0 siblings, 1 reply; 6+ messages in thread
From: Duncan @ 2005-10-30 17:53 UTC (permalink / raw
To: gentoo-amd64
P.V.Anthony posted <4364E65D.4070109@singnet.com.sg>, excerpted below, on
Sun, 30 Oct 2005 23:27:25 +0800:
> specs and observations.
>
> Specs.
> cpu: AMD 64 3000+
> drives: Sata drives
> mobo: MSI Neo 4 Platinum
> Ram: 2GB
> Raid: Promise EX8350
> OS: Gentoo
> Kernel: 2.6.13 r3
> Filesystem: XFS
Now that I see that mobo listed...
See the other threads currently active on the list/group right now. (You
can read the list as either a newsgroup or a web forum, thru gmane.org. I
use their list2news interface to read the list as a newsgroup. That way
you can see posts from before you subscribed, if desired.)
I haven't kept exact track but it seems I'm seeing that mobo mentioned in
three different active threads ATM, this one included. The latest BIOS
(1.B IIRC?) apparently has some rather major issues with recent kernels.
The threads have been saying either revert to 1.9 (or whatever it was) or
try an earlier kernel (2.6.11 or earlier, IIRC). They problems mentioned
haven't been this one, but it's a troublesome combination in any case, so
certainly read up, and consider reverting either the kernel or the BIOS.
NOTE THAT ONE GUY ENDED UP FRYING HIS BIOS REVERTING, and had to buy a new
chip ($10-ish, but he had to wait for it to be delivered). However, he
was having issues getting into the BIOS already (the keyboard wasn't being
activated soon enough to detect the delete to put it in BIOS edit mode),
and it's possible the fried BIOS was related to that.
The $10 isn't bad, so you might wish to order the spare BIOS chip and have
it on hand before you try reverting, just in case.
... As I said in a different thread, altho I understand it's changed a
bit now, back when I was shopping for my dual Opteron board, it was
between a Tyan and an MSI board. Visiting the MSI site, all their BIOS
upgrades AND all their documentation was in MSWormOS executable format
(I'm guessing self-extracting zip, but all I know is it had the .exe at
the end). Tyan had the more traditional pdf docs and zip file BIOS packs.
I emailed MSI and told them why they lost the sale. (The MSI board was
cheaper and I would have likely gotten it if it looked like they cared at
all about their Linux customers.) Maybe my mail was part of what made
them decide to switch back to normal formats.
Anyway, I've been quite impressed with Tyan -- they are quite responsive
on the Linux front and regularly have Linux drivers, and even have
customized lm_sensors configs for their boards. As well, many of them
carry Linux certifications -- the manual for mine says Red Hat, SuSE, and
Turbo Linux, all three, supported and certified.
MSI OTOH, as I explained, I was rather disimpressed with, and the recent
threads here on the list haven't helped. I consider myself lucky to have
gone Tyan, and don't think I'll be considering MSI any time soon.
> Observations.
> When I was using the xfs I had the problem of the pauses in the tcp
> transfers. These pauses, I am guessing is because of the pdflush that
> occurs around every 20 seconds. There is only one pdflush every 20
> seconds. This caused lose of video frames when we did the video recording.
> The video would have a jerk or jump when played back.
>
> I changed the dirty_ratio from the default of 40 to 5. This solved the
> problem. During the recording preview there seems to be some jerks but
> during the playback it seems to be ok. The pdflush happens more often and
> there is still one pdflush. I think the jerk that I see in the record
> preview coincides with the pdflush. More jerks in record preview but ok
> during playback.
>
> Next I formatted the drives to the ext3 filesystem. The dirty_ratio was
> set back to the default value of 40. When the video started to record
> there is no loss of frames and no jerks in the preview window. During the
> playback there is no loss. Everything is good. Check the top for pdflush
> and noticed that the there is two pdflush. The pdflushs is happening at
> every 5 seconds or so. Very fast compared to when I was using xfs.
>
> So the problem must be my xfs setting. For xfs I have only used the
> default setting. Or could there be a problem with xfs and AMD64 kernel?
>
> What must I do to get xfs to work? Or what am I doing wrong?
The dirty_ratio stuff you mention is beyond my level -- you know more
there than I do by far, it would seem. I've only used reiserfs here,
since I switched to Linux, as well, so can't say much about ext2/3 or xfs,
beyond what's out there to read about them. I /can/ say I have two
pdflush threads here, but I always figured that was one running on each
CPU...
Anyway, now that you mentioned the MSI Neo 4 Platinum mobo, with the other
threads on it, I'd say the BIOS/kernel issues from that very likely are
causing your issues as well. Address that, and there's a good chance your
problems will disappear. In any case, there are several others on the
list with that board to compare notes with, which should help.
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman in
http://www.linuxdevcenter.com/pub/a/linux/2004/12/22/rms_interview.html
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [gentoo-amd64] Re: Re: Tcp transfer has a pause every few seconds
2005-10-30 17:53 ` [gentoo-amd64] " Duncan
@ 2005-11-03 3:00 ` P.V.Anthony
2005-11-03 7:28 ` [gentoo-amd64] " Duncan
0 siblings, 1 reply; 6+ messages in thread
From: P.V.Anthony @ 2005-11-03 3:00 UTC (permalink / raw
To: gentoo-amd64
Duncan wrote:
> Anyway, now that you mentioned the MSI Neo 4 Platinum mobo, with the other
> threads on it, I'd say the BIOS/kernel issues from that very likely are
> causing your issues as well. Address that, and there's a good chance your
> problems will disappear. In any case, there are several others on the
> list with that board to compare notes with, which should help.
You are right. Just installed gentoo kernel 2.6.14 and xfs do not have any
problems. Need to do more test to confirm.
I was looking at the change log for kernel 2.6.14 and there seems to be many
entries for xfs. Maybe they solved some problems.
Will keep you all posted.
P.V.Anthony
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 6+ messages in thread
* [gentoo-amd64] Re: Re: Re: Tcp transfer has a pause every few seconds
2005-11-03 3:00 ` P.V.Anthony
@ 2005-11-03 7:28 ` Duncan
0 siblings, 0 replies; 6+ messages in thread
From: Duncan @ 2005-11-03 7:28 UTC (permalink / raw
To: gentoo-amd64
P.V.Anthony posted <43697D63.5060508@singnet.com.sg>, excerpted below, on
Thu, 03 Nov 2005 11:00:51 +0800:
> You are right. Just installed gentoo kernel 2.6.14 and xfs do not have any
> problems. Need to do more test to confirm.
>
> I was looking at the change log for kernel 2.6.14 and there seems to be many
> entries for xfs. Maybe they solved some problems.
Very glad it seems fixed, and thanks for the feedback!
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman in
http://www.linuxdevcenter.com/pub/a/linux/2004/12/22/rms_interview.html
--
gentoo-amd64@gentoo.org mailing list
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2005-11-03 7:31 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-10-24 19:34 [gentoo-amd64] Tcp transfer has a pause every few seconds P.V.Anthony
2005-10-25 12:56 ` [gentoo-amd64] " Duncan
2005-10-30 15:27 ` P.V.Anthony
2005-10-30 17:53 ` [gentoo-amd64] " Duncan
2005-11-03 3:00 ` P.V.Anthony
2005-11-03 7:28 ` [gentoo-amd64] " Duncan
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox