public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-user] [OT] Differences between wget and browser file retrieval?
@ 2021-01-14 20:49 Walter Dnes
  2021-01-14 21:10 ` Jack
                   ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Walter Dnes @ 2021-01-14 20:49 UTC (permalink / raw
  To: Gentoo Users List

  I'm bored, so I do a regular daily report at the DSL Reports "CanChat"
sub-forum, on the Covid-19 case counts for Ontario, using provincial
data.  I download 2 files daily as source data.  One of them is a PDF
file, which is run through "pdftotext" and then parsed by a bash script
(don't ask).  Today, the command...

  wget https://files.ontario.ca/moh-covid-19-report-en-2021-01-14.pdf

...returns a zero-byte file.  *BUT*, sticking the URL into the URL bar
of Pale Moon and Google Chrome (and I assume Firefox/etc) brings up the
PDF file just fine.  Is "wget" being blocked?  I have to do extra steps
to get from the browser-invoked PDF to get the PDF file saved to the
standard work area where my script expects it to be, so it can work its
magic and parse out the daily breakdown by PHU (Public Health Unit).
BTW, today's posts requiring the PDF file are...
https://www.dslreports.com/forum/r33002718-
https://www.dslreports.com/forum/r33002752-

  I've tried setting --user-agent= with my browser's string as shown by
https://www.whatismybrowser.com/detect/what-is-my-user-agent  but no
luck.  Is there some way to get around this?  I have not updated this
past week, so I don't think the problem is at my end.

-- 
Walter Dnes <waltdnes@waltdnes.org>
I don't run "desktop environments"; I run useful applications


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2021-01-15 16:28 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2021-01-14 20:49 [gentoo-user] [OT] Differences between wget and browser file retrieval? Walter Dnes
2021-01-14 21:10 ` Jack
2021-01-14 21:36   ` Andreas Fink
2021-01-14 22:00 ` David Haller
2021-01-15  7:40   ` Philip Webb
2021-01-15 15:09     ` Walter Dnes
2021-01-15  8:24   ` Walter Dnes
2021-01-15 16:28 ` [gentoo-user] Re: [OT SOLVED] " Walter Dnes

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox