From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from lists.gentoo.org (pigeon.gentoo.org [208.92.234.80]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by finch.gentoo.org (Postfix) with ESMTPS id 33C391382C5 for ; Thu, 14 Jan 2021 20:49:42 +0000 (UTC) Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id E4533E0844; Thu, 14 Jan 2021 20:49:34 +0000 (UTC) Received: from pmta11.teksavvy.com (pmta11.teksavvy.com [76.10.157.34]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by pigeon.gentoo.org (Postfix) with ESMTPS id 97747E0815 for ; Thu, 14 Jan 2021 20:49:34 +0000 (UTC) IronPort-SDR: xQc5BhvloLOhPb3vEPmf9camzsNZjKoBoGZNdVRCHzp0yLfW/ehllXDkE66tbehd0BPDIR58xv xRRV6Tw5y5IA== X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: =?us-ascii?q?A2FBLADrrABg/8eESC1iHgEBCxIMQAe?= =?us-ascii?q?DPoErV2KIc4RQhjQBgXkdO4JdXpBbh24BCgEBAQEBAQEBARwQCQECBAEBAoR?= =?us-ascii?q?CBIFzJjgTAgMBAQsBAQEFAQEBAQEGBAIChk4Mg1WBBwEBAQEBAQEBAQEBAQE?= =?us-ascii?q?BAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQUCDRJCEls8HHIiTRODJ4F7AYEKD65?= =?us-ascii?q?LGjx0gTSEQAEDAw5ARIUQBoE4jXQ/gQI/g3UHg0kDAocyIgSCPoERASuBQkE?= =?us-ascii?q?+L48yRotReokykiUKgneBGAaIEJJbkxaPUJ8ykTmHDFuBCw4HfQiDJFAZDVe?= =?us-ascii?q?DeYxuVIp2JjACNQIGCgEBAwlXAYxrAQE?= X-IPAS-Result: =?us-ascii?q?A2FBLADrrABg/8eESC1iHgEBCxIMQAeDPoErV2KIc4RQh?= =?us-ascii?q?jQBgXkdO4JdXpBbh24BCgEBAQEBAQEBARwQCQECBAEBAoRCBIFzJjgTAgMBA?= =?us-ascii?q?QsBAQEFAQEBAQEGBAIChk4Mg1WBBwEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBA?= =?us-ascii?q?QEBAQEBAQEBAQEBAQUCDRJCEls8HHIiTRODJ4F7AYEKD65LGjx0gTSEQAEDA?= =?us-ascii?q?w5ARIUQBoE4jXQ/gQI/g3UHg0kDAocyIgSCPoERASuBQkE+L48yRotReokyk?= =?us-ascii?q?iUKgneBGAaIEJJbkxaPUJ8ykTmHDFuBCw4HfQiDJFAZDVeDeYxuVIp2JjACN?= =?us-ascii?q?QIGCgEBAwlXAYxrAQE?= X-IronPort-AV: E=Sophos;i="5.79,347,1602561600"; d="scan'208";a="155179012" Received: from 45-72-132-199.cpe.teksavvy.com (HELO waltdnes.org) ([45.72.132.199]) by smtp11.teksavvy.com with SMTP; 14 Jan 2021 15:49:32 -0500 Received: by waltdnes.org (sSMTP sendmail emulation); Thu, 14 Jan 2021 15:49:41 -0500 From: "Walter Dnes" Date: Thu, 14 Jan 2021 15:49:41 -0500 To: Gentoo Users List Subject: [gentoo-user] [OT] Differences between wget and browser file retrieval? Message-ID: Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org X-Auto-Response-Suppress: DR, RN, NRN, OOF, AutoReply MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline X-Archives-Salt: 0e6c69eb-22ce-48b3-a87e-297388391027 X-Archives-Hash: e8cb870a8d05ec5de0efa4289195433a I'm bored, so I do a regular daily report at the DSL Reports "CanChat" sub-forum, on the Covid-19 case counts for Ontario, using provincial data. I download 2 files daily as source data. One of them is a PDF file, which is run through "pdftotext" and then parsed by a bash script (don't ask). Today, the command... wget https://files.ontario.ca/moh-covid-19-report-en-2021-01-14.pdf ...returns a zero-byte file. *BUT*, sticking the URL into the URL bar of Pale Moon and Google Chrome (and I assume Firefox/etc) brings up the PDF file just fine. Is "wget" being blocked? I have to do extra steps to get from the browser-invoked PDF to get the PDF file saved to the standard work area where my script expects it to be, so it can work its magic and parse out the daily breakdown by PHU (Public Health Unit). BTW, today's posts requiring the PDF file are... https://www.dslreports.com/forum/r33002718- https://www.dslreports.com/forum/r33002752- I've tried setting --user-agent= with my browser's string as shown by https://www.whatismybrowser.com/detect/what-is-my-user-agent but no luck. Is there some way to get around this? I have not updated this past week, so I don't think the problem is at my end. -- Walter Dnes I don't run "desktop environments"; I run useful applications