public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed
From: <nunojsilva@ist.utl.pt> (Nuno Silva)
To: gentoo-user@lists.gentoo.org
Subject: [gentoo-user] Re: [OT] Saving an image as black and white
Date: Mon, 01 Mar 2021 12:48:25 +0000	[thread overview]
Message-ID: <s1inqp$5tc$1@ciao.gmane.io> (raw)
In-Reply-To: 603CE962.7070202@youngman.org.uk

On 2021-03-01, Wols Lists wrote:

> On 01/03/21 12:11, (Nuno Silva) wrote:
>> On 2021-03-01, Wols Lists wrote:
>> 
>>> I've got a bunch of scans, let's assume they're text documents. And
>>> they're rather big ... I want to email them.
>>>
>>> How on earth do I convert them to TRUE b&w documents? At the moment they
>>> are jpegs that weigh in at 3MB, and I guess they're using about 5 bytes
>>> to store all the colour, luminance, whatever, per pixel. But actually,
>>> there's only ONE BIT of information there - whether that pixel is black
>>> or white.
>>>
>>> I'm using imagemagick, but so far all my attempts to strip out the
>>> surplus information have resulted in INcreasing the file size ???
>>>
>>> So basically, how do I save an image as "one bit per pixel" like you'd
>>> think you'd send to a B&W printer?
>>>
>>> Even at 300dpi, I make that 300*300/8 ~= 10KB/in^2 or 800KB of
>>> uncompressed info for a page of A4, not 3MB.
>>>
>>> Cheers,
>>> Wol
>> 
>> Somebody else might have a better suggestion, or perhaps a better
>> understanding of the JPEG format and of what needs to be tuned, but, for
>> example:
>> 
>>     convert origin.jpg -threshold 70% -monochrome result.jpg
>> 
>> (And adjust the "-threshold percent" if needed. It might be that you
>> don't need thresholding at all, but if you do, it apparently must go
>> before "-monochrome".)
>> 
>> (Depending on the receiving end, you could also explore other
>> formats. Here, if the scanned document can be stored in monochrome, I
>> usually use djvu.)
>> 
> Thanks but no, I've already tried that. It makes matters worse!
>
> I've messed about with the scanner, so it is now creating 800KB images,
> but I don't want to rescan everything I've done.
>
> The problem is that it is clearly saving the images as greyscale, not as
> black&white. And when I search for help, what I want is swamped by all
> the false positives for greyscale.
>
> Oh - and for Nuno - sorry tesseract is no use, they are NOT text. That's
> why I used the word "assume" - to make it clear that I want a
> 1-bit/pixel palette, not a 5-byte/pixel greyscale.
>
> Cheers,
> Wol

Sorry, my bad - I was checking the file sizes, but I didn't notice the
larger one was the new, "monochrome" version. More coffee needed, it
seems.

-- 
Nuno Silva



  reply	other threads:[~2021-03-01 12:48 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-03-01 11:50 [gentoo-user] [OT] Saving an image as black and white Wols Lists
2021-03-01 12:01 ` Hund
2021-03-01 12:11 ` [gentoo-user] " Nuno Silva
2021-03-01 13:17   ` Wols Lists
2021-03-01 12:48     ` Nuno Silva [this message]
2021-03-01 13:24     ` William Kenworthy
2021-03-01 13:48 ` [gentoo-user] " Neil Bothwick
2021-03-01 14:22   ` Rich Freeman
2021-03-01 15:54   ` Wols Lists
2021-03-01 18:00     ` Rich Freeman
2021-03-04 20:03     ` Frank Steinmetzger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='s1inqp$5tc$1@ciao.gmane.io' \
    --to=nunojsilva@ist.utl.pt \
    --cc=gentoo-user@lists.gentoo.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox