From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from lists.gentoo.org (pigeon.gentoo.org [208.92.234.80]) by finch.gentoo.org (Postfix) with ESMTP id 6D52B1389FE for ; Fri, 31 Oct 2014 22:22:56 +0000 (UTC) Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id 0DDE0E0883; Fri, 31 Oct 2014 22:22:47 +0000 (UTC) Received: from mail-qc0-f174.google.com (mail-qc0-f174.google.com [209.85.216.174]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by pigeon.gentoo.org (Postfix) with ESMTPS id CC491E07E2 for ; Fri, 31 Oct 2014 22:22:45 +0000 (UTC) Received: by mail-qc0-f174.google.com with SMTP id r5so6570702qcx.19 for ; Fri, 31 Oct 2014 15:22:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:content-type; bh=D9QstdTlY363L5P3qQfN/oJFTrQUZu+ikPHqqMC4zHQ=; b=nR7A+xVETDpS3VDqPpoGo2F/sL4qoEFUwsZlLWJVeQpRtv4nBaG8s4M46At/ma2TdS TRFutD9LizTJ5SSPmdJtyMKHj/Rc2bs5CXxRBWOYbdKw70sTi3E77veq1nfOm1887MxA /lEfZkhx2zCy9nkomIO62YSsYpv1MdomxCjTvLveaUoHgVzia83XFl2jbZ5yKBQgmv19 B6hxpnbikIOlaUVePiXe4Nd5QdBaRDKR+mLFPSnGpRrjDhwJR5eyfciSyyxVaYj4q5eB UcOZgYj2eIcvzCoDHcNKJ3AHkot1xsrqyn3A0WwpF1CEk2UYmEntIFDSbT3wSn+K84yE DCBw== Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org MIME-Version: 1.0 X-Received: by 10.224.79.146 with SMTP id p18mr39654173qak.67.1414794165015; Fri, 31 Oct 2014 15:22:45 -0700 (PDT) Sender: freemanrich@gmail.com Received: by 10.140.102.134 with HTTP; Fri, 31 Oct 2014 15:22:44 -0700 (PDT) In-Reply-To: References: <20141031153659.GA13217@solfire> <5453AE7D.8060505@ramses-pyramidenbau.de> <20141031155917.GB13217@solfire> <20141031185545.GA536@grusum.endjinn.de> Date: Fri, 31 Oct 2014 18:22:44 -0400 X-Google-Sender-Auth: bajidnC2Rjes8vzfQrRPpwD5VZA Message-ID: Subject: Re: [gentoo-user] Re: OT Best way to compress files with digits From: Rich Freeman To: gentoo-user@lists.gentoo.org Content-Type: text/plain; charset=UTF-8 X-Archives-Salt: c1189c32-71a3-4fcd-b635-cdf0dc0ab241 X-Archives-Hash: 10313895e20ee4709ca1791934eaa146 On Fri, Oct 31, 2014 at 4:25 PM, Grant Edwards wrote: > > You're cheating. The algorithm you tested will compress strings of > arbitrary 8-bit values. The algorithm you proposed will only compress > strings of bytes where each byte can have only one of 10 values. > Of course. I wasn't expecting the general-purpose algorithm to do as well. In some sense, part of the information that is being encoded is actually in the compression algorithm itself (the mapping), while in a general-purpose compression algorithm that information has to be part of the compressed data stream. I was just expecting gzip/etc to get much closer to the theoretical limit. I figured that it might be a few percent higher, but I wasn't expecting a 10+% difference. -- Rich