From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from pigeon.gentoo.org ([208.92.234.80] helo=lists.gentoo.org) by finch.gentoo.org with esmtp (Exim 4.60) (envelope-from ) id 1NJnKL-0004Us-5a for garchives@archives.gentoo.org; Sun, 13 Dec 2009 12:13:29 +0000 Received: from pigeon.gentoo.org (localhost [127.0.0.1]) by pigeon.gentoo.org (Postfix) with SMTP id D39B7E06BE; Sun, 13 Dec 2009 12:12:50 +0000 (UTC) Received: from smtpout.karoo.kcom.com (smtpout.karoo.kcom.com [212.50.160.34]) by pigeon.gentoo.org (Postfix) with ESMTP id 93269E06BE for ; Sun, 13 Dec 2009 12:12:50 +0000 (UTC) X-IronPort-AV: E=Sophos;i="4.47,390,1257120000"; d="scan'208";a="150146826" Received: from unknown (HELO compaq.stroller.uk.eu.org) ([213.152.39.90]) by smtpout.karoo.kcom.com with ESMTP; 13 Dec 2009 12:12:49 +0000 Received: from [192.168.1.102] (unknown [192.168.1.102]) by compaq.stroller.uk.eu.org (Postfix) with ESMTP id 20623124BE for ; Sun, 13 Dec 2009 12:12:47 +0000 (GMT) Content-Type: text/plain; charset=us-ascii Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Id: Gentoo Linux mail X-BeenThere: gentoo-user@lists.gentoo.org Reply-to: gentoo-user@lists.gentoo.org Mime-Version: 1.0 (Apple Message framework v1077) Subject: Re: [gentoo-user] OT: extract an image from a .doc file? From: Stroller In-Reply-To: <200912131051.17996.michaelkintzios@gmail.com> Date: Sun, 13 Dec 2009 12:12:46 +0000 Content-Transfer-Encoding: quoted-printable Message-Id: <7CDEF608-BEEB-4EB6-B95D-CA1B0BE26567@stellar.eclipse.co.uk> References: <19C9F1BB-65F4-4D4C-8506-160A471F1625@stellar.eclipse.co.uk> <200912131051.17996.michaelkintzios@gmail.com> To: gentoo-user@lists.gentoo.org X-Mailer: Apple Mail (2.1077) X-Archives-Salt: 26861c06-e7fd-4157-9914-cd1f04505b5b X-Archives-Hash: 59fc0791d0dcd1f9d646d10920412a76 On 13 Dec 2009, at 10:50, Mick wrote: > On Sunday 13 December 2009 08:46:05 Stroller wrote: >> A .doc file contains an image. Is there any way to extract the image >> file in its original format, please? >> .... I have tried in OpenOffice on Windows and Word for Mac. In >> OpenOffice I can't see any way to save the image file,=20 >=20 > I don't know about MSWindows, but in OOo-bin in Linux I can = right-click on the=20 > image and select 'Save graphics' when the image is jpeg/png/etc. Not = sure if=20 > this works with MS embedded images/files from e.g. Powerpoint. This is strange. I get the same thing in Open Office (on Windows) if I = create a new .doc and add a jpeg to it. Right-clicking on the image gives me a menu of: Arrange, Alignment, = Anchor, Wrap, (separator), Picture..., Save Graphics..., Caption..., = ImageMap, (separator), Cut, Copy, Paste. If I open the file(s) I have the interest in, the first 4 entries in the = context-menu are the same, but after the first separator I get instead = "Object" (which did not appear previously) and "Caption". There is then = another separator and instead of Cut, Copy, Paste, I see only Cut & = Copy. This file was created by the software that a lettings agency uses to = manage their properties. It runs on Windows and automatically generates = letters (for overdue rent, inspections &c) in .doc format. One image in = question is the boss' signature, so the letters appear like he actually = signed them, but I think they also use company logos in other letters. Apart from that, I don't see why this image is treated differently by = OpenOffice. Isn't there a program (command line?) for converting .doc into HTML? = Maybe that would extract the image. The reason I'd like to see this is because some of the .doc files are 2 = meg in size (some others exactly 1meg, so cluster size may affect this) = and there are thousands of them taking up space on the server. If the = image is to blame then we would benefit many times from the size saving. = I haven't yet spoken to the site about this, only discovering it = yesterday, so I don't know if I can find the file by accessing the = property management software. Cheers, Stroller.=