public inbox for gentoo-user@lists.gentoo.org
 help / color / mirror / Atom feed
* [gentoo-user] pdf -> txt
@ 2011-01-06  3:45 meino.cramer
  2011-01-06  4:19 ` [gentoo-user] " Nikos Chantziaras
  2011-01-07  1:40 ` [gentoo-user] Re: pdf -> txt James
  0 siblings, 2 replies; 10+ messages in thread
From: meino.cramer @ 2011-01-06  3:45 UTC (permalink / raw
  To: Gentoo



Hi,

 I want to convert a couple of pdf-documents, which
 are of test and "ASCIIbased tables", to pure text
 (ASCII, vim-editable ;) ).

 What tool/s are worth being tried out for this task?

 Thank you very much in advance for any help!

 Best regards
 mcc




^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-user] Re: pdf -> txt
  2011-01-06  3:45 [gentoo-user] pdf -> txt meino.cramer
@ 2011-01-06  4:19 ` Nikos Chantziaras
  2011-01-08 10:48   ` Nuno J. Silva
  2011-01-07  1:40 ` [gentoo-user] Re: pdf -> txt James
  1 sibling, 1 reply; 10+ messages in thread
From: Nikos Chantziaras @ 2011-01-06  4:19 UTC (permalink / raw
  To: gentoo-user

On 01/06/2011 05:45 AM, meino.cramer@gmx.de wrote:
>
>
> Hi,
>
>   I want to convert a couple of pdf-documents, which
>   are of test and "ASCIIbased tables", to pure text
>   (ASCII, vim-editable ;) ).
>
>   What tool/s are worth being tried out for this task?
>
>   Thank you very much in advance for any help!

pdftotext might help.  Comes with app-text/poppler.




^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-user] Re: pdf -> txt
  2011-01-06  3:45 [gentoo-user] pdf -> txt meino.cramer
  2011-01-06  4:19 ` [gentoo-user] " Nikos Chantziaras
@ 2011-01-07  1:40 ` James
  2011-01-07 16:18   ` meino.cramer
  1 sibling, 1 reply; 10+ messages in thread
From: James @ 2011-01-07  1:40 UTC (permalink / raw
  To: gentoo-user

 <meino.cramer <at> gmx.de> writes:


>  I want to convert a couple of pdf-documents, which
>  are of test and "ASCIIbased tables", to pure text
>  (ASCII, vim-editable ;) ).

app-text/pdftk

http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/


Is very cool for all things PDF on linux.....


hth,
James






^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-user] Re: pdf -&gt; txt
  2011-01-07  1:40 ` [gentoo-user] Re: pdf -&gt; txt James
@ 2011-01-07 16:18   ` meino.cramer
  2011-01-08 15:24     ` [gentoo-user] Re: pdf -&amp;gt; txt James
  0 siblings, 1 reply; 10+ messages in thread
From: meino.cramer @ 2011-01-07 16:18 UTC (permalink / raw
  To: gentoo-user

James <wireless@Tampabay.rr.com> [11-01-07 04:03]:
>  <meino.cramer <at> gmx.de> writes:
> 
> 
> >  I want to convert a couple of pdf-documents, which
> >  are of test and "ASCIIbased tables", to pure text
> >  (ASCII, vim-editable ;) ).
> 
> app-text/pdftk
> 
> http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/
> 
> 
> Is very cool for all things PDF on linux.....
> 
> 
> hth,
> James
> 

Hi James,

I think I need an additional help...

Can you give me a hint, how to create ASCII text from pdf files
(as mentioned in my initial mail) with pdftk?

Thank you very much in advance fro your help!

Best regards,
mcc





^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-user] Re: pdf -> txt
  2011-01-06  4:19 ` [gentoo-user] " Nikos Chantziaras
@ 2011-01-08 10:48   ` Nuno J. Silva
  0 siblings, 0 replies; 10+ messages in thread
From: Nuno J. Silva @ 2011-01-08 10:48 UTC (permalink / raw
  To: gentoo-user

Nikos Chantziaras <realnc@arcor.de> writes:

> On 01/06/2011 05:45 AM, meino.cramer@gmx.de wrote:
>>
>>
>> Hi,
>>
>>   I want to convert a couple of pdf-documents, which
>>   are of test and "ASCIIbased tables", to pure text
>>   (ASCII, vim-editable ;) ).
>>
>>   What tool/s are worth being tried out for this task?
>>
>>   Thank you very much in advance for any help!
>
> pdftotext might help.  Comes with app-text/poppler.

If the output looks messed up, don't give up, first see the manpage,
there are some switches that may result in a better output.

I think using -layout will help you with the tables.

-- 
Nuno J. Silva
gopher://sdf-eu.org/1/users/njsg




^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-user] Re: pdf -&amp;gt; txt
  2011-01-07 16:18   ` meino.cramer
@ 2011-01-08 15:24     ` James
  2011-01-08 16:47       ` meino.cramer
  0 siblings, 1 reply; 10+ messages in thread
From: James @ 2011-01-08 15:24 UTC (permalink / raw
  To: gentoo-user

 <meino.cramer <at> gmx.de> writes:


> Can you give me a hint, how to create ASCII text from pdf files

The simpliest method is to install app-text/acroread


There is a button at the top to save out the pdf file into
an txt file....


-->File-->Save as Text


Other tools, such as the one I mentioned previously, have
greater capability, but need to be 'explored' and 'researched'
and 'tested' by the potential user.


hth,

James




^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-user] Re: pdf -&amp;gt; txt
  2011-01-08 15:24     ` [gentoo-user] Re: pdf -&amp;gt; txt James
@ 2011-01-08 16:47       ` meino.cramer
  2011-01-09 23:45         ` [gentoo-user] Re: pdf -&amp;amp;gt; txt James
  0 siblings, 1 reply; 10+ messages in thread
From: meino.cramer @ 2011-01-08 16:47 UTC (permalink / raw
  To: gentoo-user

James <wireless@tampabay.rr.com> [11-01-08 17:16]:
>  <meino.cramer <at> gmx.de> writes:
> 
> 
> > Can you give me a hint, how to create ASCII text from pdf files
> 
> The simpliest method is to install app-text/acroread
> 
> 
> There is a button at the top to save out the pdf file into
> an txt file....
> 
> 
> -->File-->Save as Text
> 
> 
> Other tools, such as the one I mentioned previously, have
> greater capability, but need to be 'explored' and 'researched'
> and 'tested' by the potential user.
> 
> 
> hth,
> 
> James
> 
> 

Hi James,

I explored the manual of that tool (pdftk) but didnt find any
hint of converting pdf ot txt.

Please, give me one little, a keyword, only an option
which I can search for in the documentation to find
out how to convert pdf to txt with pdftk.

Thank you so much in advance for any help regarding this!

Best regards and have a nice weekend!
mcc




^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-user] Re: pdf -&amp;amp;gt; txt
  2011-01-08 16:47       ` meino.cramer
@ 2011-01-09 23:45         ` James
  2011-01-10  0:52           ` meino.cramer
  0 siblings, 1 reply; 10+ messages in thread
From: James @ 2011-01-09 23:45 UTC (permalink / raw
  To: gentoo-user

 <meino.cramer <at> gmx.de> writes:


> I explored the manual of that tool (pdftk) but didnt find any
> hint of converting pdf ot txt.
> Please, give me one little, a keyword, only an option
> which I can search for in the documentation to find
> out how to convert pdf to txt with pdftk.


Maybe this page can help? 
http://www.pdflabs.com/docs/pdftk-cli-examples/
Maybe pdftk is not the right tool for your needs?


If all you want to do is convert a pdf to text, just
use acroread (as previously suggested):

There is a button at the top to save out the pdf file into
a txt file....
-->File-->Save as Text


cya,
James




^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [gentoo-user] Re: pdf -&amp;amp;gt; txt
  2011-01-09 23:45         ` [gentoo-user] Re: pdf -&amp;amp;gt; txt James
@ 2011-01-10  0:52           ` meino.cramer
  2011-01-14 19:32             ` Sebastián Ramírez Magrí
  0 siblings, 1 reply; 10+ messages in thread
From: meino.cramer @ 2011-01-10  0:52 UTC (permalink / raw
  To: gentoo-user

James <wireless@tampabay.rr.com> [11-01-10 01:41]:
>  <meino.cramer <at> gmx.de> writes:
> 
> 
> > I explored the manual of that tool (pdftk) but didnt find any
> > hint of converting pdf ot txt.
> > Please, give me one little, a keyword, only an option
> > which I can search for in the documentation to find
> > out how to convert pdf to txt with pdftk.
> 
> 
> Maybe this page can help? 
> http://www.pdflabs.com/docs/pdftk-cli-examples/
> Maybe pdftk is not the right tool for your needs?
> 
> 
> If all you want to do is convert a pdf to text, just
> use acroread (as previously suggested):
> 
> There is a button at the top to save out the pdf file into
> a txt file....
> -->File-->Save as Text
> 
> 
> cya,
> James
> 
> 

Hi James,

I asked here for a tool to convert from pdf to txt.
I had to recompile the gcc (missing USE-flag) to install pdftk.
This effort only to get said:

    "Maybe pdftk is not the right tool for your needs?"

....






^ permalink raw reply	[flat|nested] 10+ messages in thread

* [gentoo-user] Re: pdf -&amp;amp;gt; txt
  2011-01-10  0:52           ` meino.cramer
@ 2011-01-14 19:32             ` Sebastián Ramírez Magrí
  0 siblings, 0 replies; 10+ messages in thread
From: Sebastián Ramírez Magrí @ 2011-01-14 19:32 UTC (permalink / raw
  To: gentoo-user

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

On 09/01/11 20:22, meino.cramer@gmx.de wrote:
> James <wireless@tampabay.rr.com> [11-01-10 01:41]:
>>  <meino.cramer <at> gmx.de> writes:
>>
>>
>>> I explored the manual of that tool (pdftk) but didnt find any
>>> hint of converting pdf ot txt.
>>> Please, give me one little, a keyword, only an option
>>> which I can search for in the documentation to find
>>> out how to convert pdf to txt with pdftk.
>>
>>
>> Maybe this page can help? 
>> http://www.pdflabs.com/docs/pdftk-cli-examples/
>> Maybe pdftk is not the right tool for your needs?
>>
>>
>> If all you want to do is convert a pdf to text, just
>> use acroread (as previously suggested):
>>
>> There is a button at the top to save out the pdf file into
>> a txt file....
>> -->File-->Save as Text
>>
>>
>> cya,
>> James
>>
>>
> 
> Hi James,
> 
> I asked here for a tool to convert from pdf to txt.
> I had to recompile the gcc (missing USE-flag) to install pdftk.
> This effort only to get said:
> 
>     "Maybe pdftk is not the right tool for your needs?"
> 
> ....
> 
> 
> 
> 
> 

If you use emacs you could also use docview mode to achieve that. It's a
little bit processor heavy but just does the work for me.

- -- 
http://sebasmagri.com/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.16 (GNU/Linux)

iQEcBAEBCAAGBQJNMKS+AAoJECm0W7LfoBgvv+4H/AkS/mBYmtGm6jnTu87F+3bW
0cLwrSbe7OhpBU1xZxhhiraNRKhDUTFZxZ3pipSxajtBdwsvMcpRPPTyNKz0mLiu
hOWlEIpDQchxTm1tMzCz3HB9mF+7G9MPvSjF0SYSKdt6R9okUm9FmAti8oXmy1db
RSymQCzKi1oBr97PKwpC/tavC5tTS9t2i+RUF7rIgV2KuVvHG0M+bHlFrpCDrDsi
Q6bac/Qz/pO35qSyNUWok4nBmKWdELztTSChC4+/0oYlmXtPab1c2s8C6XkcpmKK
hVvneGd9lfFmPsMeFVZfjQziRN3vbasFj2vl3rY/0aj9hssu85JK4nxdxqXLqlk=
=ykzz
-----END PGP SIGNATURE-----




^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2011-01-14 19:34 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-01-06  3:45 [gentoo-user] pdf -> txt meino.cramer
2011-01-06  4:19 ` [gentoo-user] " Nikos Chantziaras
2011-01-08 10:48   ` Nuno J. Silva
2011-01-07  1:40 ` [gentoo-user] Re: pdf -&gt; txt James
2011-01-07 16:18   ` meino.cramer
2011-01-08 15:24     ` [gentoo-user] Re: pdf -&amp;gt; txt James
2011-01-08 16:47       ` meino.cramer
2011-01-09 23:45         ` [gentoo-user] Re: pdf -&amp;amp;gt; txt James
2011-01-10  0:52           ` meino.cramer
2011-01-14 19:32             ` Sebastián Ramírez Magrí

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox