public inbox for gentoo-user@lists.gentoo.org
* [gentoo-user]  Piggy Bank as a screen scraper
From: Thufir @ 2007-08-02  7:26 UTC
  To: gentoo-user

I glanced over an article about Piggy Bank,
<http://simile.mit.edu/wiki/Piggy_Bank>, which interests me as a screen
scraper.

What I have in mind are RSS feeds from <http://www.craigslist.org/>.
Now, I can set up Feed-on-Feeds <http://code.google.com/p/feed-on-feeds/>
so that lots of data from Craigslist downloads into the MySQL database.

However, much of the useful detail is buried in the text :(

So, this now makes me think of screen scraping the Feed-on-Feeds
interface.  Kinda backwards, and I'm sure others would come up with
something more sophisticated, like accessing the database directly, but...

Anyhow, my thinking is to use Piggy Bank to break down the feed text and
get at some of the data buried in it.  Then I can add that data to the
database to better track, well, whatever.
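
To make the idea concrete, here is a rough sketch of the "pull details out
of the text, then store them" step.  It does not use Piggy Bank or
Feed-on-Feeds at all: it is plain Python, the sample feed is made up, the
price and area patterns are only guesses at what a listing title might
look like, and sqlite3 stands in for the MySQL database.  The names
extract_details, store and listing_details are hypothetical.

```python
import re
import sqlite3
import xml.etree.ElementTree as ET

# Made-up RSS 2.0 sample for illustration; real feeds may be laid out differently.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>example listings</title>
    <item>
      <title>Oak desk - $120 (downtown)</title>
      <link>http://example.org/item/1</link>
      <description>Solid oak desk, good condition.</description>
    </item>
    <item>
      <title>Free moving boxes (eastside)</title>
      <link>http://example.org/item/2</link>
      <description>About twenty boxes, pick up only.</description>
    </item>
  </channel>
</rss>"""

# Assumed patterns: a dollar amount anywhere, an area in parentheses at the end of the title.
PRICE_RE = re.compile(r"\$(\d[\d,]*)")
AREA_RE = re.compile(r"\(([^)]+)\)\s*$")


def extract_details(rss_text):
    """Return one dict per <item>, with price and area pulled out of the text."""
    root = ET.fromstring(rss_text)
    rows = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        body = item.findtext("description", default="")
        price = PRICE_RE.search(title + " " + body)
        area = AREA_RE.search(title)
        rows.append({
            "link": item.findtext("link", default=""),
            "title": title,
            "price": int(price.group(1).replace(",", "")) if price else None,
            "area": area.group(1) if area else None,
        })
    return rows


def store(rows, conn):
    """Write the extracted fields to a table (sqlite3 here, as a stand-in for MySQL)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listing_details "
        "(link TEXT PRIMARY KEY, title TEXT, price INTEGER, area TEXT)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO listing_details "
        "VALUES (:link, :title, :price, :area)",
        rows,
    )
    conn.commit()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    store(extract_details(SAMPLE_RSS), conn)
    for row in conn.execute("SELECT title, price, area FROM listing_details"):
        print(row)
```

Keying the table on the link means re-running against the same feed just
overwrites rows instead of piling up duplicates.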

Just kinda excited at the prospect of a new tool :)

While Piggy Bank may not really be Linux-specific, and definitely not
Gentoo-specific, I just really like the way the different Linux magazines
talk about software and tools, getting things done :)


-Thufir

-- 
gentoo-user@gentoo.org mailing list


