* [gentoo-user] Piggy Bank as a screen scraper
@ 2007-08-02 7:26 99% Thufir
0 siblings, 0 replies; 1+ results
From: Thufir @ 2007-08-02 7:26 UTC (permalink / raw
To: gentoo-user
I glanced over an article about Piggy Bank, <http://simile.mit.edu/wiki/
Piggy_Bank>, which interests me as a screen scraper.
What I have in mind are RSS feeds from <http://www.craigslist.org/>.
Now, I can setup Feed-on-Feeds <http://code.google.com/p/feed-on-feeds/>
so that lotsa data from Craigslist downloads into the MySQL database.
However, much of the useful detail is buried in the text :(
So, this now makes me think of screen scraping the Feed-on-Feeds
interface. Kinda backwards, I'm sure others would come up with something
more sophisticated, directly accessing the database, but...
Anyhow, my thinking is to use this piggy bank to break down, get at, some
of the data. Then I can add that to the database to better track, well,
whatever.
Just kinda excited at the prospect of a new tool :)
While piggy bank may not really be Linux specific, and definitely not
Gentoo specific, I just really like the way the different Linux magazines
talk about software and tools, getting things done :)
-Thufir
--
gentoo-user@gentoo.org mailing list
^ permalink raw reply [relevance 99%]
Results 1-1 of 1 | reverse | options above
-- pct% links below jump to the message on this page, permalinks otherwise --
2007-08-02 7:26 99% [gentoo-user] Piggy Bank as a screen scraper Thufir
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox