|
Boost : |
From: David Abrahams (david.abrahams_at_[hidden])
Date: 2002-02-19 12:21:22
Great, offer accepted!
Please do it, and if you can additionally convert the messages to mbox
format you'll earn my eternal gratitude. There's no need for XML AFAIK;
mailman doesn't read XML, but deduces the threads from mbox. So that would
just create more work.
In fact, someone wrote a really cool Python-based thread viewer for Mailman
archives which we can get, so /really/ don't bother with the XML ;-)
Thanks, Carl!!!
----- Original Message -----
From: "Carl Daniel" <cpdaniel_at_[hidden]>
To: <boost_at_[hidden]>
Sent: Tuesday, February 19, 2002 11:58 AM
Subject: Re: [boost] http savvy?
> From: "Daniel Frey" <daniel.frey_at_[hidden]>
>
> > In case you can't give the brower itself away: Can you read all messages
> > using it and pass the messages to David?
>
> I'm looking into that too - there's a bit over 25,500 messages, I'm
estimating 40Mb or so downloaded size (not including
> all the HTML junk that Yahoo adds). I have a reasonably fast Internet
connection, so I expect I could get them pulled
> down in a few days at most. The pages are formatted such that the message
content itself is easily identified, so I
> could save out only that content. The thread information would be easily
captured as well, perhaps saving the messages
> as XML along with tags for their replies would be a sensible choice.
>
> Once downloaded, I could ZIP the whole thing up & push it to an FTP server
somewhere.
>
> Thoughts? Dave?
>
> -cd
>
>
>
>
> Info: http://www.boost.org Send unsubscribe requests to:
<mailto:boost-unsubscribe_at_[hidden]>
>
> Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
>
>
Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk