|
Boost : |
From: Achim Domma (achim.domma_at_[hidden])
Date: 2002-02-19 18:08:53
Hi,
what's the status about the conversion of the html ? I'm just downloading
the messages to my computer (about 2sec per message -> about 14 hours
downloading). Is there more help needed ... or am I to late ? If not : Where
can I read about how mbox format looks like ? (I'm a windows user ;-) )
greetings
Achim
> -----Original Message-----
> From: David Abrahams [mailto:david.abrahams_at_[hidden]]
> Sent: Tuesday, February 19, 2002 18:21
> To: boost_at_[hidden]
> Subject: Re: [boost] http savvy?
>
>
> Great, offer accepted!
>
> Please do it, and if you can additionally convert the messages to mbox
> format you'll earn my eternal gratitude. There's no need for XML AFAIK;
> mailman doesn't read XML, but deduces the threads from mbox. So that would
> just create more work.
>
> In fact, someone wrote a really cool Python-based thread viewer
> for Mailman
> archives which we can get, so /really/ don't bother with the XML ;-)
>
>
> Thanks, Carl!!!
>
> ----- Original Message -----
> From: "Carl Daniel" <cpdaniel_at_[hidden]>
> To: <boost_at_[hidden]>
> Sent: Tuesday, February 19, 2002 11:58 AM
> Subject: Re: [boost] http savvy?
>
>
> > From: "Daniel Frey" <daniel.frey_at_[hidden]>
> >
> > > In case you can't give the brower itself away: Can you read
> all messages
> > > using it and pass the messages to David?
> >
> > I'm looking into that too - there's a bit over 25,500 messages, I'm
> estimating 40Mb or so downloaded size (not including
> > all the HTML junk that Yahoo adds). I have a reasonably fast Internet
> connection, so I expect I could get them pulled
> > down in a few days at most. The pages are formatted such that
> the message
> content itself is easily identified, so I
> > could save out only that content. The thread information would
> be easily
> captured as well, perhaps saving the messages
> > as XML along with tags for their replies would be a sensible choice.
> >
> > Once downloaded, I could ZIP the whole thing up & push it to an
> FTP server
> somewhere.
> >
> > Thoughts? Dave?
> >
> > -cd
> >
> >
> >
> >
> > Info: http://www.boost.org Send unsubscribe requests to:
> <mailto:boost-unsubscribe_at_[hidden]>
> >
> > Your use of Yahoo! Groups is subject to
http://docs.yahoo.com/info/terms/
>
>
Info: http://www.boost.org Send unsubscribe requests to:
<mailto:boost-unsubscribe_at_[hidden]>
Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk