|
Boost : |
From: David Abrahams (david.abrahams_at_[hidden])
Date: 2002-02-19 18:57:21
Thanks Achim, Carl Daniel has generously volunteered to do that
download/conversion. I've cc'd him in case he wants help from you.
-Dave
----- Original Message -----
From: "Achim Domma" <achim.domma_at_[hidden]>
To: <boost_at_[hidden]>
Sent: Tuesday, February 19, 2002 6:08 PM
Subject: RE: [boost] http savvy?
> Hi,
>
> what's the status about the conversion of the html ? I'm just downloading
> the messages to my computer (about 2sec per message -> about 14 hours
> downloading). Is there more help needed ... or am I to late ? If not :
Where
> can I read about how mbox format looks like ? (I'm a windows user ;-) )
>
> greetings
> Achim
>
> > -----Original Message-----
> > From: David Abrahams [mailto:david.abrahams_at_[hidden]]
> > Sent: Tuesday, February 19, 2002 18:21
> > To: boost_at_[hidden]
> > Subject: Re: [boost] http savvy?
> >
> >
> > Great, offer accepted!
> >
> > Please do it, and if you can additionally convert the messages to mbox
> > format you'll earn my eternal gratitude. There's no need for XML AFAIK;
> > mailman doesn't read XML, but deduces the threads from mbox. So that
would
> > just create more work.
> >
> > In fact, someone wrote a really cool Python-based thread viewer
> > for Mailman
> > archives which we can get, so /really/ don't bother with the XML ;-)
> >
> >
> > Thanks, Carl!!!
> >
> > ----- Original Message -----
> > From: "Carl Daniel" <cpdaniel_at_[hidden]>
> > To: <boost_at_[hidden]>
> > Sent: Tuesday, February 19, 2002 11:58 AM
> > Subject: Re: [boost] http savvy?
> >
> >
> > > From: "Daniel Frey" <daniel.frey_at_[hidden]>
> > >
> > > > In case you can't give the brower itself away: Can you read
> > all messages
> > > > using it and pass the messages to David?
> > >
> > > I'm looking into that too - there's a bit over 25,500 messages, I'm
> > estimating 40Mb or so downloaded size (not including
> > > all the HTML junk that Yahoo adds). I have a reasonably fast Internet
> > connection, so I expect I could get them pulled
> > > down in a few days at most. The pages are formatted such that
> > the message
> > content itself is easily identified, so I
> > > could save out only that content. The thread information would
> > be easily
> > captured as well, perhaps saving the messages
> > > as XML along with tags for their replies would be a sensible choice.
> > >
> > > Once downloaded, I could ZIP the whole thing up & push it to an
> > FTP server
> > somewhere.
> > >
> > > Thoughts? Dave?
> > >
> > > -cd
> > >
> > >
> > >
> > >
> > > Info: http://www.boost.org Send unsubscribe requests to:
> > <mailto:boost-unsubscribe_at_[hidden]>
> > >
> > > Your use of Yahoo! Groups is subject to
> http://docs.yahoo.com/info/terms/
> >
> >
>
>
> Info: http://www.boost.org Send unsubscribe requests to:
> <mailto:boost-unsubscribe_at_[hidden]>
>
> Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
>
>
>
>
>
> Info: http://www.boost.org Send unsubscribe requests to:
<mailto:boost-unsubscribe_at_[hidden]>
>
> Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
>
>
Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk