Subject: Re: [boost] RFC - Updated MapReduce library
From: Phil Endecott (spam_from_boost_dev_at_[hidden])
Date: 2009-08-09 08:46:07
Craig Henderson wrote:
> I have considerably revised and updated my MapReduce library and committed
> the changes to the Sandbox. I've also uploaded to the Vault for easy access
> I am very interested to hear any comment on design, code, performance or any
> other area.
Quoting from the start of your docs:
"The Boost.MapReduce library is a MapReduce implementation across a
plurality of CPU cores rather than machines."
Isn't that rather missing the point of what MapReduce is supposed to be
about? If I'm limited to one machine, I can write parallel code using
the full repertoire of techniques. By re-designing my application to
fit into the MapReduce pattern I can potentially scale it over multiple
machines. But if I can't scale over multiple machines, why bother?
Are you planning to support scaling over multiple machines in the future?
Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk