Boost :

Date view	Thread view	Subject view	Author view

Subject: Re: [boost] [gsoc] boost.simd news from the front.
From: Mathias Gaunard (mathias.gaunard_at_[hidden])
Date: 2011-06-14 20:25:07

Next message: Christophe Henry: "Re: [boost] [MSM] exit_ps stuks in case of outer state machine uses Row with event to event base class"
Previous message: Loïc Joly: "[boost] [shared_ptr] Design question about make_shared"
In reply to: David A. Greene: "Re: [boost] [gsoc] boost.simd news from the front."
Next in thread: David A. Greene: "Re: [boost] [gsoc] boost.simd news from the front."
Reply: David A. Greene: "Re: [boost] [gsoc] boost.simd news from the front."

On 14/06/2011 23:38, David A. Greene wrote:
> Mathias Gaunard<mathias.gaunard_at_[hidden]> writes:
>
>> We generate something along the lines of
>>
>> float tmp = 0.f;
>> for(int i ....)
>> tmp += d[i] + e[i];
>>
>> for(int i ...)
>> f[i] = b[i] + 3 * c[i] + tmp;
>>
>
> Will NT2 fuse the loops to get rid of the temporary?

Exactly how can you fuse the loops here?

This is actually an instance of splitting, where we extract things that
cannot/shouldn't be done in a single loop (or a single kernel).

> Does it do
> strip-mining or other such things (beyond that needed for
> vectorization)? Does NT2 try to generate a loop nest with the
> appropriate loops interchanged to improve performance?

Loops are in the cache-friendly order, obviously. Smarter things are
usually only done for higher-level abstractions than simple tables.

Loop fusion of different expressions is somewhat limited to what we
statically know about the sizes of the tables we're dealing with.

> I am really, really interested in this. Abstracting loops for HPC is a
> really good idea, in my mind. It would be best if there was an option
> to leave the resulting loops scalar in case the user wants to try to
> have the compiler vectorize them.

All components of the system are meant to be independent, so that you
can only use the part you want.

Next message: Christophe Henry: "Re: [boost] [MSM] exit_ps stuks in case of outer state machine uses Row with event to event base class"
Previous message: Loïc Joly: "[boost] [shared_ptr] Design question about make_shared"
In reply to: David A. Greene: "Re: [boost] [gsoc] boost.simd news from the front."
Next in thread: David A. Greene: "Re: [boost] [gsoc] boost.simd news from the front."
Reply: David A. Greene: "Re: [boost] [gsoc] boost.simd news from the front."

Date view	Thread view	Subject view	Author view

Boost list run by bdawes at acm.org, gregod at cs.rpi.edu, cpdaniel at pacbell.net, john at johnmaddock.co.uk