Boost :

kim.walisch_at_[hidden]

---
This email has been checked for viruses by AVG.
http://www.avg.com
_______________________________________________
Unsubscribe & other changes: http://lists.boost.org/mailman
/listinfo.cgi/boost
OK, I'll try to narrow it down. The simplest algorithm using the
int256_t type in primesum is S2_trivial. You can have a look at
the algorithm here:
https://github.com/kimwalisch/primesum/blob/256-bit/src/
deleglise-rivat/S2_trivial.cpp#L59
There are only 2 lines of code (62-63) using the int256_t type in
this algorithm:
maxint_t diff = prime_sums[pi[y]] - prime_sums[pi[xn]];
s2_trivial += prime * diff;
Note that maxint_t is a typedef for int256_t. The first line does an
__int128_t substraction and converts the result (impliciltly) to int256_t.
The second code line does an int256_t multiplication and adds the
result to the int256_t s2_trivial variable.
As soon as I add -std=c++11 to the compiler flags the algorithm runs
15% slower (using Clang and GCC on Linux x86_64). Funnily, if I
change the code lines to:
maxint_t diff = prime_sums[pi[y]] - prime_sums[pi[xn]];
maxint_t prime2 = prime;
diff *= prime2;
s2_trivial += diff;
This code runs already 11% faster using -std=c++11 even though it does
exactly the same (and only 4% slower than without -std=c++11).
Without -std=c++11 this code does not run faster. My code mixes
__int128_t with int256_t a lot and one of my guesses on what causes
the slowdown is that the __int128_t to int256_t conversion has become
much slower (in some cases) using -std=c++11.
Kim