Hi, I am somewhat new to Boost.uBlas,
but having examined it a bit, it is not clear to me that it supports block
storage organization, as opposed to row major or column major. The intent
of this capability is to take into account the cache size of the
architecture to optimize sub-matrix operations.
The usual way of representing matrix
in memory does not always take full advantage of the cache line. For example,
matrix multiplication could be made faster if all elements of certain sub-matrixes
are near-by (in a contiguous address range). This can be achieved by using
a blocked storage organization for matrix. Is there interest to take a
look at this in uBlas or are there already existing ways to support
this?
Thanks.
Michael Wong
XL C++ Compiler kernel Development
IBM Canada Ltd., C2/KD2/8200/MKM
8200 Warden Avenue
Markham, Ontario L6G 1C7
W:905-413-3283 F:905-413-4839
Boost test results http://www-1.ibm.com/support/docview.wss?rs=2239&context=SSJT9L&uid=swg27006911
C/C++ Compilers Support Page http://www.ibm.com/software/awdtools/ccompilers/support/
C/C++ Feature Request Interface http://www.ibm.com/support/docview.wss?uid=swg27005811
XL Fortran Compiler Support Page http://www.ibm.com/software/awdtools/fortran/xlfortran/support/
XL Fortran Feature Request Interface http://www.ibm.com/support/docview.wss?uid=swg27005812