 multiprecision::powm() with unchecked uints is _much_ slower (i.e. I can
 actually perceive the time it takes for the function to return on a 4GHz
 i5 in release builds) than, say, the equivalent libtomcrypt/math operation.
 I'm guessing the major reason for this is the CRT (Chinese Remainder
 Theorem) optimisation (or lack thereof in multiprecision).
 So, can you implement a CRT-'enabled' powm overload (I presume this would
 also require a function for factoring a large multiprecision uint into its
 dp, dq, etc. factors)?
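
 To make the request concrete, here is a minimal sketch of what a CRT-enabled
 powm could look like. It is not Boost.Multiprecision code: it uses plain
 64-bit integers so it stays self-contained, and the names crt_key,
 make_crt_key, powm_plain and powm_crt are hypothetical. It also assumes the
 primes p and q and the exponent d are already known (as they are in an RSA
 private key), since dp, dq and qinv are derived from those values rather than
 obtained by factoring the modulus.

```cpp
#include <cassert>
#include <cstdint>

using u64 = std::uint64_t;

// Modular multiplication. Assumption: m < 2^32, so the product of two
// reduced operands fits in 64 bits without overflow.
static u64 mulmod(u64 a, u64 b, u64 m) { return (a % m) * (b % m) % m; }

// Plain square-and-multiply modular exponentiation (the non-CRT baseline).
static u64 powm_plain(u64 base, u64 exp, u64 mod) {
    u64 result = 1 % mod;
    base %= mod;
    while (exp != 0) {
        if (exp & 1) result = mulmod(result, base, mod);
        base = mulmod(base, base, mod);
        exp >>= 1;
    }
    return result;
}

// Hypothetical container for the precomputed CRT parameters.
struct crt_key {
    u64 p, q;   // distinct odd primes, modulus n = p * q
    u64 dp, dq; // d mod (p - 1), d mod (q - 1)
    u64 qinv;   // q^-1 mod p
};

// Derive the CRT parameters from known p, q and exponent d.
static crt_key make_crt_key(u64 p, u64 q, u64 d) {
    assert(p != q && p > 2 && q > 2);
    // p is prime, so Fermat's little theorem gives q^-1 = q^(p-2) mod p.
    return crt_key{p, q, d % (p - 1), d % (q - 1), powm_plain(q, p - 2, p)};
}

// Computes c^d mod (p * q) using two half-size exponentiations and
// Garner's recombination instead of one full-size exponentiation.
static u64 powm_crt(u64 c, const crt_key& k) {
    u64 m1 = powm_plain(c, k.dp, k.p);
    u64 m2 = powm_plain(c, k.dq, k.q);
    u64 diff = (m1 + k.p - m2 % k.p) % k.p; // (m1 - m2) mod p, kept non-negative
    u64 h = mulmod(k.qinv, diff, k.p);
    return m2 + h * k.q;
}
```

 With the textbook toy parameters p = 61, q = 53, d = 2753, the call
 powm_crt(2790, make_crt_key(61, 53, 2753)) gives the same result as
 powm_plain(2790, 2753, 3233). The speed-up at real key sizes comes from the
 two exponentiations running on half-width operands with half-length
 exponents.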

