W.r.t. the interfaces, I think we could move the detection of the uplo tags
into the free functions, which would let us offer both interfaces, with and
without the uplo/diag arguments. We could then end up with low-level drivers
that support both type-decorated matrices and true-triangular matrices. I.e.,
something like