Fetch dgbmv.f from Netlib/BLAS.
Write a MEX-file in C (you may use a wrapper
m-file as
well, if you like). You do not have to
implement all the capabilities of dgbmv.f,
it is sufficient that the MEX-file can handle y = B * x, where x and y
are vectors and B is a square band matrix. It should be possible for
the upper
and lower bandwidths to be different. Call your routine from Matlab and test it on some simple examples so you can see that everything works correctly. Run your routine from Matlab and check the speedup compared to the sparse version. You must probably use faster BLAS (see below) to get any speedup. |