Matrix multiplication

This used to be in the PyOpenCL distribution, but was moved here for license concerns. [[!table header="no" class="mointable" data=""" License of this example: | (unclear, likely non-free) Date: | 2013-09-15 PyOpenCL version: | 2013.1 OpenCL implementations (and versions) tried: | all but Apple CPU """]]