Fine-tuned MPI benchmark of schoolbook matrix multiplication in i-k-j loop order, with synchronous and asynchronous send/recv options
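
For readers unfamiliar with the term, the i-k-j loop order can be sketched as the serial kernel below. This is a minimal illustration, not the benchmark's own code: the function name `matmul_ikj`, the row-major square layout, and the accumulate-into-`c` convention are all assumptions, and the MPI send/recv part is left out entirely.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not taken from the benchmark.
 * Computes C += A * B for row-major n x n matrices using the
 * schoolbook algorithm with loops nested in i-k-j order.
 * The caller is expected to zero c first for a plain product. */
void matmul_ikj(size_t n, const double *a, const double *b, double *c)
{
    assert(a != NULL && b != NULL && c != NULL);

    for (size_t i = 0; i < n; i++) {
        for (size_t k = 0; k < n; k++) {
            /* a[i][k] is loaded once and reused across the whole j loop. */
            const double aik = a[i * n + k];
            for (size_t j = 0; j < n; j++) {
                /* Row k of b and row i of c are both walked with unit stride. */
                c[i * n + j] += aik * b[k * n + j];
            }
        }
    }
}
```

Compared with the textbook i-j-k order, placing `j` innermost means the inner loop reads `b` and writes `c` along contiguous rows instead of striding down a column of `b`, which is generally friendlier to the cache for row-major storage.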