In this paper, we focus on three sparse matrix operations that are relevant to machine learning applications: sparse-dense matrix multiplication (SPMM), sampled dense-dense matrix multiplication (SDDMM), and the composition of SDDMM with SPMM, also termed FusedMM. We develop optimized implementations of the SPMM, SDDMM, and FusedMM operations utilizing Intel oneAPI’s Explicit
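To make the three operations concrete, here is a minimal reference sketch in plain Python. It is not the paper's oneAPI implementation. The CSR layout (`indptr`, `indices`, `data`), the function names, and the particular FusedMM formulation (the SDDMM result is used as the sparse operand of an SPMM with `Y`) are assumptions chosen for illustration.

```python
def dot(u, v):
    """Dense dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def spmm(indptr, indices, data, B):
    """SPMM: C = A @ B, with sparse A in CSR form and dense B as a list of rows."""
    k = len(B[0])
    rows = len(indptr) - 1
    C = [[0.0] * k for _ in range(rows)]
    for i in range(rows):
        for p in range(indptr[i], indptr[i + 1]):
            j, a = indices[p], data[p]
            for c in range(k):
                C[i][c] += a * B[j][c]
    return C


def sddmm(indptr, indices, data, X, Y):
    """SDDMM: s_ij = a_ij * <X[i], Y[j]>, computed only at stored entries of A.

    Returns the new CSR value array; the sparsity pattern is unchanged.
    """
    out = [0.0] * len(data)
    for i in range(len(indptr) - 1):
        for p in range(indptr[i], indptr[i + 1]):
            out[p] = data[p] * dot(X[i], Y[indices[p]])
    return out


def fusedmm(indptr, indices, data, X, Y):
    """Assumed FusedMM form: Z = SDDMM(A, X, Y) @ Y in a single pass.

    Each sampled value is consumed immediately, so the intermediate
    sparse matrix is never stored.
    """
    k = len(Y[0])
    rows = len(indptr) - 1
    Z = [[0.0] * k for _ in range(rows)]
    for i in range(rows):
        for p in range(indptr[i], indptr[i + 1]):
            j = indices[p]
            s = data[p] * dot(X[i], Y[j])
            for c in range(k):
                Z[i][c] += s * Y[j][c]
    return Z


# A = [[1, 0, 2],
#      [0, 3, 0]] stored in CSR form
indptr, indices, data = [0, 2, 3], [0, 2, 1], [1.0, 2.0, 3.0]
X = [[1, 0], [0, 1]]          # 2 x 2 dense
Y = [[1, 2], [3, 4], [5, 6]]  # 3 x 2 dense
```

In this sketch the fused version gives the same result as calling `sddmm` and then `spmm`, but it avoids writing and re-reading the intermediate value array, which is the usual motivation for fusing the two kernels.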