Matrix Multiplication in Java

NVIDIA cuTile Python Guide Shows 90% cuBLAS Performance for Matrix Ops

NVIDIA releases detailed cuTile Python tutorial for Blackwell GPUs, demonstrating matrix multiplication achieving over 90% of cuBLAS performance with simplified code. NVIDIA has published a ...

IEEE

ScanNow: A Scan Window-Based Sparse Matrix Multiplication Accelerator Design

Sparse matrix-matrix multiplication (SpMM) is a crucial kernel in various applications, including sparse deep neural networks [1]–[6], graph analytics [7], triangle counting [8], and linear algebra ...

BBC

Guardians: Defenders of Mathematica - KS2 Maths

Choose a character, then add, subtract, divide and multiply numbers to battle your way to victory in this fun maths game. Challenge yourself across 11 KS2 maths topics from the Mysterious Mountains of ...

GitHub

tritonBLAS: A Lightweight Triton-based General Matrix Multiplication (GEMM) Library

This project is intended for research purposes only. Use it at your own risk and discretion. Triton is a language and compiler for writing highly efficient ML primitives, one of the most common ...

IEEE

MH-SpGEMM: Efficient Sparse General Matrix-Matrix Multiplication on Modern GPUs via Masking and Hashing Cooperative Optimization

Abstract: Sparse General Matrix-Matrix Multiplication (SpGEMM) is a core operation in high-performance computing applications such as algebraic multigrid solvers, machine learning, and graph ...

GitHub

KAMI: Communication-Avoiding General Matrix Multiplication within a Single GPU

This repository contains the artifact for the SC '25 paper submission "KAMI: Communication-Avoiding General Matrix Multiplication within a Single GPU." The NVIDIA GH200 is installed with Ubuntu 22.04 ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results