The easiest way to manage your kernel settings
-
Updated
Mar 24, 2023 - Java
The easiest way to manage your kernel settings
Custom PyTorch CUDA kernel implementing optimized ReLU activation with vectorization, performance profiling, and memory analysis on Tesla T4 GPU achieving 75% bandwidth efficiency.
Add a description, image, and links to the kernel-profiler topic page so that developers can more easily learn about it.
To associate your repository with the kernel-profiler topic, visit your repo's landing page and select "manage topics."