Pull requests: NVIDIA/TransformerEngine

Pull requests list

[PyTorch] Change order of args in another permutation triton kernel
#2488 opened Dec 9, 2025 by tdophung
6 of 13 tasks
Add logic for block-scaled tensors with GEMM swizzled scales (labels: enhancement, refactor)
#2486 opened Dec 6, 2025 by timmoon10 Draft
4 of 17 tasks
Fix the sm120 compilation with CUDA 12
#2482 opened Dec 5, 2025 by ptrendx
1 of 13 tasks
[PyTorch] Add THD support for max_logit/MuonClip (label: 2.11.0)
#2480 opened Dec 4, 2025 by cyanguwa
8 of 13 tasks
Add support for SWA (left, right) with FusedAttention (label: 2.11.0)
#2477 opened Dec 4, 2025 by sudhakarsingh27
22 of 28 tasks
fix ce loss calculation when some tokens are ignored (label: bug)
#2476 opened Dec 4, 2025 by yashaswikarnati
1 of 13 tasks
[JAX] Einsum with quantization
#2474 opened Dec 3, 2025 by phu0ngng Draft
13 tasks
[Draft] Jax primitives for permutation on single GPU
#2473 opened Dec 3, 2025 by tdophung
13 tasks
[PyTorch] Documentation for op fuser API (label: documentation)
#2447 opened Dec 3, 2025 by timmoon10
8 of 13 tasks
Add ccache support to TE and use it in GitHub actions
#2444 opened Dec 2, 2025 by ptrendx Draft
1 of 6 tasks
[PyTorch] Enable post-RHT amax estimation
#2442 opened Dec 2, 2025 by negvet
1 of 13 tasks
[pyTorch] CPU performance optimizations
#2439 opened Dec 1, 2025 by ptrendx Draft
13 tasks
support cuda graph capture offloading module
#2435 opened Dec 1, 2025 by lhb8125 Draft
13 tasks
[PyTorch] Add FA4 Support
#2432 opened Nov 28, 2025 by yaox12 Draft
1 of 16 tasks
[PyTorch] Convert sample tuple to list in cudagraph input reuse
#2426 opened Nov 26, 2025 by buptzyb
13 tasks
Fix FusedAdam DTensor compatibility issue
#2425 opened Nov 26, 2025 by shjwudp
13 tasks
[JAX] Wrapper for Permutation Triton kernel MoE
#2419 opened Nov 25, 2025 by tdophung Draft
9 of 16 tasks
[Common] Add kFloat64 partial support
#2417 opened Nov 24, 2025 by phu0ngng
7 of 13 tasks