-
Notifications
You must be signed in to change notification settings - Fork 571
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[PyTorch] Change order of args in another permutation triton kernel
#2488
opened Dec 9, 2025 by
tdophung
Loading…
6 of 13 tasks
Add separate RNG states for column-wise quantization with Stochastic Rounding
#2487
opened Dec 8, 2025 by
negvet
Loading…
1 of 13 tasks
Add logic for block-scaled tensors with GEMM swizzled scales
enhancement
New feature or request
refactor
[JAX] Remove unused TE DPA module dtype which fixes cuDNN backend detection to properly use input dtypes
#2485
opened Dec 5, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[PyTorch] Add THD support for max_logit/MuonClip
2.11.0
#2480
opened Dec 4, 2025 by
cyanguwa
Loading…
8 of 13 tasks
[JAX] Estimate post-RHT amax using regular amax
#2479
opened Dec 4, 2025 by
jberchtold-nvidia
•
Draft
13 tasks
Add support for SWA (left, right) with FusedAttention
2.11.0
#2477
opened Dec 4, 2025 by
sudhakarsingh27
Loading…
22 of 28 tasks
fix ce loss calculation when some tokens are ignored
bug
Something isn't working
#2476
opened Dec 4, 2025 by
yashaswikarnati
Loading…
1 of 13 tasks
[Draft] Jax primitives for permutation on single GPU
#2473
opened Dec 3, 2025 by
tdophung
Loading…
13 tasks
[PyTorch] Documentation for op fuser API
documentation
Improvements or additions to documentation
#2447
opened Dec 3, 2025 by
timmoon10
Loading…
8 of 13 tasks
Fix transformer 2.9.0 (torch 2.9.1 used by SGLang 0.5.5) build
#2445
opened Dec 2, 2025 by
yiakwy-xpu-ml-framework-team
Loading…
13 tasks
[JAX] Better error message when Q, K, V are sharded differently
#2440
opened Dec 2, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[PyTorch] Convert sample tuple to list in cudagraph input reuse
#2426
opened Nov 26, 2025 by
buptzyb
Loading…
13 tasks
[JAX] Add tutorial for integrating TE/JAX quantization into an existing framework
#2423
opened Nov 26, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.