-
Notifications
You must be signed in to change notification settings - Fork 187
Pull requests: NVIDIA/cudnn-frontend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
DSA: add q causal offsets and Rubin SM100F support
#316
opened Jun 17, 2026 by
jiayus-nvidia
Contributor
Loading…
fix(sdpa_benchmark): use sampled SM clock + per-arch MMA throughput for SOL%
#314
opened Jun 16, 2026 by
brandonfzhang
Contributor
Loading…
Test softmax stats outputs in various permutations
#304
opened Jun 11, 2026 by
egilliam-nv
Contributor
•
Draft
experimental: grouped gemm AOT
cat-feature
Requests for new functionality, APIs, examples, or behavior improvements.
mod-frontend
cuDNN frontend APIs, operation graph construction, plans, and user-facing wrappers.
orig-nv-eng
Reported or requested by NVIDIA engineering.
Use UID-based graph JSON v2 for repro extraction
cat-infra
Build, packaging, tooling, dependency, release, or repository maintenance work.
mod-frontend
cuDNN frontend APIs, operation graph construction, plans, and user-facing wrappers.
orig-nv-eng
Reported or requested by NVIDIA engineering.
Vedaanta/sdpa d256 cudnn 9 23 bypass oss
cat-enhancements
mod-frontend
cuDNN frontend APIs, operation graph construction, plans, and user-facing wrappers.
orig-nv-eng
Reported or requested by NVIDIA engineering.
Add linear attention API
cat-feature
Requests for new functionality, APIs, examples, or behavior improvements.
#268
opened Jun 2, 2026 by
jhjpark
Loading…
Add NWH + B2B causal conv1d notebooks; refresh outputs
#246
opened May 20, 2026 by
yeliu-oss
Loading…
ProTip!
Exclude everything labeled
bug with -label:bug.