[Feature] Make QDP CUDA kernel build targets configurable and future-compatible by viiccwen · Pull Request #1283 · apache/mahout

viiccwen · 2026-04-22T17:26:31Z

Related Issues

Changes

Why

QDP's CUDA kernel build currently relies on a small hardcoded set of nvcc -gencode targets. That makes it awkward to keep existing GPUs working while also supporting newer architectures such as Blackwell, and it forces follow-up source edits whenever the desired target mix changes.

This change makes architecture targeting configurable and toolchain-aware so one build configuration can remain usable across current and newer NVIDIA GPU generations without changing QDP runtime behavior.

How

derive the default cubin/PTX target set from a project shortlist filtered by the local nvcc supported architecture lists
add QDP_CUDA_ARCH_LIST as an explicit override for local builds, CI, and packaging workflows
preserve a legacy fallback when nvcc does not expose architecture listing flags
validate the change locally on both Ada and Blackwell GPUs and rerun targeted QDP GPU tests

Checklist

Added or updated unit tests for all changes
Added or updated documentation for all changes

viiccwen · 2026-04-22T17:43:21Z

cc @ryankert01, @rich7420, need testing on ur local machine.
Already testing in both RTX 4090 and newer architectures Pro 6000 Blackwell. It's all work well.

Copilot

Pull request overview

This PR updates qdp-kernels’ CUDA build script to make nvcc -gencode architecture targeting configurable and to better accommodate newer NVIDIA GPU architectures without requiring source edits.

Changes:

Introduces an architecture-target model for nvcc -gencode flags (SM and optional PTX targets).
Adds QDP_CUDA_ARCH_LIST to explicitly override the CUDA target set at build time.
Attempts to derive default targets from nvcc’s supported architecture lists, with a legacy fallback when listing flags are unavailable.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

viiccwen · 2026-04-23T15:20:11Z

@ryankert01 sounds like getting-started.md. I'll woke on it.

ryankert01 · 2026-04-23T15:36:09Z

sorry @viiccwen, I thought we have a section say we only support some Nvidia gpus (30, 40), but we turns out don't have this section. So, I think it's not needed.

Also, just a heads up, the 30s gpu server that I have access to is currently compromised. So I might not be able to test it anywhere soon.

400Ping · 2026-04-24T01:11:29Z

Agree on writing a section on the current supporting GPUs(Including Nvidia and AMD).

viiccwen · 2026-04-24T06:39:24Z

Goti it, I think a backend-level section is more accurate than listing specific GPU models.

For NVIDIA, QDP does not maintain a fixed supported-SKU whitelist. The actual CUDA targets generated by the build depend on the installed CUDA toolkit and the local nvcc supported architectures, with QDP selecting from its default architecture shortlist.

For AMD, support similarly depends on the local ROCm environment and the Triton backend used by QDP, rather than a hardcoded model list in the repo.

Because of that, I plan to document this as a “Supported GPU Backends” section instead of list a set of supported GPU models.

This reverts commit 9062a4b.

ryankert01

tested!

viiccwen requested review from 400Ping, guan404ming and ryankert01 as code owners April 22, 2026 17:26

viiccwen force-pushed the feature/qdp-cuda-arch-targets branch 2 times, most recently from 19080c0 to d9b32ce Compare April 22, 2026 17:41

viiccwen force-pushed the feature/qdp-cuda-arch-targets branch from d9b32ce to 85c801a Compare April 23, 2026 03:21

ryankert01 requested a review from Copilot April 23, 2026 07:48

Copilot started reviewing on behalf of ryankert01 April 23, 2026 07:48 View session

Copilot AI reviewed Apr 23, 2026

View reviewed changes

Comment thread qdp/qdp-kernels/build.rs

Comment thread qdp/qdp-kernels/build.rs

Comment thread qdp/qdp-kernels/build.rs

viiccwen force-pushed the feature/qdp-cuda-arch-targets branch 2 times, most recently from 78b1590 to 4767bd0 Compare April 23, 2026 15:01

viiccwen added 4 commits May 3, 2026 21:07

Make QDP CUDA kernel build targets configurable

2fab5e3

docs(qdp): document CUDA architecture target override

d935cf5

Revert "docs(qdp): document CUDA architecture target override"

46e536f

This reverts commit 9062a4b.

docs(qdp): add supported GPU backends section

17f2759

ryankert01 force-pushed the feature/qdp-cuda-arch-targets branch from 57d0197 to 17f2759 Compare May 3, 2026 13:07

ryankert01 reviewed May 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Make QDP CUDA kernel build targets configurable and future-compatible#1283

[Feature] Make QDP CUDA kernel build targets configurable and future-compatible#1283
viiccwen wants to merge 4 commits intoapache:mainfrom
viiccwen:feature/qdp-cuda-arch-targets

viiccwen commented Apr 22, 2026 •

edited

Loading

Uh oh!

viiccwen commented Apr 22, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

viiccwen commented Apr 23, 2026 •

edited

Loading

Uh oh!

ryankert01 commented Apr 23, 2026 •

edited

Loading

Uh oh!

400Ping commented Apr 24, 2026

Uh oh!

viiccwen commented Apr 24, 2026 •

edited

Loading

Uh oh!

ryankert01 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

viiccwen commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related Issues

Changes

Why

How

Checklist

Uh oh!

viiccwen commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

viiccwen commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryankert01 commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

400Ping commented Apr 24, 2026

Uh oh!

viiccwen commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryankert01 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

viiccwen commented Apr 22, 2026 •

edited

Loading

viiccwen commented Apr 22, 2026 •

edited

Loading

viiccwen commented Apr 23, 2026 •

edited

Loading

ryankert01 commented Apr 23, 2026 •

edited

Loading

viiccwen commented Apr 24, 2026 •

edited

Loading