Skip to content

[None][chore] Update flashinfer-python from 0.6.6 to 0.6.8rc1#13064

Open
yihwang-nv wants to merge 21 commits intoNVIDIA:mainfrom
yihwang-nv:yihwang-nv/update_flashinfer_0.6.8rc1
Open

[None][chore] Update flashinfer-python from 0.6.6 to 0.6.8rc1#13064
yihwang-nv wants to merge 21 commits intoNVIDIA:mainfrom
yihwang-nv:yihwang-nv/update_flashinfer_0.6.8rc1

Conversation

@yihwang-nv
Copy link
Copy Markdown
Collaborator

@yihwang-nv yihwang-nv commented Apr 15, 2026

Summary

  • Bump flashinfer-python from 0.6.6 to 0.6.8rc1
  • Bump nvidia-cutlass-dsl from 4.3.4 to 4.4.2 (required by flashinfer-python 0.6.8rc1 >=4.4.2)
  • Add nvidia-cutlass-dsl-libs-base 4.4.2 to poetry.lock (new transitive dependency)
  • Updated version pins in requirements.txt, security_scanning/pyproject.toml, security_scanning/poetry.lock, and ATTRIBUTIONS-Python.md

Test plan

  • pip install -r requirements.txt installs successfully
  • pytest tests/unittest/_torch/flashinfer/ -v
  • pytest tests/unittest/_torch/attention/test_flashinfer_attention.py -v
  • CI pre-merge passes

Summary by CodeRabbit

Chores

  • Updated CUDA-related package dependencies to newer versions for improved compatibility and performance.

Bump flashinfer-python dependency to 0.6.8rc1.
Also update nvidia-cutlass-dsl from 4.3.4 to 4.4.2 (required by flashinfer >=4.4.2).
Updated version pins in requirements.txt, security_scanning/pyproject.toml,
security_scanning/poetry.lock, and ATTRIBUTIONS-Python.md.

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv yihwang-nv requested a review from a team as a code owner April 15, 2026 04:25
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --stage-list "Build-Docker-Images"

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai bot commented Apr 15, 2026

📝 Walkthrough

Walkthrough

This change updates two CUDA-related package dependencies across three configuration files: flashinfer-python from 0.6.6 to 0.6.8rc1 and nvidia-cutlass-dsl to 4.4.2 across the project.

Changes

Cohort / File(s) Summary
Dependency Version Updates
ATTRIBUTIONS-Python.md, requirements.txt, security_scanning/pyproject.toml
Updated flashinfer-python to 0.6.8rc1 and nvidia-cutlass-dsl to 4.4.2 across attribution records, main requirements, and security scanning configuration.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
Check name Status Explanation
Title check ✅ Passed The title clearly and specifically summarizes the main change: updating flashinfer-python from 0.6.6 to 0.6.8rc1, which aligns with the primary focus of the changeset.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Description check ✅ Passed The PR description clearly explains the changes (version bumps), rationale (compatibility requirements), and provides comprehensive test coverage details.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@requirements.txt`:
- Line 57: requirements.txt currently pins flashinfer-python==0.6.8rc1 which is
not on PyPI; either revert the pin to a published version (e.g., 0.6.7.post3) or
add the FlashInfer nightly wheel index to your install config (e.g., add
--extra-index-url https://flashinfer.ai/whl/nightly/ in CI and developer docs)
and update CI/pip config accordingly; also run compatibility tests and review
tensorrt_llm/_torch/attention_backend/flashinfer.py plus any callers for API
changes between 0.6.6/0.6.7 and the nightly to ensure no breaking changes before
keeping 0.6.8rc1.

In `@security_scanning/pyproject.toml`:
- Line 58: Replace the non-existent dependency "flashinfer-python (==0.6.8rc1)"
in pyproject.toml with an available release (e.g., "flashinfer-python
(==0.6.7.post3)"), then regenerate any lockfiles (poetry lock / pip-compile) and
verify installation to ensure the package resolves correctly.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 8c184ec0-702c-44f0-a56a-0484b3a8d283

📥 Commits

Reviewing files that changed from the base of the PR and between d09ed1e and 43dd843.

⛔ Files ignored due to path filters (1)
  • security_scanning/poetry.lock is excluded by !**/*.lock
📒 Files selected for processing (3)
  • ATTRIBUTIONS-Python.md
  • requirements.txt
  • security_scanning/pyproject.toml

Comment thread requirements.txt Outdated
Comment thread security_scanning/pyproject.toml Outdated
@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43382 [ run ] triggered by Bot. Commit: 43dd843 Link to invocation

Force-reinstall nvidia-cutlass-dsl and nvidia-cutlass-dsl-libs-base in
the Docker build to replace the stale 4.3.5 from the base image with
4.4.2. Add nvidia-cutlass-dsl>=4.4.2 to constraints.txt.

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv yihwang-nv requested review from a team as code owners April 15, 2026 04:33
@yihwang-nv yihwang-nv requested review from niukuo and venkywonka April 15, 2026 04:33
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --stage-list "Build-Docker-Images"

Force-reinstall nvidia-cutlass-dsl and nvidia-cutlass-dsl-libs-base in
the Docker build to replace the stale 4.3.5 from the base image with
4.4.2. Add nvidia-cutlass-dsl>=4.4.2 to constraints.txt.

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --stage-list "Build-Docker-Images"

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43382 [ run ] completed with state ABORTED. Commit: 43dd843

Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43386 [ run ] triggered by Bot. Commit: f3b96ee Link to invocation

Point current_image_tags.properties to the CI tritondevel images
built from PR NVIDIA#13064 (flashinfer + nvidia-cutlass-dsl upgrade).

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv yihwang-nv requested a review from a team as a code owner April 15, 2026 06:15
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43410 [ run ] triggered by Bot. Commit: 3ba87d1 Link to invocation

The DLFW base image (pytorch:26.02-py3) ships nvidia-cutlass-dsl 4.3.5.
When pip upgrades to 4.4.2 in-place, it corrupts shared namespace dirs.
Add explicit uninstall + rm -rf cleanup before tensorrt_llm wheel install.

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43459 [ run ] triggered by Bot. Commit: fcfbc7e Link to invocation

This reverts commit 0dd321a59fc067e5fd3124f1fb5c6b8aba1d7ad3.
pip installs dependency packages (nvidia-cutlass-dsl-libs-base) before
uninstalling the old meta-wheel (nvidia-cutlass-dsl).  Since both write
to the same nvidia_cutlass_dsl/ directory, the uninstall step removes
files that the deps just installed, breaking the package.

Add scripts/clean_site_packages.py that uninstalls known problematic
packages and removes leftover site-packages fragments before install.
Call it from test_pip_install.py before both wheel and editable installs.

This avoids Docker image changes — the cleanup runs at CI test time.

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43474 [ run ] triggered by Bot. Commit: 4ae0955 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43459 [ run ] completed with state ABORTED. Commit: fcfbc7e

Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #43474 [ run ] completed with state SUCCESS. Commit: 4ae0955
/LLM/main/L0_MergeRequest_PR pipeline #33993 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

…lass-dsl 4.4.2

PipelineAsync.producer_tail is decorated with @dsl_user_op in
cutlass-dsl 4.4.2 and forwards loc/ip kwargs to producer_acquire. The
overrides in custom_pipeline.py did not accept these, raising
DSLRuntimeError in test_fp4_linear_cute_dsl. Add loc=None, ip=None
keyword-only parameters to producer_acquire, producer_commit,
consumer_release, and producer_tail across PipelineTmaUmma,
PipelineUmmaAsync, and PipelineCpAsyncUmma, and thread them through to
the inner sync_object and cute.arch calls.

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44129 [ run ] triggered by Bot. Commit: 82b22a8 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44129 [ run ] completed with state SUCCESS. Commit: 82b22a8
/LLM/main/L0_MergeRequest_PR pipeline #34556 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44179 [ run ] triggered by Bot. Commit: 82b22a8 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44179 [ run ] completed with state FAILURE. Commit: 82b22a8
/LLM/main/L0_MergeRequest_PR pipeline #34606 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44259 [ run ] triggered by Bot. Commit: 82b22a8 Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --stage-list "Build-Docker-Images"

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44259 [ run ] completed with state FAILURE. Commit: 82b22a8
/LLM/main/L0_MergeRequest_PR pipeline #34680 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44264 [ run ] triggered by Bot. Commit: 82b22a8 Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44302 [ run ] triggered by Bot. Commit: 31da856 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44302 [ run ] completed with state FAILURE. Commit: 31da856
/LLM/main/L0_MergeRequest_PR pipeline #34724 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44335 [ run ] triggered by Bot. Commit: 31da856 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44335 [ run ] completed with state FAILURE. Commit: 31da856
/LLM/main/L0_MergeRequest_PR pipeline #34753 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44367 [ run ] triggered by Bot. Commit: 31da856 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44367 [ run ] completed with state FAILURE. Commit: 31da856
/LLM/main/L0_MergeRequest_PR pipeline #34784 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

Wanli-Jiang pushed a commit to Wanli-Jiang/TensorRT-LLM that referenced this pull request Apr 20, 2026
Point current_image_tags.properties to the CI tritondevel images
built from PR NVIDIA#13064 (flashinfer + nvidia-cutlass-dsl upgrade).

Signed-off-by: Yihan Wang <yihwang@nvidia.com>
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44387 [ run ] triggered by Bot. Commit: 31da856 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #44387 [ run ] completed with state SUCCESS. Commit: 31da856
/LLM/main/L0_MergeRequest_PR pipeline #34802 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

Link to invocation

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants