[None][test] Add TestServePrefixAwareScheduling base on LMBenchmark/synthetic-multi-round-qa by SimengLiu-nv · Pull Request #13182 · NVIDIA/TensorRT-LLM

SimengLiu-nv · 2026-04-19T03:24:46Z

These are the first multi-round tests with meaningful prefix reuse. They aim to catch functional bugs across all configuration combinations of the scheduler.

Caught SWA + prefix-aware bugs during test development. A fix is proposed in cpp/tensorrt_llm/batch_manager/kvCacheManager.cpp.

Tests with KV-cache offloading are not added to the test-db because they hit known bugs. They will be enabled once the fix is merged.
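
For readers unfamiliar with the workload shape, here is a minimal sketch of what "multi-round with prefix reuse" means, assuming each round's prompt is the previous prompt plus the previous answer and a new question. The helper names (`build_rounds`, `shared_prefix_len`) and the option names in `config_grid` are hypothetical illustrations. They are not the test's actual code or TensorRT-LLM's configuration keys. The comparison is character-level for simplicity, whereas a real KV cache reuses token blocks.

```python
from itertools import product


def build_rounds(system_prompt, questions):
    """Return the prompt sent at each round; each prompt extends the previous one."""
    prompts = []
    history = system_prompt
    for i, question in enumerate(questions):
        history = f"{history}\nUser: {question}"
        prompts.append(history)
        # Hypothetical placeholder standing in for the model's answer.
        history = f"{history}\nAssistant: <answer {i}>"
    return prompts


def shared_prefix_len(a, b):
    """Length of the common leading substring of a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


prompts = build_rounds("You are a helpful assistant.", ["Q1?", "Q2?", "Q3?"])

# Each later round starts with the full text of the previous round's prompt,
# which is the part a prefix-aware scheduler could serve from cache.
full_reuse = [
    shared_prefix_len(prompts[i], prompts[i + 1]) == len(prompts[i])
    for i in range(len(prompts) - 1)
]

# Hypothetical option names: enumerate every on/off combination to test.
option_names = ("sliding_window_attention", "kv_cache_offloading")
config_grid = [
    dict(zip(option_names, combo))
    for combo in product((False, True), repeat=len(option_names))
]
```

Enumerating the grid with `itertools.product` keeps the combinations exhaustive as options are added. A combination with known failures can be skipped explicitly rather than silently omitted.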

…ynthetic-multi-round-qa
Signed-off-by: Simeng Liu <simengl@nvidia.com>

SimengLiu-nv requested review from eopXD, lancelly, liji-nv and thorjohnsen April 19, 2026 03:24

SimengLiu-nv requested a review from a team as a code owner April 19, 2026 03:24

github-actions bot assigned SimengLiu-nv Apr 19, 2026

SimengLiu-nv changed the title ~~[NONE][test] Add TestServePrefixAwareScheduling base on LMBenchmark/synthetic-multi-round-qa~~ [None][test] Add TestServePrefixAwareScheduling base on LMBenchmark/synthetic-multi-round-qa Apr 19, 2026

SimengLiu-nv merged commit a7ccae8 into NVIDIA:feat/bench_y Apr 20, 2026
6 of 9 checks passed

SimengLiu-nv merged 1 commit into NVIDIA:feat/bench_y from SimengLiu-nv:prefix-aware-tests
