common : only load backends when required by angt · Pull Request #22290 · ggml-org/llama.cpp

angt · 2026-04-23T13:10:43Z

Overview

Only load backends when required.

Additional information

This fixe the following issues: #20186, #21708 and maybe others.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES (kimi): I've asked the list of functions to modify for a minimal patch

ggerganov · 2026-04-28T14:51:40Z

The test-thread-safety is failing

ggerganov · 2026-04-28T15:32:59Z

Hm, I don't see where we would call ggml_backend_load_all() for llama-cli, llama-server, etc. Building with -DGGML_BACKEND_DL=ON makes these binaries not load any backends.

angt · 2026-04-28T15:55:02Z

Hm, I don't see where we would call ggml_backend_load_all() for llama-cli, llama-server, etc. Building with -DGGML_BACKEND_DL=ON makes these binaries not load any backends.

Damn, checking. I don’t use AI often (for llama.cpp!). Kimi is this bad ? 🤣

angt · 2026-04-28T16:27:28Z

The last call was missing...
Edit: So Kimi is fine, it was my fault 🙄

angt · 2026-04-30T07:32:13Z

@ggerganov maybe I can take this PR as an opportunity to remove ggml_backend_load_all from common_params_parser_init and call it explicitly where it’s needed, instead of hiding it there?

ggerganov · 2026-04-30T07:55:41Z

@ggerganov maybe I can take this PR as an opportunity to remove ggml_backend_load_all from common_params_parser_init and call it explicitly where it’s needed, instead of hiding it there?

Yes, let's do that.

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

angt · 2026-04-30T12:29:07Z

I finally have updated llama_backend_init() to call ggml_backend_load_all(). The name was a perfect fit, so I decided to try this, but I’m not yet 100% convinced too, and maybe we should just call ggml_backend_load_all() everywhere...

rgerganov · 2026-05-06T07:47:56Z

This patch broke RPC (issue #22721) and introduced obscure regressions (issue #22748). Having multiple calls to ggml_backend_load_all() is not how loading dynamic backends is supposed to work and breaks many implicit assumptions.

I suggest to revert this patch.

* common : only load backends when required Signed-off-by: Adrien Gallouët <angt@huggingface.co> * llama : call ggml_backend_load_all() directly from llama_backend_init() Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Add ggml_backend_load_all() where llama_backend_init() is not used Signed-off-by: Adrien Gallouët <angt@huggingface.co> --------- Signed-off-by: Adrien Gallouët <angt@huggingface.co>

angt · 2026-05-06T09:26:34Z

I think it’s just because of some missing calls it’s hard to track all of them

angt · 2026-05-06T10:04:23Z

#22752 should fix both issues (and others)

* common : only load backends when required Signed-off-by: Adrien Gallouët <angt@huggingface.co> * llama : call ggml_backend_load_all() directly from llama_backend_init() Signed-off-by: Adrien Gallouët <angt@huggingface.co> * Add ggml_backend_load_all() where llama_backend_init() is not used Signed-off-by: Adrien Gallouët <angt@huggingface.co> --------- Signed-off-by: Adrien Gallouët <angt@huggingface.co>

angt requested a review from a team as a code owner April 23, 2026 13:10

danbev reviewed Apr 23, 2026

View reviewed changes

Comment thread ggml/src/ggml-backend-reg.cpp Outdated

Comment thread common/arg.cpp

github-actions Bot added the ggml changes relating to the ggml tensor library for machine learning label Apr 23, 2026

angt mentioned this pull request Apr 23, 2026

ggml : skip already registered backends and devices #22296

Merged

angt force-pushed the common-only-load-backends-when-required branch from 1952e4b to f2d4b06 Compare April 28, 2026 14:08

angt requested a review from ggerganov as a code owner April 28, 2026 15:09

github-actions Bot added the testing Everything test related label Apr 28, 2026

angt force-pushed the common-only-load-backends-when-required branch from 9027fd9 to 37ac5d5 Compare April 28, 2026 16:26

angt added 3 commits April 30, 2026 12:11

common : only load backends when required

01d11a5

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

llama : call ggml_backend_load_all() directly from llama_backend_init()

4c25b63

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

Add ggml_backend_load_all() where llama_backend_init() is not used

b967482

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

angt force-pushed the common-only-load-backends-when-required branch from 37ac5d5 to b967482 Compare April 30, 2026 12:25

angt requested a review from a team as a code owner April 30, 2026 12:25

github-actions Bot added the examples label Apr 30, 2026

ggerganov approved these changes May 4, 2026

View reviewed changes

ggerganov added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label May 4, 2026

pwilkin approved these changes May 4, 2026

View reviewed changes

danbev approved these changes May 5, 2026

View reviewed changes

angt merged commit bf76ac7 into ggml-org:master May 5, 2026
44 of 46 checks passed

Stoney49th mentioned this pull request May 5, 2026

Eval bug: RPC Backend no longer found #22721

Closed

This was referenced May 6, 2026

common : call ggml_backend_load_all before llama_supports_rpc #22751

Closed

llama : add missing call to ggml_backend_load_all() #22752

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

common : only load backends when required#22290

common : only load backends when required#22290
angt merged 3 commits into
ggml-org:masterfrom
angt:common-only-load-backends-when-required

angt commented Apr 23, 2026

Uh oh!

Uh oh!

Uh oh!

ggerganov commented Apr 28, 2026

Uh oh!

ggerganov commented Apr 28, 2026

Uh oh!

angt commented Apr 28, 2026 •

edited

Loading

Uh oh!

angt commented Apr 28, 2026 •

edited

Loading

Uh oh!

angt commented Apr 30, 2026

Uh oh!

ggerganov commented Apr 30, 2026

Uh oh!

angt commented Apr 30, 2026

Uh oh!

Uh oh!

rgerganov commented May 6, 2026 •

edited

Loading

Uh oh!

angt commented May 6, 2026

Uh oh!

angt commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

angt commented Apr 23, 2026

Overview

Additional information

Requirements

Uh oh!

Uh oh!

Uh oh!

ggerganov commented Apr 28, 2026

Uh oh!

ggerganov commented Apr 28, 2026

Uh oh!

angt commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

angt commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

angt commented Apr 30, 2026

Uh oh!

ggerganov commented Apr 30, 2026

Uh oh!

angt commented Apr 30, 2026

Uh oh!

Uh oh!

rgerganov commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

angt commented May 6, 2026

Uh oh!

angt commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

angt commented Apr 28, 2026 •

edited

Loading

angt commented Apr 28, 2026 •

edited

Loading

rgerganov commented May 6, 2026 •

edited

Loading