Built-in llama-cpp provider via inline ExtensionFactory by julien-c · Pull Request #4823 · earendil-works/pi

julien-c · 2026-05-20T18:02:59Z

Built-in llama-cpp provider, activated when any LLAMA_* env var is set (LLAMA_BASE_URL, LLAMA_CACHE, or LLAMA_ARG_*). Shipped as an inline ExtensionFactory so it can use the existing extension event hooks.

Discovers models from ${baseUrl}/models at startup and on /model, then refines per-model contextWindow via ${server}/props on first use.

Test plan

No LLAMA_* set: factory not instantiated, pi unchanged.
LLAMA_BASE_URL + running llama-server: provider appears, models list, streams.
Server down: surfaces a notify error, other providers still work.

Co-Authored-By: julien-agent <Agents+cyolo@huggingface.co>

Add ExtensionFactoryEntry union so inline factories can carry a path, thread "<built-in:llama-cpp>" through the resource loader, and strip the <built-in:NAME> wrapping in interactive-mode label helpers. Co-Authored-By: julien-agent <Agents+cyolo@huggingface.co>

…p list" This reverts commit 8d92537.

…veat Built-in slash commands like /model are intercepted by the interactive editor before emitInput runs, so the pi.on("input") handler never sees them. Surface failures via ctx.ui.notify when ctx is available, and leave a comment pointing at a future "model_selector_open" event as the proper trigger to refresh the model list. Co-Authored-By: julien-agent <Agents+cyolo@huggingface.co>

julien-c · 2026-05-20T18:13:15Z

Hi @badlogic does this approach of hooking a built-in provider for llama-cpp (to make local models as seamless as remote ones 🙏 ) based on a few possible env vars, look reasonable to you, or not really? (cc @hanouticelina with whom we've worked on this)

badlogic · 2026-05-21T10:22:20Z

i'll need a few days to find time to think about and review this. directionally, i think it is right. not sure about env var detection yet. could be a setting instead.

julien-c and others added 6 commits May 20, 2026 19:26

WIP

bb8d5aa

Co-Authored-By: julien-agent <Agents+cyolo@huggingface.co>

Use exported ProviderModelConfig type in llama-cpp factory

97cdc04

Co-Authored-By: julien-agent <Agents+cyolo@huggingface.co>

Skip llama-cpp factory instantiation when no LLAMA_* env is set

e4b8726

Co-Authored-By: julien-agent <Agents+cyolo@huggingface.co>

Revert "Display built-in llama-cpp extension as "llama-cpp" in startu…

0c513e9

…p list" This reverts commit 8d92537.

julien-c marked this pull request as ready for review May 20, 2026 18:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Built-in llama-cpp provider via inline ExtensionFactory#4823

Built-in llama-cpp provider via inline ExtensionFactory#4823
julien-c wants to merge 6 commits into
earendil-works:mainfrom
julien-c:builtin-llama-cpp-provider

julien-c commented May 20, 2026 •

edited

Loading

Uh oh!

julien-c commented May 20, 2026

Uh oh!

badlogic commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

julien-c commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test plan

Uh oh!

julien-c commented May 20, 2026

Uh oh!

badlogic commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

julien-c commented May 20, 2026 •

edited

Loading