Parallel Completions
Overview
Llama-vscode generates several code completions in parallel (3 by default) when used with a llama.cpp version from after December 6, 2025 (commit c42712b). Press Ctrl+] to show the next completion and Ctrl+[ to show the previous one.
The setting max_parallel_completions determines how many completions are generated.
How to use it
- Run the completion model and start coding
- When a code completion is shown, press Ctrl+] to show the next completion or Ctrl+[ to show the previous one
- Alternatively, hover over the displayed completion and, when the toolbar appears, click the arrows to show the other completions
Settings
- max_parallel_completions: The maximum number of completions to generate in parallel. Default is 3.
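As a minimal sketch, the setting could be changed in the VS Code user or workspace settings.json as shown below. The `llama-vscode.` prefix on the key is an assumption based on how VS Code extensions usually namespace their settings, so check the extension's entry in the Settings UI for the exact name. The value 5 is only an illustration.

```jsonc
{
  // Assumed key name: the "llama-vscode." prefix is not confirmed by this page.
  // Raises the number of parallel completions from the default of 3 to 5.
  "llama-vscode.max_parallel_completions": 5
}
```

A higher value gives more alternatives to cycle through with Ctrl+] and Ctrl+[, but it presumably increases the work the llama.cpp server does for each request.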