Commit graph

79 commits

Author SHA1 Message Date
igardev
a544d93511
API key for getting models list, ultiline field for Edit with AI, qwen3.5 models added
- API key is used (if needed and provided) on getting the list of models for adding OpenAI compatible provider
- Multiline field for Edit with AI
- Qwen3.5 models (2B, 4B, 9B) added in the predefined list  - good for tools and chat
2026-05-04 08:42:01 +03:00
Alexey Mekhanoshin
a73d9498ab
feat: add authorization headers to models fetch request (#180)
Adds support for the Authorization header when fetching the list of models from an OpenAI-compatible provider.
2026-04-30 07:59:35 +03:00
Copilot
f98919badf
Add llama.vscode model provider for GitHub Copilot Chat (#171)
With this change llama.vscode could provide models for VS Code Copilot:
1. Start tools model from llama-vscode (local or external)  
2. In VS Code Copilot show the models list -> Other Models -> Manage Models  
3. Make the models (all models available by the application serving the tools model are shown) you want to use visible (click on the left of the model name)  
4. Select the desired model from Copilot and start using it

Not needed tools from Copilot could be unchecked to reduce contex size if local model is used.
2026-04-29 21:04:33 +03:00
Haafiz
0a588177b7
feat: add configurable debounce for inline completion requests (#164)
Waits for the user to pause typing before sending a request to the server.
Set to 0 (default) to disable.
2026-03-04 08:18:42 +02:00
Haafiz
caa0f9363d
fix: show 'extension is updated' notification only on actual version change (#167)
Add version tracking and update persistence keys
2026-03-04 08:06:32 +02:00
igardev
bf7d0c2892
Subagents (#169)
* Read SOUL.md and USER.md files from project root and add them in the prompt if they exist (similar to OpenClaw).

* - Subagents implemented
- new agent Unit Test Writer
- new tool create_agent
- new agent "Agent creator"

* Update documentation for llama-vscode
2026-03-02 23:52:38 +02:00
crsawyer
3cef9a4947
Add AGENTS.md loading from workspace root (#158)
- Only if `llama-vscode-rules.md` is not present to preserve existing functionality
- TODO: Implement nested AGENTS.md
- See: https://agents.md/

Co-authored-by: Caleb Sawyer <caleb.sawyer@gsdeng.com>
2026-02-18 06:40:16 +02:00
Georgi Gerganov
0182f4fa00
Use fixed slot id for FIM requests (#155) 2026-02-08 23:31:13 +02:00
igardev
dbdc8f523d
Add health check for models (works only with llama.cpp server) (#156)
- Add health check for models (works only with llama.cpp server)
- Visible in environment view
New settings:
- Health_check_interval_s: The interval in seconds for the health check
- Health_check_compl_enabled: Enable/disable health check for completion model
- Health_check_chat_enabled: Enable/disable health check for chat model
- Health_check_embs_enabled: Enable/disable health check for embedding model
- Health_check_tools_enabled: Enable/disable health check for tools model
2026-01-18 22:02:40 +02:00
igardev
a035f96de4
Fix code completion in case max_parallel_completions = 1 or empty. (#150) 2026-01-07 18:44:35 +02:00
igardev
8e42441edc
Generate multiple completions in parallel (#148)
- Setting max_parallel_completions determines how many completions to generate in parallel (default 3)
- Shortcuts - Alt+] - next completion, Alt+[ - previous completion
- Requires llama.cpp after December, 6, 2025 (commit c42712b) but is backword compatible (generates one completion for older versions)
2026-01-05 15:43:33 +02:00
igardev
1d5f9387ab
Enable skills usage (#147)
- Skills (https://agentskills.io/home) could be now parsed by the LLM and added in the prompt
- skills_folder setting determines where are skills descriptions. If empty the <project_folder>/skills folder is used by default
- Anthropic models support skills best. I guess, the open source models will catch up.
2025-12-31 17:36:24 +02:00
igardev
ed3270d08a
Agent todo tool (#146)
- update_todo_list  tool added - now the agent could plan and track the execution of user request by todo items
2025-12-30 19:35:57 +02:00
igardev
e5a56f269d
Image selection for agent context (#140)
* Image selection is now possible in Agent view

* Add qwen3 VL 30B Instruct as a predefined model
2025-11-18 17:37:01 +02:00
igardev
c03e32623e
Adding a default model for agent is not possible (#134)
Now the agent could have a default tools model. If there is such a model, on selecting the agent, the (tools) agent model will also be selected. However, it is still possible to change the tools model, while using the same agent.
2025-11-03 00:12:48 +02:00
Nuno
723432b54a
fix: use correct function for last used env on startup (#133)
Signed-off-by: rare-magma <rare-magma@posteo.eu>
2025-11-02 23:49:21 +02:00
igardev
7a57e41c05 test 2025-10-31 18:22:32 +02:00
igardev
73eaa326e6
- Show error in agent view UI in case of error on editing file (#131)
- Show error in agent view UI in case of error on editing file
- Other bug fixes
2025-10-27 02:47:50 +02:00
stoperro
5e565da06b
Working multi-file edit PoC. (#130) 2025-10-25 21:34:08 +03:00
igardev
b91585925d
Edit Agent View (#128)
- Edit Agent view added
- Add model from OpenAI compatible provider
2025-10-14 14:48:28 +03:00
igardev
889c5f6d57
Remove the todo from the agent view (#127)
Co-authored-by: igardev <ivailo.gardev@akros.ch>
2025-10-08 17:44:06 +03:00
igardev
43416b7d3c
Bug fixes (#126)
- Fix the embeddings ranking - the wrong query was used
- Removed the free DeepSeek model from open router as not working now
- Setting endpoint_tools is now enought to start agent
- Agent View is now not hidden if the tools model is deselected
- Refactoring
2025-10-08 16:59:23 +03:00
igardev
13bf27699e
Menu refactor and Ask agent added (#124)
* menu.ts is refactored - services classes are extracted

* - Refactor menu.ts model - extract services
- agent "Ask" added for questions about the project without changing the files
- predefiled free models from OpenRouter added (and xAi removed as not free anymore)
- Some bugs fixed

---------

Co-authored-by: igardev <ivailo.gardev@akros.ch>
2025-10-05 15:44:04 +03:00
igardev
fa9aa34755
Refactor menu.ts and bug fixes
- Fix bug for adding files by the agent
- Extract agent service from menu
- Adding agent from menu is now possible (not very user friendly, but working)
2025-10-03 00:50:10 +03:00
igardev
8b0943ec8f
Bug fixes
- launch commands fixed (some chars were escaped)
- Agent chat is visualized on opening chat
2025-10-02 14:23:14 +03:00
igardev
fd964e708e
- Menu.ts refactored (#119)
- Bug with editing files by agent fixed
- Menu.ts refactored
2025-10-01 10:04:01 +03:00
igardev
c1edca02c4
Predefined lists and refactoring (#118)
- File menu.ts refactored
- Predefined lists added for completion models, chat models, embeddings models, tools models and for envs
- Bugfixes
- If chat model is not selected, but a tools model is selected, it is used for generating commit messages, editing code with AI and in search_source tool
2025-09-30 19:32:06 +03:00
igardev
1a38de5c3d
Documentation updated (#115)
- xAI Grog4 free (from OpenRouter) added to the initial models
- Chat with AI with project context removed (agent does it better)
- Chat with AI about llama-vscode is now with agent, not using webui
- Agent - new buttons "Tools Model" and "Agent" -  possibility to view the selected model and agent and to change them.
2025-09-21 15:32:57 +03:00
igardev
dff8c28e32
Resolve the problems with the new webui (#114)
- Chat with AI with project context removed (agent does it better)
- Chat with AI about llama-vscode is now with agent, not using webui
- Agent - new buttons "Tools Model" and "Agent" -  possibility to view the selected model and agent and to change them.
- xAI Grog4 free (from OpenRouter) added to the initial models
2025-09-21 12:36:06 +03:00
igardev
042d04df1b
Agent Rules and Commands (#112)
- Added rules - setting agent_rules or llama-vscode-rules.md
- Added agent commands - setting agent_commands/llama-vscode menu "agent commands...". (shortcut for often used prompts, in agent - press "/" and select agent command).
- Generate commit message now checks if there is a running chat model (or endpoint_chat)
2025-09-18 13:42:17 +03:00
igardev
cabf670882
Agent UI with stream (#111)
- In Agent UI the requests the tokens are shown immediately, no when the complete response is received
- Bug fixes for Edit with AI
- tools_custom and context_custom settings are added
- -fa option is removed from huggingface download command
- Add model menu command is replaced with two Add local model and Add external model
2025-09-15 11:49:46 +03:00
igardev
598fedaa3d
Add settings to disable the popups on startup for installing/upgrading llama.cpp (#109)
Setting ask_install_llamacpp added to control if llama-vscode should ask the user to install llama.cpp
Setting upgrade_llamacpp_hours added to control how often llama-vscode should ask the user to upgrade llama.cpp
If the user cancels the llama.cpp installation on startup - llama-vscode suggests to disable the future popups for installation
If the user cancels the llama.cpp upgrade on startup - llama-vscode suggests to disable the future popups for upgrade (sets upgrade_llamacpp_hours to more than 8 years)
2025-09-09 15:08:07 +03:00
igardev
bc39b95828
Environment UI (#106)
Environment UI added
-fa option removed from the llama-server calls
Env extended with agent and some settings (not only models anymore)
2025-09-05 00:59:21 +03:00
igardev
8af7f8ed6a
Fix the problem with endless cycle on refusing to install llama.cpp on startup
- Fix cycle on refusing to install llama.cpp
- Other UX changes
2025-08-28 14:25:08 +03:00
igardev
c26950cb2e
Chats, separate view (#100)
- Changes history added
- Chats could be selected, deleted, exported, imported
- llama-vscode UI (agent) is shown in a separate view now, not as part of Explorer view.
2025-08-27 12:48:37 +03:00
igardev
838311659a
fix for starting completion model twice (#98) 2025-08-18 23:38:53 +03:00
igardev
08c0a22a73
Agent entity added (#97)
- Agent entity added - agents with different system prompts and default tools could be selected
- Fixed showing tables in llama agent
- Local envs with gpt-oss 20B added (also available for import from here )
2025-08-18 10:48:55 +03:00
igardev
6e46e719af
Fixed bug in case no env selected
- Fixed a bug in case no env selected
- Docu updated
- Setting env_start_last_used added to start automatically the last used env on startup
2025-08-16 12:38:32 +03:00
igardev
6d5c3f0c79
Remove envs with gpt-oss agent local use (#94)
* Increase the space for llama agent, 

* fix a bug for showing llama-agent.

* Update the documentation for llama-vscode

* Envs with local gpt-oss for agent removed
2025-08-15 12:53:34 +03:00
igardev
3ed9ef7585
Improve llama agent (#93)
Llama Agent UI improved
Chat with AI about llama.vscode added
Markdown is used in llama agent for better visualization
Orchestra is renamed to env
2025-08-14 17:11:56 +03:00
igardev
d55a8c5c0a
Remove typescript module from the source code. (#91) 2025-08-11 18:32:37 +03:00
igardev
8e0531f906
Add models from huggingface (#89)
* Adding a model from huggingface implemented

* Tool llama_vscode_help is added
2025-08-11 17:40:11 +03:00
igardev
61d1de7a07
Improvements for Llama Agent, introduction of Orchestra concept. (#84)
Llama Agent UI improved - look and feel, statuc, etc.
New menus for managing completion models, chat models, embeddings models and tools models
Concept of selected models - for completion, chat, embeddings and tools
Orchestra concept introduced. Orchestra is a group of models. Starting(selecting)/stopping orchestra starts(selects)/stops all the models
Import/Export orchestra and models from/to file implemented
OpenAI gpt-oss 20B added as a local one in tools models and chat models
Predefined Orchestras for different use cases - only completion, chat + completion, chat + agent, etc.
2025-08-07 23:04:56 +03:00
igardev
e49bb9f128
Agent support added (#79)
- Llama Agent UI in Explorer view
- OpenRouter API model selection (assumes your OpenRauter key is in setting Api_key_tools)
- MCP Support
- 9 internal tools available for use
- custom_tool - returns the content of a file or a web page
- custom_eval_tool - write your own tool in Typescript/javascript
- Attach the selection to the context
- Configure maximum loops for Llama Agent
2025-08-01 15:59:16 +03:00
igardev
019f8a52b5
feat: improve completion function by handling tabs as well as spaces (#73)
- Enhanced the `removeLeadingSpaces` function to account for tabs in addition to spaces

Co-authored-by: igardev <ivailo.gardev@akros.ch>
2025-05-27 11:35:03 +03:00
igardev
b559e3d08f
feat: added RAG enable/disable setting and menu item (#70)
Co-authored-by: igardev <ivailo.gardev@akros.ch>
2025-05-20 14:48:36 +03:00
igardev
4e4a5ccb6a
Fixes and improvements (#67)
* feat: enhance text editor functionality
- Added methods to expand selection to full lines
- Implemented functions to remove leading spaces from text
- Added functions to add leading spaces to text

* fix: don't send requests for updating the context if the completions are disabled

* refactor: remove unused code and optimize performance
- Removed duplicate code and optimized performance in `architect.ts` and `text-editor.ts`

* feat: add RAG configuration option `rag_max_files` to limit the number of files indexed for RAG search
2025-05-20 06:33:08 +03:00
Georgi Gerganov
80808eb276
release : v0.0.11 2025-05-16 14:48:57 +03:00
igardev
6ab5113983
feat: reuse received embeddings (#64)
* fix: update cosine similarity logic

- Updated cosine similarity function to use chunk.embedding instead of getting embedding again
- Fixed edge case where chunk.embedding is empty

* feat: update menu items
- Added "Start all models" item with description
2025-05-15 18:06:14 +03:00
igardev
8be731ad7e
Add api key for chat server and embeddings server (#62)
* feat: update chat edit text prompt

- Improve formatting for instructions and original text
- Remove redundant chunks section
- Navigate to the first difference after opening diff panel

* feat: update configuration options for llama.cpp server API keys

- Added `llama-vscode.api_key_chat` and `llama-vscode.api_key_embeddings` configuration options
- Updated `llama-vscode.api_key` to use new key names
- Edit with AI - don't send chunks, navigate to the first change in the diff panel

* bug: update API key configuration
- Updated API key configuration for chat and embeddings endpoints

---------

Co-authored-by: igardev <ivailo.gardev@akros.ch>
2025-05-11 11:15:00 +03:00