ollama-autocoder

A simple-to-use Ollama autocompletion engine with options exposed and streaming functionality

Stars: 92

Ollama Autocoder is a simple-to-use autocompletion engine that integrates with Ollama. It streams generated text to the cursor in real time and exposes its options in settings. It requires a reachable Ollama API endpoint and an installed model, both configured in settings, and the prompt window size should align with the model's context window. Users trigger a completion by pressing a completion key or by running a command from the command palette.

README:

Ollama Autocoder

A simple-to-use Ollama autocompletion engine with options exposed and streaming functionality

Requirements

  • Ollama must be serving on the API endpoint configured in settings.
    • To install Ollama, visit ollama.ai.
  • The model configured in settings must be installed in Ollama. The current default is qwen2.5-coder:latest.
  • The prompt window size should align with the maximum context window of the model.
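
The first two requirements can be checked before relying on the extension. The following is a minimal sketch, not part of the extension itself. It assumes Ollama's default local address (http://localhost:11434) and its /api/tags endpoint, which lists installed models. The helper names model_is_installed and check_ollama are hypothetical.

```python
import json
import urllib.request


def model_is_installed(tags_payload: dict, model: str) -> bool:
    """Return True if `model` appears in a parsed /api/tags response."""
    names = {entry.get("name") for entry in tags_payload.get("models", [])}
    return model in names


def check_ollama(base_url: str = "http://localhost:11434",
                 model: str = "qwen2.5-coder:latest") -> bool:
    """Ask a running Ollama server whether `model` is installed.

    base_url is an assumed default; use the endpoint from your settings.
    Raises urllib.error.URLError if nothing is serving at base_url.
    """
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=5) as resp:
        return model_is_installed(json.load(resp), model)
```

If check_ollama() returns False, running "ollama pull qwen2.5-coder:latest" should install the default model.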

How to Use

  1. In a text document, press space (or any character in the completion keys setting). The option "Autocomplete with Ollama" or a preview of the first line of the autocompletion will appear. Press enter to start generation.
    • Alternatively, you can run the "Autocomplete with Ollama" command from the command palette (or set a keybind).
  2. Once generation starts, the tokens will be streamed to your cursor.
  3. To stop the generation early, press the "Cancel" button on the "Ollama Autocoder" notification or type something.
  4. Once generation stops, the notification will disappear.
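
The streaming and early-cancel behaviour in the steps above can be sketched outside the editor. This is an illustration under stated assumptions, not the extension's own code. It assumes Ollama's /api/generate endpoint with "stream": true, which returns one JSON object per line carrying a "response" fragment and a "done" flag. The names stream_tokens and generate, and the cancelled callback standing in for the "Cancel" button, are hypothetical.

```python
import json
import urllib.request
from typing import Callable, Iterable, Iterator


def stream_tokens(lines: Iterable[bytes],
                  cancelled: Callable[[], bool] = lambda: False) -> Iterator[str]:
    """Yield text fragments from newline-delimited JSON chunks.

    Stops when a chunk reports done, or as soon as cancelled() returns True.
    """
    for raw in lines:
        if cancelled():
            break
        if not raw.strip():
            continue  # skip blank keep-alive lines
        chunk = json.loads(raw)
        if chunk.get("response"):
            yield chunk["response"]
        if chunk.get("done"):
            break


def generate(prompt: str,
             model: str = "qwen2.5-coder:latest",
             url: str = "http://localhost:11434/api/generate") -> Iterator[str]:
    """Stream a completion from a running Ollama server (assumed default URL)."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": True}).encode()
    req = urllib.request.Request(url, data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        # Iterating the HTTP response yields one line of bytes at a time.
        yield from stream_tokens(resp)
```

With a server running, "for token in generate('def fib(n):'): print(token, end='')" would print fragments as they arrive, much as the extension inserts them at the cursor.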

Notes

  • For the fastest results, an Nvidia GPU or Apple Silicon is recommended. CPU-only machines still work with small models.
  • The prompt contains only the text before the cursor. The model is unaware of any text after it.
  • For CPU-only, low-end, or battery-powered devices, it is highly recommended to disable the "response preview" option, because the preview triggers the model automatically. Disabling it causes "continue inline" to be always on. You can also increase the preview delay time.
  • If you don't want inline generation to continue beyond the response preview, change the "continue inline" option in settings to false. This doesn't apply to the command palette.
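
The note about the prompt seeing only the text before the cursor, together with the prompt window size requirement, can be illustrated with a short sketch. It assumes the window is measured in characters, which the text above does not state. The function name build_prompt is hypothetical.

```python
def build_prompt(document: str, cursor_offset: int, window_size: int) -> str:
    """Return the text before the cursor, trimmed to the last window_size characters.

    Text after the cursor is never included, so the model cannot see it.
    window_size is assumed to be a character count.
    """
    if window_size <= 0:
        return ""
    before_cursor = document[:max(cursor_offset, 0)]
    return before_cursor[-window_size:]
```

A window larger than the model's context would send text the model cannot use, which is why the two should align.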
