extensionOS
Imagine a world where everyone can access powerful AI models—LLMs, generative image models, and speech recognition—directly in their web browser. Integrating AI into daily browsing will revolutionise online interactions, offering instant, intelligent assistance tailored to individual needs.
Stars: 73
Extension | OS is an open-source browser extension that brings AI directly to users' web browsers, allowing them to access powerful models like LLMs seamlessly. Users can create prompts, fix grammar, and access intelligent assistance without switching tabs. The extension aims to revolutionize online information interaction by integrating AI into everyday browsing experiences. It offers features like Prompt Factory for tailored prompts, seamless LLM model access, secure API key storage, and a Mixture of Agents feature. The extension was developed to empower users to unleash their creativity with custom prompts and enhance their browsing experience with intelligent assistance.
README:
⭐️ Welcome to Extension | OS
Tired of the endless back-and-forth with ChatGPT, Claude, and other AI tools just to repeat the same task over and over?
You're not alone! I felt the same frustration, so I built a solution: Extension | OS—an open-source browser extension that makes AI accessible directly where you need it.
Imagine: You create a prompt like "Fix the grammar for this text," right-click, and job done—no more switching tabs, no more wasted time.
Imagine a world where every user has access to powerful models (LLMs and more) directly within their web browser. By integrating AI into everyday internet browsing, we can revolutionise the way people interact with information online, providing them with instant, intelligent assistance tailored to their needs.
Join an exclusive group of up to 100 early adopters and be among the first to experience the future of AI-powered browsing!
Select, right-click and select the functionality—it's that easy!
Pick your favorite provider and select the model that excites you the most.
Customize your look and feel, and unleash your creativity with your own prompts!
Mixture of Agents (pre-release)
Use my affiliation code when you sign-up on VAPI: https://vapi.ai/?aff=extension-os
- Clone the extension or download the latest release.
- Open the Chrome browser and navigate to chrome://extensions.
- Enable the developer mode by clicking the toggle switch in the top right corner of the page.
- Unpack/unzip the chrome-mv3-prod.zip archive.
- Click on the "Load unpacked" button and select the folder you just unzipped.
- The options page opens automatically; insert your API keys.
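As a rough sketch, the download-and-unpack steps above might look like this in a terminal (assuming you have already downloaded the chrome-mv3-prod.zip release asset into the current directory):

```shell
# Unpack the production build into a folder that "Load unpacked" can point at
unzip chrome-mv3-prod.zip -d extension-os
```

After this, chrome://extensions → "Load unpacked" → select the extension-os folder.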
- Prompt Factory: Effortlessly Tailor Every Prompt to Your Needs with Our Standard Installation.
- Prompt Factory: Choose the Functionality for Every Prompt: From Copy-Pasting to Opening a New Sidebar.
- Seamless Integration: Effortlessly access any LLM model directly from your favorite website.
- Secure Storage: Your API key is securely stored in the browser's local storage, ensuring it never leaves your device.
- [Beta] Mixture of Agents: Experience the innovative Mixture Of Agents feature.
On the morning of July 27th, 2024, I began an exciting journey by joining the SF Hackathon x Build Club. After months of refining the concept in my mind, I decided it was time to bring it to life. I worked on enhancing my idea, updating what I had already created, and empowering everyone to unleash their creativity with custom prompts.
All your data is stored locally on your hard drive.
/Users/<your-username>/Library/Application Support/Google/Chrome/Default/Sync Extension Settings/
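On macOS you can inspect this folder directly (path as given above; the "Default" profile directory may differ if you use multiple Chrome profiles):

```shell
# List the Extension | OS settings Chrome keeps on disk (default profile assumed)
ls "$HOME/Library/Application Support/Google/Chrome/Default/Sync Extension Settings/"
```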
To utilize the localhost option and perform LLM inference, you must set up a local Ollama server. You can download and install Ollama along with the CLI here.
Example:
ollama pull llama3.1
Example:
OLLAMA_ORIGINS=chrome-extension://* ollama serve
Important: You need to configure the environment variable OLLAMA_ORIGINS to chrome-extension://* to permit requests from the Chrome extension. If OLLAMA_ORIGINS is not correctly configured, you will encounter an error in the Chrome extension.
Security: the * in chrome-extension://* should be replaced with the extension ID. If you have downloaded Extension | OS from the Chrome Web Store, please use chrome-extension://bahjnakiionbepnlbogdkojcehaeefnp
Run launchctl setenv to set OLLAMA_ORIGINS.
launchctl setenv OLLAMA_ORIGINS "chrome-extension://bahjnakiionbepnlbogdkojcehaeefnp"
Setting environment variables on Mac (Ollama)
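To verify the configuration, you can simulate a request from the extension with curl. This is only a sanity check, assuming Ollama is running on its default port 11434, you pulled llama3.1 as above, and you use the Chrome Web Store extension ID:

```shell
# Ollama validates the Origin header against OLLAMA_ORIGINS,
# so this mimics what the browser extension sends
curl http://localhost:11434/api/generate \
  -H "Origin: chrome-extension://bahjnakiionbepnlbogdkojcehaeefnp" \
  -d '{"model": "llama3.1", "prompt": "Say hello", "stream": false}'
```

If OLLAMA_ORIGINS is misconfigured, this request is rejected instead of returning a JSON response.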
The Ollama server can also be run in a Docker container. The container should have the OLLAMA_ORIGINS environment variable set to chrome-extension://*.
Run docker run with the -e flag to set the OLLAMA_ORIGINS environment variable:
docker run -e OLLAMA_ORIGINS="chrome-extension://bahjnakiionbepnlbogdkojcehaeefnp" -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
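Once the container is running, a model still needs to be pulled inside it before the extension can use it (the container name "ollama" comes from the docker run command above):

```shell
# Pull a model inside the running Ollama container
docker exec -it ollama ollama pull llama3.1
```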
Move it somewhere else ASAP:
- https://github.com/rowyio/roadmap?tab=readme-ov-file#step-1-setup-backend-template
- https://canny.io
- https://sleekplan.com/
- [ ] Logging: Determine a location to store log files.
- [ ] Prompt Factory: Add the ability to create custom prompts.
- [ ] Add the ability to chat within the browser.
- [ ] Encryption of keys: they are stored locally; nonetheless, as this is my first Chrome extension, I need to research how they can be accessed.
- [ ] Automated Testing
- [ ] Investigate if Playwright supports Chrome extension testing.
- [ ] Automated Tagging / Release
- [ ] Locale
- [ ] UI for the Prompt Factory is not intuitive and the "save all" button UX is cr@p.
- [ ] The sidebar API doesn't work after the storage API is called (User Interaction must be done)
- [ ] Move files to a /src folder to improve organization.
- [ ] Strategically organize the codebase structure.
- [ ] Decide on a package manager: npm, pnpm, or yarn.
- [ ] Workflow to update the models automatically.
- [ ] Prompt Factory: Add the ability to build workflows.
- [ ] Prompt Factory: Add the option to select which LLM to use for each prompt.
- [ ] Remove all the silly comments, maybe one day....
- ElevenLabs
- Build Club -> Hackathon Organiser
- Leonardo.ai -> Icon generated with the phoenix model
- Canva -> The other images not generated with AI
- ShadCn -> All the UI?
- Plasmo -> The Framework
- Groq -> Extra credits
- Icons -> icons8
- https://shadcnui-expansions.typeart.cc/
- Adding the ability to specify a custom URL
- Adding the uninstall hook to understand what we can improve.
- Fixed the X,Y positioning on pages like LinkedIn, Reddit, and so on.
- The declarativeNetRequest permission has been removed to enhance the release lifecycle in light of Chrome Store authorization requirements. Ollama continues to be fully supported, and detailed configuration instructions can be found in the README.
- Changed the introductory GIF demonstrating how to use Extension | OS.
- PromptFactory: Implemented a notification to inform users that any selected text will be automatically appended to the end of the prompt.
- Settings: Using Switch vs CheckBoxes
- Implemented optional (disabled by default) anonymous tracking to monitor usage patterns, including the most frequently used models and vendors.
- SelectionMenu: Now accessible on Reddit as well! (Consider prefixing all Tailwind classes for consistency)
- PromptSelector: Resolved all React warnings for a smoother experience
- Verified that pre-selection functions correctly (Thanks to E2E testing)
- Added more instructions for Ollama
- localhost: Add the ability to specify the model by input text (vs select box)
- Fixed a useEffect bug
- SelectionMenu: You can now choose to enable/disable it
- SelectionMenu: When a key is pressed (e.g. Backspace to delete, or CTRL/CMD + C to copy), the menu automatically disappears
- Development: Integrated Playwright for testing and added a suite of automated tests
- SelectionMenu: Fixed a bug that caused the menu to vanish unexpectedly after the onMouseUp event, leading to confusion regarding item selection for users.
- SelectionMenu: Adjusted the visual gap to provide more space to the user.
- UI: Eliminated the conflicting success/loading state for a clearer user experience.
- SelectionMenu: Refined the triggering mechanism for improved responsiveness.
- SelectionMenu: Reduced the size for a more compact design.
- SelectionMenu: Automatically refreshes items immediately after the user updates the prompts.
- Fixed grammar issues, thanks to Luca.
- Introduced a new menu, courtesy of Denis.
- The new menu currently does not support phone calls (feature coming soon).
- Enhanced UI (tooltips are now more noticeable) thanks to Juanjo (We Move Experience) and Agostina (PepperStudio)
- Prompt Factory: Utilizing AutoTextArea for improved prompt display
- Prompt Factory: Removed the ID to improve user experience (non-tech users)
- System: Split the systemPrompt from the userPrompt.
- UX: Small improvements and removed the complicated items
- General: Free tier exhaustion. We haven't got a sponsor (yet) to support our community users.
- Google: Added identity, identity.email to enable automatic log-in using your google credentials.
- General: Introduced a FREE Tier for users to explore the Extension | OS without needing to understand API Keys.
- Development: Implemented the CRX Public Key to maintain a consistent extension ID across re-installations during development.
- Development: Integrated OAUTH for user authentication when accessing the FREE tier.
- Permissions: Added identity permissions to facilitate user identity retrieval.
- Showcase: Updated images for improved visual presentation.
- Prompt Factory: Set Extension | OS as the default model, enabling users to utilize the extension without prior knowledge of API Key setup.
- Context Menu: Added a new right-click option for seamless access to configuration settings.
- Context Menu: Improved the layout and organization of the context menu for enhanced user experience.
- Prompt Factory: Introduced a comprehensive sheet that details the context and functionality of each feature.
- Prompt Factory: Implemented a clickable icon to indicate that the tooltip contains additional information when clicked.
- Bug fixes
- Clean up codebase
- UX for the functionality improved
- Removed an unnecessary dependency to comply with Chrome Store publication guidelines.
- Introduced a new icon.
- Implemented a loading state.
- Fixed an issue where Reddit visibility was broken.
- Adding missing models from together.ai
- Adding missing models from groq
- Updated About page
- MoA: Add the ability to use a custom prompt.
- Popup: UI revamped
- Popup: New Presentation image and slogan
- Options: Unified fonts
- Options: Minor UI updates
- Content: Better error handling and UX (users are redirected to the options page when the API key is missing)
- Fix for together.ai (it was using a non-chat model)
- Vapi affiliation link (help me maintain this extension; sign up with the link)
- Vapi Enhancements: Prompts now support selecting a specific phone number to call.
- Vapi Enhancements: Prompts can now include a custom initial message for the conversation.
- Vapi Enhancements: Now every prompt can be customised using the
- UI: Section for specific configurations
- Hotfix: declarativeNetRequest was intercepting every localhost request.
- Added github branch protection.
- Changed the data structure to achieve a clearer and more abstract way to call functions
- Function to clean the data structure to adapt to chrome.contextMenus.CreateProperties
- Use "side_" as a hack to open the sidebar. WHY: sidebar.open doesn't work after we call storage.get
- Allowing to change the default prompts
- chrome.runtime.openOptionsPage() opens only in production environment (onInstalled)
- Improved UI (switched to dark theme)
- Allowing the functionality to be changed; the "side_" bug is annoying, as it overcomplicates the codebase.
- How to install and start polishing the repository
- Check the demo video
- Ensure that the open.sidePanel is always initialized before the Plasmo Storage.
- We currently have two menus that function similarly but not identically; we need to implement a more efficient solution to consolidate them into one.
- The Plasmo handler may stop functioning unexpectedly without errors if a response is not returned; ensure to always return a response to prevent this issue.
Alternative AI tools for extensionOS
Similar Open Source Tools
Instrukt
Instrukt is a terminal-based AI integrated environment that allows users to create and instruct modular AI agents, generate document indexes for question-answering, and attach tools to any agent. It provides a platform for users to interact with AI agents in natural language and run them inside secure containers for performing tasks. The tool supports custom AI agents, chat with code and documents, tools customization, prompt console for quick interaction, LangChain ecosystem integration, secure containers for agent execution, and developer console for debugging and introspection. Instrukt aims to make AI accessible to everyone by providing tools that empower users without relying on external APIs and services.
llm-answer-engine
This repository contains the code and instructions needed to build a sophisticated answer engine that leverages the capabilities of Groq, Mistral AI's Mixtral, Langchain.JS, Brave Search, Serper API, and OpenAI. Designed to efficiently return sources, answers, images, videos, and follow-up questions based on user queries, this project is an ideal starting point for developers interested in natural language processing and search technologies.
obsidian-ai-assistant
Obsidian AI Assistant is a simple plugin that enables interactions with various AI models such as OpenAI ChatGPT, Anthropic Claude, OpenAI DALL·E, and OpenAI Whisper directly from Obsidian notes. The plugin offers features like text assistance, image generation, and speech-to-text functionality. Users can chat with the AI assistant, generate images for notes, and dictate notes using speech-to-text. The plugin allows customization of text models, image generation options, and language settings for speech-to-text. It requires official API keys for using OpenAI and Anthropic Claude models.
Ollama-Colab-Integration
Ollama Colab Integration V4 is a tool designed to enhance the interaction and management of large language models. It allows users to quantize models within their notebook environment, access a variety of models through a user-friendly interface, and manage public endpoints efficiently. The tool also provides features like LiteLLM proxy control, model insights, and customizable model file templating. Users can troubleshoot model loading issues, CPU fallback strategies, and manage VRAM and RAM effectively. Additionally, the tool offers functionalities for downloading model files from Hugging Face, model conversion with high precision, model quantization using Q and Kquants, and securely uploading converted models to Hugging Face.
transcriptionstream
Transcription Stream is a self-hosted diarization service that works offline, allowing users to easily transcribe and summarize audio files. It includes a web interface for file management, Ollama for complex operations on transcriptions, and Meilisearch for fast full-text search. Users can upload files via SSH or web interface, with output stored in named folders. The tool requires a NVIDIA GPU and provides various scripts for installation and running. Ports for SSH, HTTP, Ollama, and Meilisearch are specified, along with access details for SSH server and web interface. Customization options and troubleshooting tips are provided in the documentation.
ai-driven-dev-community
AI Driven Dev Community is a repository aimed at helping developers become more efficient by utilizing AI tools in their daily coding tasks. It provides a collection of tools, prompts, snippets, and agents for developers to integrate AI into their workflow. The repository is regularly updated with new resources and focuses on best practices for using AI in development work. Users can find tools like Espanso, ChatGPT, GitHub Copilot, and VSCode recommended for enhancing their coding experience. Additionally, the repository offers guidance on customizing AI for developers, installing AI toolbox for software engineers, and contributing to the community through easy steps.
LocalAIVoiceChat
LocalAIVoiceChat is an experimental alpha software that enables real-time voice chat with a customizable AI personality and voice on your PC. It integrates Zephyr 7B language model with speech-to-text and text-to-speech libraries. The tool is designed for users interested in state-of-the-art voice solutions and provides an early version of a local real-time chatbot.
Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.
llm-twin-course
The LLM Twin Course is a free, end-to-end framework for building production-ready LLM systems. It teaches you how to design, train, and deploy a production-ready LLM twin of yourself powered by LLMs, vector DBs, and LLMOps good practices. The course is split into 11 hands-on written lessons and the open-source code you can access on GitHub. You can read everything and try out the code at your own pace.
gemini-android
Gemini Android is a repository showcasing Google's Generative AI on Android using Stream Chat SDK for Compose. It demonstrates the Gemini API for Android, implements UI elements with Jetpack Compose, utilizes Android architecture components like Hilt and AppStartup, performs background tasks with Kotlin Coroutines, and integrates chat systems with Stream Chat Compose SDK for real-time event handling. The project also provides technical content, instructions on building the project, tech stack details, architecture overview, modularization strategies, and a contribution guideline. It follows Google's official architecture guidance and offers a real-world example of app architecture implementation.
ChatGPT-desktop
ChatGPT Desktop Application is a multi-platform tool that provides a powerful AI wrapper for generating text. It offers features like text-to-speech, exporting chat history in various formats, automatic application upgrades, system tray hover window, support for slash commands, customization of global shortcuts, and pop-up search. The application is built using Tauri and aims to enhance user experience by simplifying text generation tasks. It is available for Mac, Windows, and Linux, and is designed for personal learning and research purposes.
restai
RestAI is an AIaaS (AI as a Service) platform that allows users to create and consume AI agents (projects) using a simple REST API. It supports various types of agents, including RAG (Retrieval-Augmented Generation), RAGSQL (RAG for SQL), inference, vision, and router. RestAI features automatic VRAM management, support for any public LLM supported by LlamaIndex or any local LLM supported by Ollama, a user-friendly API with Swagger documentation, and a frontend for easy access. It also provides evaluation capabilities for RAG agents using deepeval.
Local-File-Organizer
The Local File Organizer is an AI-powered tool designed to help users organize their digital files efficiently and securely on their local device. By leveraging advanced AI models for text and visual content analysis, the tool automatically scans and categorizes files, generates relevant descriptions and filenames, and organizes them into a new directory structure. All AI processing occurs locally using the Nexa SDK, ensuring privacy and security. With support for multiple file types and customizable prompts, this tool aims to simplify file management and bring order to users' digital lives.
CursorLens
Cursor Lens is an open-source tool that acts as a proxy between Cursor and various AI providers, logging interactions and providing detailed analytics to help developers optimize their use of AI in their coding workflow. It supports multiple AI providers, captures and logs all requests, provides visual analytics on AI usage, allows users to set up and switch between different AI configurations, offers real-time monitoring of AI interactions, tracks token usage, estimates costs based on token usage and model pricing. Built with Next.js, React, PostgreSQL, Prisma ORM, Vercel AI SDK, Tailwind CSS, and shadcn/ui components.
miniperplx
MiniPerplx is a minimalistic AI-powered search engine designed to help users find information on the internet. It utilizes AI technologies from providers like OpenAI, Anthropic, and Tavily to deliver accurate and relevant search results. Users can deploy their own instance of MiniPerplx by obtaining API keys, setting up environment variables, and running the development server. The tool aims to streamline the process of information retrieval by leveraging advanced AI capabilities in a user-friendly interface.
For similar tasks
ai-commits-intellij-plugin
AI Commits is a plugin for IntelliJ-based IDEs and Android Studio that generates commit messages using git diff and OpenAI. It offers features such as generating commit messages from diff using OpenAI API, computing diff only from selected files and lines in the commit dialog, creating custom prompts for commit message generation, using predefined variables and hints to customize prompts, choosing any of the models available in OpenAI API, setting OpenAI network proxy, and setting custom OpenAI compatible API endpoint.
fuji-web
Fuji-Web is an intelligent AI partner designed for full browser automation. It autonomously navigates websites and performs tasks on behalf of the user while providing explanations for each action step. Users can easily install the extension in their browser, access the Fuji icon to input tasks, and interact with the tool to streamline web browsing tasks. The tool aims to enhance user productivity by automating repetitive web actions and providing a seamless browsing experience.
For similar jobs
weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.
kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.
PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.
tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.
spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.
Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.