open-computer-use

open-computer-use

Secure AI computer use powered by E2B Desktop Sandbox

Stars: 540

Visit
 screenshot

Open Computer Use is a secure cloud Linux computer powered by E2B Desktop Sandbox and controlled by open-source LLMs. It allows users to operate the computer via keyboard, mouse, and shell commands, live stream the display of the sandbox on the client computer, and pause or prompt the agent at any time. The tool is designed to work with any operating system and supports integration with various LLMs and providers following the OpenAI API specification.

README:

Open Computer Use

A secure cloud Linux computer powered by E2B Desktop Sandbox and controlled by open-source LLMs.

https://github.com/user-attachments/assets/3837c4f6-45cb-43f2-9d51-a45f742424d4

Features

  • Uses E2B for secure Desktop Sandbox
  • Supports Meta Llama, OS-Atlas and any LLM you want to integrate!
  • Operates the computer via the keyboard, mouse, and shell commands
  • Live streams the display of the sandbox on the client computer
  • User can pause and prompt the agent at any time
  • Uses Ubuntu, but designed to work with any operating system

Design

Open Computer Use Architecture Open Computer Use Architecture

The details of the design are laid out in this article: How I taught an AI to use a computer

LLM support

Open Computer Use is designed to easily support new LLMs. The LLM and provider combinations are are defined in models.py. Following the comments in this file, one can easily add any LLM and provider that adheres to the OpenAI API specification.

The list of tested models and providers currently includes:

Type Model Providers
Vision Llama 3.2 Fireworks, OpenRouter, Llama API
Vision Gemini 2.0 Flash Google
Action Llama 3.3 Fireworks, Llama API
Action DeepSeek DeepSeek
Action Gemini 2.0 Flash Google
Grounding OS-Atlas HuggingFace Spaces

The following lines of code in models.py define the default LLMs and providers:

vision_model = FireworksProvider(model_names["fireworks"]["llama3.2"])
action_model = FireworksProvider(model_names["fireworks"]["llama3.3"])

If you add a new model or provider, please make a PR to this repository!

Get started

Prerequisites

1. Install the prerequisites

In your terminal:

brew install poetry ffmpeg

2. Clone the repository

In your terminal:

git clone https://github.com/e2b-dev/open-computer-use/

3. Set the environment variables

Enter the project directory:

cd open-computer-use

Create a .env file in open-computer-use and set the following:

# Get your API key here - https://e2b.dev/
E2B_API_KEY="your-e2b-api-key"
FIREWORKS_API_KEY="your-fireworks-api-key"

4. Start the web interface

Run the following command to start the agent:

poetry install
poetry run start

The agent will start and prompt you for its first instruction.

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for open-computer-use

Similar Open Source Tools

For similar tasks

For similar jobs