Windows-Use

Windows-Use

🖥️Open-source Computer-Use for Windows

Stars: 757

Visit
 screenshot

Windows-Use is a powerful automation agent that interacts directly with the Windows OS at the GUI layer. It bridges the gap between AI agents and Windows to perform tasks such as opening apps, clicking buttons, typing, executing shell commands, and capturing UI state without relying on traditional computer vision models. It enables any large language model (LLM) to perform computer automation instead of relying on specific models for it.

README:

🪟 Windows-Use

PyPI Downloads License Python Platform: Windows 7 to 11
Follow on Twitter Join us on Discord

Windows-Use is a powerful automation agent that interact directly with the Windows at GUI layer. It bridges the gap between AI Agents and the Windows OS to perform tasks such as opening apps, clicking buttons, typing, executing shell commands, and capturing UI state all without relying on traditional computer vision models. Enabling any LLM to perform computer automation instead of relying on specific models for it.

🛠️Installation Guide

Prerequisites

  • Python 3.12 or higher
  • UV (or pip)
  • Windows 7 or 8 or 10 or 11

Installation Steps

Install using uv:

uv pip install windows-use

Or with pip:

pip install windows-use

⚙️Basic Usage

# main.py
from langchain_google_genai import ChatGoogleGenerativeAI
from windows_use.agent import Agent
from dotenv import load_dotenv

load_dotenv()

llm=ChatGoogleGenerativeAI(model='gemini-2.0-flash')
agent = Agent(llm=llm,browser='chrome',use_vision=True)
query=input("Enter your query: ")
agent_result=agent.invoke(query=query)
print(agent_result.content)

🤖 Run Agent

You can use the following to run from a script:

python main.py
Enter your query: <YOUR TASK>

🎥 Demos

PROMPT: Write a short note about LLMs and save to the desktop

https://github.com/user-attachments/assets/0faa5179-73c1-4547-b9e6-2875496b12a0

PROMPT: Change from Dark mode to Light mode

https://github.com/user-attachments/assets/47bdd166-1261-4155-8890-1b2189c0a3fd

📈 Grounding

Image Image Image Image Image

Vision

Talk to your computer. Watch it get things done.

Star History

Star History Chart

⚠️ Caution

Agent interacts directly with your Windows OS at GUI layer to perform actions. While the agent is designed to act intelligently and safely, it can make mistakes that might bring undesired system behaviour or cause unintended changes. Try to run the agent in a sandbox envirnoment.

🪪 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contributing

Contributions are welcome! Please check the CONTRIBUTING file for setup and development workflow.

Made with ❤️ by Jeomon George


Citation

@software{
  author       = {George, Jeomon},
  title        = {Windows-Use: Enable AI to control Windows OS},
  year         = {2025},
  publisher    = {GitHub},
  url={https://github.com/CursorTouch/Windows-Use}
}

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for Windows-Use

Similar Open Source Tools

For similar tasks

For similar jobs