astron-rpa
Agent-ready RPA suite with out-of-the-box automation tools. Built for individuals and enterprises.
Stars: 6395
AstronRPA is an enterprise-grade Robotic Process Automation (RPA) desktop application that supports low-code/no-code development. It enables users to rapidly build workflows and automate desktop software and web pages. The tool offers comprehensive automation support for various applications, highly component-based design, enterprise-grade security and collaboration features, developer-friendly experience, native agent empowerment, and multi-channel trigger integration. It follows a frontend-backend separation architecture with components for system operations, browser automation, GUI automation, AI integration, and more. The tool is deployed via Docker and designed for complex RPA scenarios.
README:
๐ค Pioneering Open-Source Enterprise RPA Desktop Application
Astron RPA Official Site ยท Deployment Guide ยท User Documentation ยท FAQ
English | ็ฎไฝไธญๆ
AstronRPA is an enterprise-grade Robotic Process Automation (RPA) desktop application. Through a visual designer, it supports low-code/no-code development, enabling users to rapidly build workflows and automate desktop software and web pages.
Astron Agent is the native Agent platform supported by this project. Users can directly call RPA workflow nodes in Astron Agent, and also use Agent workflows in AstronRPA, achieving efficient collaboration between automation processes and intelligent agent systems, empowering broader business automation scenarios.
- ๐ ๏ธ Comprehensive Automation Support: Comprehensive coverage of Windows desktop applications and web page automation. Supports common office software like WPS and Office, financial and ERP systems like Kingdee and YonYou, and various browsers like IE, Edge, and Chrome, enabling end-to-end cross-application automation.
- ๐งฉ Highly Component-based: 300+ pre-built atomic capabilities covering UI operations, data processing, and system interactions. Supports visual orchestration and custom component extensions with high flexibility and maintainability.
- ๐ญ Enterprise-grade Security & Collaboration: Built-in excellence center and team marketplace with enterprise modules. Provides terminal monitoring, scheduling modes, robot team sharing and collaborative functions. Build a complete enterprise automation management ecosystem with process security, permission control, and cross-team collaboration.
- ๐จโ๐ป Developer-friendly Experience: Low-code, visual process design and debugging environment. Quickly build automation workflows through intuitive drag-and-drop methods, reducing development barriers, improving development efficiency, and enabling business users to participate in automation creation.
- ๐ค Native Agent Empowerment: Deep integration with Astron Agent platform supporting bi-directional calls between automation processes and AI agents with capability fusion. Achieve seamless connection between task reasoning, decision making, and automated execution, expanding automation boundaries.
- ๐ Multi-channel Trigger Integration: Supports direct execution, scheduled tasks, scheduling modes, API calls, and MCP services. Flexible integration capabilities to quickly respond to third-party system integration needs and easily embed in complex business scenarios.
- ๐ป Client Operating System: Windows 10/11 (primary support)
- ๐ง RAM >= 8 GiB
Recommended for quick deployment:
# Clone the repository
git clone https://github.com/iflytek/astron-rpa.git
cd astron-rpa
# Enter docker directory
cd docker
# Copy .env
cp .env.example .env
# Modify casdoor service configuration in .env (8000 is the default port)
CASDOOR_EXTERNAL_ENDPOINT="http://{YOUR_SERVER_IP}:8000"
# ๐ Start all services
docker compose up -d
# ๐ Check service status
docker compose ps- After all services have started, open your browser and go to:
http://{YOUR_SERVER_IP}:32742/api/rpa-auth/user/login-check(32742 is the default port; change it if you modified the configuration). - If you see
{"code":"900001","data":null,"message":"unauthorized"}, it means the deployment is correct and the connection is working properly. - Open your browser and go to:
http://{YOUR_SERVER_IP}:8000(8000 is the default port; change it if you modified the configuration). - If you see the Casdoor login page, it means Casdoor is deployed correctly.
- For production deployment and security hardening, refer to Deployment Guide
| Tool | Version | Description |
|---|---|---|
| Node.js | >= 22 | JavaScript runtime |
| Python | 3.13.x | RPA engine core |
| Java | JDK 8+ | Backend runtime |
| pnpm | >= 9 | Node.js package manager |
| UV | 0.8+ | Python package management tool |
| 7-Zip | - | Create deployment archives |
| SWIG | - | Connect Python with C/C++ |
For specific installation instructions and common issues, refer to Build Guide.
Download the latest Release Package
-
Prepare Python Environment
# Prepare a Python 3.13.x installation directory (can be a local folder or system installation path) # The script will copy this directory to create python_core
-
Run Build Script
# Full build (engine + frontend + desktop app) from project root directory ./build.bat --python-exe "C:\Program Files\Python313\python.exe" # Or use default configuration (if Python is in default path) ./build.bat # Wait for completion # Build successful when console displays "Full Build Complete!"
Note: Please ensure the specified Python interpreter is a clean installation without additional third-party packages to minimize package size.
Build process includes:
- โ
Detect/copy Python environment to
build/python_core - โ Install RPA engine dependencies
- โ
Compress Python core to
resources/python_core.7z - โ Install frontend dependencies
- โ Build frontend web application
- โ Build desktop application
- โ
Detect/copy Python environment to
-
๐ฆ Install the packaged client
โ๏ธ After installation, modify the server address in resources/conf.yaml in the installation directory:
```yaml
# 32742 is the default port; change it if you modified the configuration
remote_addr: http://YOUR_SERVER_ADDRESS:32742/
skip_engine_start: false
```
The project adopts a frontend-backend separation architecture. The frontend is built with Vue 3 + TypeScript and Electron for desktop applications; the backend uses Java Spring Boot and Python FastAPI to build microservices supporting business and AI capabilities. The engine layer is based on Python, integrating 20+ RPA components with support for image recognition and UI automation. The entire system is deployed via Docker with high observability and scalability, designed for complex RPA scenarios.
- astronverse.system: System operations, process management, screenshots
- astronverse.browser: Browser automation, web page operations
- astronverse.gui: GUI automation, mouse and keyboard operations
- astronverse.excel: Excel spreadsheet operations, data processing
- astronverse.vision: Computer vision, image recognition
- astronverse.ai: AI intelligent service integration
- astronverse.network: Network requests, API calls
- astronverse.email: Email sending and receiving
- astronverse.docx: Word document processing
- astronverse.pdf: PDF document operations
- astronverse.encrypt: Encryption and decryption functions
- astronverse.actionlib: Atomic operation definition and execution
- astronverse.executor: Workflow execution engine
- astronverse.picker: Workflow element picker engine
- astronverse.scheduler: Engine scheduler
- astronverse.trigger: Engine trigger
- astronverse.baseline: RPA framework core
- astronverse.websocketserver: WebSocket communication
- astronverse.websocketclient: WebSocket communication
- astronverse.locator: Element locating technology
- ๐ User Guide
- ๐ Deployment Guide
- ๐ API Documentation
- ๐ง Component Development Guide
- ๐ Troubleshooting
- ๐ Changelog
We welcome any form of contribution! Please check Contributing Guide
- โ Follow existing code style
- โ Add necessary test cases
- โ Update relevant documentation
- โ Ensure all checks pass
- ๐ด Fork the repository
- ๐ฟ Create your feature branch (
git checkout -b feature/AmazingFeature) - ๐พ Commit your changes (
git commit -m 'Add some AmazingFeature') - ๐ Push to the branch (
git push origin feature/AmazingFeature) - ๐ Open a Pull Request
- ๐ง Technical Support: [email protected]
- ๐ฌ Community Discussion: GitHub Discussions
- ๐ Bug Reports: Issues
- ๐ฅ WeChat Work Group:
This project is open source under the Open Source License.
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for astron-rpa
Similar Open Source Tools
For similar tasks
inferable
Inferable is an open source platform that helps users build reliable LLM-powered agentic automations at scale. It offers a managed agent runtime, durable tool calling, zero network configuration, multiple language support, and is fully open source under the MIT license. Users can define functions, register them with Inferable, and create runs that utilize these functions to automate tasks. The platform supports Node.js/TypeScript, Go, .NET, and React, and provides SDKs, core services, and bootstrap templates for various languages.
CEO
CEO is an intuitive and modular AI agent framework designed for task automation. It provides a flexible environment for building agents with specific abilities and personalities, allowing users to assign tasks and interact with the agents to automate various processes. The framework supports multi-agent collaboration scenarios and offers functionalities like instantiating agents, granting abilities, assigning queries, and executing tasks. Users can customize agent personalities and define specific abilities using decorators, making it easy to create complex automation workflows.
evi-run
evi-run is a powerful, production-ready multi-agent AI system built on Python using the OpenAI Agents SDK. It offers instant deployment, ultimate flexibility, built-in analytics, Telegram integration, and scalable architecture. The system features memory management, knowledge integration, task scheduling, multi-agent orchestration, custom agent creation, deep research, web intelligence, document processing, image generation, DEX analytics, and Solana token swap. It supports flexible usage modes like private, free, and pay mode, with upcoming features including NSFW mode, task scheduler, and automatic limit orders. The technology stack includes Python 3.11, OpenAI Agents SDK, Telegram Bot API, PostgreSQL, Redis, and Docker & Docker Compose for deployment.
Open-WebUI-Functions
Open-WebUI-Functions is a collection of Python-based functions that extend Open WebUI with custom pipelines, filters, and integrations. Users can interact with AI models, process data efficiently, and customize the Open WebUI experience. It includes features like custom pipelines, data processing filters, Azure AI support, N8N workflow integration, flexible configuration, secure API key management, and support for both streaming and non-streaming processing. The functions require an active Open WebUI instance, may need external AI services like Azure AI, and admin access for installation. Security features include automatic encryption of sensitive information like API keys. Pipelines include Azure AI Foundry, N8N, Infomaniak, and Google Gemini. Filters like Time Token Tracker measure response time and token usage. Integrations with Azure AI, N8N, Infomaniak, and Google are supported. Contributions are welcome, and the project is licensed under Apache License 2.0.
astron-rpa
AstronRPA is an enterprise-grade Robotic Process Automation (RPA) desktop application that supports low-code/no-code development. It enables users to rapidly build workflows and automate desktop software and web pages. The tool offers comprehensive automation support for various applications, highly component-based design, enterprise-grade security and collaboration features, developer-friendly experience, native agent empowerment, and multi-channel trigger integration. It follows a frontend-backend separation architecture with components for system operations, browser automation, GUI automation, AI integration, and more. The tool is deployed via Docker and designed for complex RPA scenarios.
airflow
Apache Airflow (or simply Airflow) is a platform to programmatically author, schedule, and monitor workflows. When workflows are defined as code, they become more maintainable, versionable, testable, and collaborative. Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes your tasks on an array of workers while following the specified dependencies. Rich command line utilities make performing complex surgeries on DAGs a snap. The rich user interface makes it easy to visualize pipelines running in production, monitor progress, and troubleshoot issues when needed.
rill-flow
Rill Flow is a high-performance, scalable distributed workflow orchestration service that supports the execution of tens of millions of tasks per day with task execution latency less than 100ms. It is distributed and supports the orchestration and scheduling of heterogeneous distributed systems. Rill Flow is easy to use, supporting visual process orchestration and plug-in access. It is cloud native, allowing for cloud native container deployment and cloud native function orchestration. Additionally, Rill Flow supports rapid integration of LLM model services.
PySpur
PySpur is a graph-based editor designed for LLM workflows, offering modular building blocks for easy workflow creation and debugging at node level. It allows users to evaluate final performance and promises self-improvement features in the future. PySpur is easy-to-hack, supports JSON configs for workflow graphs, and is lightweight with minimal dependencies, making it a versatile tool for workflow management in the field of AI and machine learning.
For similar jobs
aiscript
AiScript is a lightweight scripting language that runs on JavaScript. It supports arrays, objects, and functions as first-class citizens, and is easy to write without the need for semicolons or commas. AiScript runs in a secure sandbox environment, preventing infinite loops from freezing the host. It also allows for easy provision of variables and functions from the host.
askui
AskUI is a reliable, automated end-to-end automation tool that only depends on what is shown on your screen instead of the technology or platform you are running on.
bots
The 'bots' repository is a collection of guides, tools, and example bots for programming bots to play video games. It provides resources on running bots live, installing the BotLab client, debugging bots, testing bots in simulated environments, and more. The repository also includes example bots for games like EVE Online, Tribal Wars 2, and Elvenar. Users can learn about developing bots for specific games, syntax of the Elm programming language, and tools for memory reading development. Additionally, there are guides on bot programming, contributing to BotLab, and exploring Elm syntax and core library.
ain
Ain is a terminal HTTP API client designed for scripting input and processing output via pipes. It allows flexible organization of APIs using files and folders, supports shell-scripts and executables for common tasks, handles url-encoding, and enables sharing the resulting curl, wget, or httpie command-line. Users can put things that change in environment variables or .env-files, and pipe the API output for further processing. Ain targets users who work with many APIs using a simple file format and uses curl, wget, or httpie to make the actual calls.
LaVague
LaVague is an open-source Large Action Model framework that uses advanced AI techniques to compile natural language instructions into browser automation code. It leverages Selenium or Playwright for browser actions. Users can interact with LaVague through an interactive Gradio interface to automate web interactions. The tool requires an OpenAI API key for default examples and offers a Playwright integration guide. Contributors can help by working on outlined tasks, submitting PRs, and engaging with the community on Discord. The project roadmap is available to track progress, but users should exercise caution when executing LLM-generated code using 'exec'.
robocorp
Robocorp is a platform that allows users to create, deploy, and operate Python automations and AI actions. It provides an easy way to extend the capabilities of AI agents, assistants, and copilots with custom actions written in Python. Users can create and deploy tools, skills, loaders, and plugins that securely connect any AI Assistant platform to their data and applications. The Robocorp Action Server makes Python scripts compatible with ChatGPT and LangChain by automatically creating and exposing an API based on function declaration, type hints, and docstrings. It simplifies the process of developing and deploying AI actions, enabling users to interact with AI frameworks effortlessly.
Open-Interface
Open Interface is a self-driving software that automates computer tasks by sending user requests to a language model backend (e.g., GPT-4V) and simulating keyboard and mouse inputs to execute the steps. It course-corrects by sending current screenshots to the language models. The tool supports MacOS, Linux, and Windows, and requires setting up the OpenAI API key for access to GPT-4V. It can automate tasks like creating meal plans, setting up custom language model backends, and more. Open Interface is currently not efficient in accurate spatial reasoning, tracking itself in tabular contexts, and navigating complex GUI-rich applications. Future improvements aim to enhance the tool's capabilities with better models trained on video walkthroughs. The tool is cost-effective, with user requests priced between $0.05 - $0.20, and offers features like interrupting the app and primary display visibility in multi-monitor setups.
AI-Case-Sorter-CS7.1
AI-Case-Sorter-CS7.1 is a project focused on building a case sorter using machine vision and machine learning AI to sort cases by headstamp. The repository includes Arduino code and 3D models necessary for the project.

