databend

๐——๐—ฎ๐˜๐—ฎ, ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ & ๐—”๐—œ. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Stars: 8082

Databend is an open-source cloud data warehouse built in Rust, offering fast query execution and data ingestion for complex analysis of large datasets. It integrates with major cloud platforms, provides high performance with AI-powered analytics, supports multiple data formats, ensures data integrity with ACID transactions, offers flexible indexing options, and features community-driven development. Users can try Databend through a serverless cloud or Docker installation, and perform tasks such as data import/export, querying semi-structured data, managing users/databases/tables, and utilizing AI functions.

README:

Databend: The Next-Gen Cloud [Data+AI] Analytics

The open-source, on-premise alternative to Snowflake

๐Ÿ‹ Introduction

Databend, built in Rust, is an open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake. With its focus on fast query execution and data ingestion, it's designed for complex analysis of the world's largest datasets.

Production-Proven Scale:

  • ๐Ÿค Enterprise Adoption: Trusted by over 50 organizations processing more than 100 million queries daily
  • ๐Ÿ—„๏ธ Massive Scale: Successfully managing over 800 petabytes of analytical data

Performance

TPC-H Benchmark: Databend Cloud vs. Snowflake

[Figure: TPC-H benchmark results, Databend vs. Snowflake]

Data Ingestion Benchmark: Databend Cloud vs. Snowflake

[Figure: data ingestion benchmark results, Databend vs. Snowflake]

Why Databend

  • Full Control: Deploy on cloud or on-prem to suit your needs.

  • Blazing-Fast Performance: Built with Rust for high-speed query execution. See: ClickBench

  • Cost-Effective: Scalable architecture that boosts performance and reduces costs. See: TPC-H

  • AI-Enhanced Analytics: Leverage built-in AI Functions for smarter data insights.

  • Simplified ETL: Direct data ingestion without the need for external ETL tools. See: Data Loading

  • Real-Time Data Updates: Keep your analytics up-to-date with real-time incremental data updates. See: Stream

  • Advanced Indexing: Boost query performance with Virtual Column, Aggregating Index, and Full-Text Index.

  • ACID Compliance + Version Control: Ensure reliable transactions with full ACID compliance and Git-like versioning.

  • Schema Flexibility: Effortlessly handle semi-structured data with the flexible VARIANT data type.

  • Community-Driven Growth: Open-source and continuously evolving with contributions from a global community.
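
To make the Schema Flexibility point concrete, here is a minimal sketch of working with a VARIANT column. The table and column names (`events`, `payload`) are hypothetical and not taken from the Databend documentation. The script only prints SQL; it assumes you pipe the output to a SQL client connected to a running Databend instance (for example BendSQL).

```shell
#!/usr/bin/env bash
# Print a small SQL script that exercises a VARIANT column.
# Table and column names (events, payload) are hypothetical.
variant_demo_sql() {
  cat <<'SQL'
CREATE TABLE events (id INT, payload VARIANT);
INSERT INTO events VALUES (1, PARSE_JSON('{"user": {"name": "ada"}, "tags": ["a", "b"]}'));
-- Read nested fields and array elements with bracket notation.
SELECT payload['user']['name'] AS user_name, payload['tags'][0] AS first_tag FROM events;
SQL
}

variant_demo_sql
```

No schema is declared for the JSON document itself; the nested field and the array element are addressed at query time.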

๐Ÿ“ Architecture

[Figure: Databend architecture diagram]

Try Databend

1. Databend Serverless Cloud

The fastest way to try Databend is Databend Cloud.

2. Install Databend from Docker

Prepare the image (once) from Docker Hub (this downloads about 170 MB of data):

docker pull datafuselabs/databend

To run Databend quickly:

docker run --net=host datafuselabs/databend
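
Once the container is running, a quick way to check that it answers queries is to post a statement to its HTTP handler. The sketch below makes several assumptions that are not stated in this README: that the handler listens on `localhost:8000` at `/v1/query/`, and that the image's default `root` user has an empty password. The helper names `build_payload` and `run_query` are illustrative only.

```shell
#!/usr/bin/env bash
# Assumption: the container's HTTP query handler is reachable here and
# accepts the default passwordless `root` user. Override via DATABEND_URL.
DATABEND_URL="${DATABEND_URL:-http://localhost:8000/v1/query/}"

# Build the JSON request body for one SQL statement.
# No escaping is done, so keep double quotes out of the statement.
build_payload() {
  printf '{"sql": "%s"}' "$1"
}

# Send one SQL statement to the running container and print the raw response.
run_query() {
  curl -s -u root: -X POST "$DATABEND_URL" \
    -H 'Content-Type: application/json' \
    -d "$(build_payload "$1")"
}

# Example (requires the container started above to be running):
# run_query "SELECT name FROM system.contributors LIMIT 5"
```

The example query reads from the system.contributors table, which this README mentions under Contributing.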

Getting Started

Connecting to Databend
Data Import and Export
Loading Data From Other Databases
Querying Semi-structured Data
Visualize Tools with Databend
Managing Users
Managing Databases
Managing Tables
Managing Data
Managing Views
AI Functions
Data Management
Accessing Data Lake
Security
Performance

๐Ÿค Contributing

Databend thrives on community contributions! Whether it's through ideas, code, or documentation, every effort helps in enhancing our project. As a token of our appreciation, once your code is merged, your name will be eternally preserved in the system.contributors table.

Here are some resources to help you get started:

Community

For guidance on using Databend, we recommend starting with the official documentation. If you need further assistance, explore the following community channels:

๐Ÿ›ฃ๏ธ Roadmap

Stay updated with Databend's development journey. Here are our roadmap milestones:

License

Databend is released under a combination of two licenses: the Apache License 2.0 and the Elastic License 2.0.

When contributing to Databend, you can find the relevant license header in each file.

For more information, see the LICENSE file and Licensing FAQs.

๐Ÿ™ Acknowledgement
