Best AI tools for< Create Custom Data Processing Pipelines >
20 - AI tool Sites
FastBots.ai
FastBots.ai is an AI chatbot builder that allows users to create custom chatbots trained on their own data. These chatbots can be integrated into websites to provide customer support, sales assistance, and other services. FastBots.ai is easy to use and requires no coding. It supports a wide range of content types, including text, PDFs, and YouTube videos. FastBots.ai also offers a variety of features, such as customization options, chat history storage, and Zapier integration.
Lindy.ai
Lindy.ai is a platform that allows users to create custom AI assistants to automate tasks. The platform is designed to be easy to use, with no coding required. Lindy.ai offers a variety of pre-built AI assistants for common tasks, such as customer support, sales, and recruiting. Users can also create their own AI assistants from scratch. Lindy.ai integrates with a variety of third-party applications, including CRM systems, email clients, and document management systems.
BotGPT
BotGPT is a 24/7 custom AI chatbot assistant for websites. It offers a data-driven ChatGPT that allows users to create virtual assistants from their own data. Users can easily upload files or crawl their website to start asking questions and deploy a custom chatbot on their website within minutes. The platform provides a simple and efficient way to enhance customer engagement through AI-powered chatbots.
Quickchat AI
Quickchat AI is a custom AI assistant designed to automate customer support, lead generation, and more. It allows users to design, tweak, and deploy their own AI assistant trained on their data. Quickchat AI offers a range of features including customizable AI assistant name, conversation style, knowledge and actions, and deployment options. It also provides integrations with popular tools and systems, making it easy to use AI in everyday workflows.
ChatFast
ChatFast is a platform that allows businesses to create custom GPT chatbots using their own data. These chatbots can be used to answer customer questions, capture leads, and schedule appointments. ChatFast is easy to use and requires no coding. It is trusted by thousands of businesses and provides a range of powerful features, including the ability to train chatbots on multiple data sources, revise responses, capture leads, and create smart forms.
EmbedAI
EmbedAI is a platform that allows users to create custom AI chatbots powered by ChatGPT and trained on their own data. These chatbots can be embedded on websites and used to answer customer questions, provide support, or generate leads. EmbedAI is designed to be easy to use, even for those with no coding experience. It offers a variety of features to help users create and customize their chatbots, including a user-friendly interface, pre-built templates, and advanced training options.
Lexset
Lexset is an AI tool that provides synthetic data generation services for computer vision model training. It offers a no-code interface to create unlimited data with advanced camera controls and lighting options. Users can simulate AI-scale environments, composite objects into images, and create custom 3D scenarios. Lexset also provides access to GPU nodes, dedicated support, and feature development assistance. The tool aims to improve object detection accuracy and optimize generalization on high-quality synthetic data.
Bothatch
Bothatch is a platform that allows users to create custom chatbots powered by OpenAI's GPT technology. With Bothatch, users can upload their own data and documents to train their chatbots, which can then be used to engage in meaningful and productive conversations. Bothatch is designed to be easy to use, with no coding or technical skills required. It is also affordable, with pricing plans starting at $0 per month.
BulkGPT
BulkGPT is a no-code AI workflow automation tool that combines web scraping and AI capabilities to help users create custom workflows for mass scraping web pages, generating SEO blogs, personalized messages, and more without the need for coding. It simplifies content creation and data extraction processes by enabling users to upload data, run it in Google Sheets, or integrate it with other tools via API. BulkGPT offers features like web scraping in Google Sheets, URL crawling, SEO content creation, e-commerce product description generation, ChatGPT automation, data scraping, and marketing email campaign creation.
Booth AI
Booth AI is a platform that allows users to create custom AI solutions in minutes, not months. It is enterprise-ready, scale-ready, and disruption-ready. Booth AI offers a variety of features, including integration with over 100 apps, workplace tools, project management tools, marketing automation tools, and more. Booth AI can be used to solve a variety of business problems, including automating tasks, improving customer service, and increasing sales.
Chatbase
Chatbase is a platform that allows users to create custom chatbots for their websites. These chatbots can be used for a variety of purposes, including customer support, lead generation, and user engagement. Chatbase provides a variety of features to help users create and customize their chatbots, including the ability to import data from multiple sources, customize the chatbot's appearance and behavior, and integrate with other tools. Chatbase also offers a variety of pre-built templates and examples to help users get started.
Pointly
Pointly is an intelligent, cloud-based B2B software solution that enables efficient automatic and advanced manual classification in 3D point clouds. It offers innovative AI techniques for fast and precise data classification and vectorization, transforming point cloud analysis into an enjoyable and efficient workflow. Pointly provides standard and custom classifiers, tools for classification and vectorization, API and on-premise classification options, collaboration features, secure cloud processing, and scalability for handling large-scale point cloud data.
ThoughtfulGPT
ThoughtfulGPT is an AI-powered tool that helps teams automate tasks, improve communication, and make better decisions. With ThoughtfulGPT, you can create custom AI macros, integrate your private and public data sources, and access multiple language models. ThoughtfulGPT also offers a Long-Term AI Memory feature that allows you to save your thoughts, ideas, and processes to improve the AI's contextual understanding.
Teachable Machine
Teachable Machine is a web-based tool that makes it easy to create custom machine learning models, even if you don't have any coding experience. With Teachable Machine, you can train models to recognize images, sounds, and poses. Once you've trained a model, you can export it to use in your own projects.
Chat Thing
Chat Thing is a platform that allows users to create custom AI chatbots trained on their own data. These chatbots can be used for a variety of purposes, such as customer support, lead generation, research, and more. Chat Thing provides a variety of features to make it easy to create and manage chatbots, including a user-friendly interface, powerful training tools, and advanced analytics. With Chat Thing, businesses can save time and money by automating their customer interactions and providing personalized support to their customers.
Spine AI
Spine AI is a reliable AI analyst tool that provides conversational analytics tailored to understand your business. It empowers decision-makers by offering customized insights, deep business intelligence, proactive notifications, and flexible dashboards. The tool is designed to help users make better decisions by leveraging a purpose-built Data Processing Unit (DPU) and a semantic layer for natural language interactions. With a focus on rigorous evaluation and security, Spine AI aims to deliver explainable and customizable AI solutions for businesses.
Jekka
Jekka is an AI-powered platform that helps businesses automate their workflows and processes. It offers a range of features, including natural language processing, machine learning, and computer vision, that can be used to create custom AI solutions. Jekka is designed to be easy to use, even for those with no prior experience with AI. It provides a drag-and-drop interface that makes it simple to create and deploy AI models.
Illustrur
Illustrur is an AI-powered tool that helps users create custom illustrations for their articles or blog posts. With Illustrur, users can upload their article, select their preferred art style and colors, and receive their illustrations in just 15 minutes. Illustrur's AI analyzes the article's content and generates unique, customized illustrations that are tailored to the specific needs of the user.
Builder.ai
Builder.ai is an award-winning app development platform that empowers businesses of all sizes to create custom mobile and web applications without the need for coding knowledge. With Builder.ai, you can build a wide range of apps, including e-commerce stores, appointment booking systems, customer relationship management (CRM) tools, and more. Builder.ai's platform is easy to use and affordable, making it a great option for businesses that want to quickly and easily launch their own apps.
Connected Papers
Connected Papers is a search engine for academic papers that uses artificial intelligence to help users find and explore relevant research. It allows users to search for papers by keyword, author, or title, and then explore the connections between them. Connected Papers also provides a variety of tools to help users organize and manage their research, including the ability to create custom collections of papers, add notes and annotations, and share their research with others.
20 - Open Source AI Tools
unitxt
Unitxt is a customizable library for textual data preparation and evaluation tailored to generative language models. It natively integrates with common libraries like HuggingFace and LM-eval-harness and deconstructs processing flows into modular components, enabling easy customization and sharing between practitioners. These components encompass model-specific formats, task prompts, and many other comprehensive dataset processing definitions. The Unitxt-Catalog centralizes these components, fostering collaboration and exploration in modern textual data workflows. Beyond being a tool, Unitxt is a community-driven platform, empowering users to build, share, and advance their pipelines collaboratively.
towhee
Towhee is a cutting-edge framework designed to streamline the processing of unstructured data through the use of Large Language Model (LLM) based pipeline orchestration. It can extract insights from diverse data types like text, images, audio, and video files using generative AI and deep learning models. Towhee offers rich operators, prebuilt ETL pipelines, and a high-performance backend for efficient data processing. With a Pythonic API, users can build custom data processing pipelines easily. Towhee is suitable for tasks like sentence embedding, image embedding, video deduplication, question answering with documents, and cross-modal retrieval based on CLIP.
llm-twin-course
The LLM Twin Course is a free, end-to-end framework for building production-ready LLM systems. It teaches you how to design, train, and deploy a production-ready LLM twin of yourself powered by LLMs, vector DBs, and LLMOps good practices. The course is split into 11 hands-on written lessons and the open-source code you can access on GitHub. You can read everything and try out the code at your own pace.
data-prep-kit
Data Prep Kit is a community project aimed at democratizing and speeding up unstructured data preparation for LLM app developers. It provides high-level APIs and modules for transforming data (code, language, speech, visual) to optimize LLM performance across different use cases. The toolkit supports Python, Ray, Spark, and Kubeflow Pipelines runtimes, offering scalability from laptop to datacenter-scale processing. Developers can contribute new custom modules and leverage the data processing library for building data pipelines. Automation features include workflow automation with Kubeflow Pipelines for transform execution.
kernel-memory
Kernel Memory (KM) is a multi-modal AI Service specialized in the efficient indexing of datasets through custom continuous data hybrid pipelines, with support for Retrieval Augmented Generation (RAG), synthetic memory, prompt engineering, and custom semantic memory processing. KM is available as a Web Service, as a Docker container, a Plugin for ChatGPT/Copilot/Semantic Kernel, and as a .NET library for embedded applications. Utilizing advanced embeddings and LLMs, the system enables Natural Language querying for obtaining answers from the indexed data, complete with citations and links to the original sources. Designed for seamless integration as a Plugin with Semantic Kernel, Microsoft Copilot and ChatGPT, Kernel Memory enhances data-driven features in applications built for most popular AI platforms.
pathway
Pathway is a Python data processing framework for analytics and AI pipelines over data streams. It's the ideal solution for real-time processing use cases like streaming ETL or RAG pipelines for unstructured data. Pathway comes with an **easy-to-use Python API** , allowing you to seamlessly integrate your favorite Python ML libraries. Pathway code is versatile and robust: **you can use it in both development and production environments, handling both batch and streaming data effectively**. The same code can be used for local development, CI/CD tests, running batch jobs, handling stream replays, and processing data streams. Pathway is powered by a **scalable Rust engine** based on Differential Dataflow and performs incremental computation. Your Pathway code, despite being written in Python, is run by the Rust engine, enabling multithreading, multiprocessing, and distributed computations. All the pipeline is kept in memory and can be easily deployed with **Docker and Kubernetes**. You can install Pathway with pip: `pip install -U pathway` For any questions, you will find the community and team behind the project on Discord.
project-lakechain
Project Lakechain is a cloud-native, AI-powered framework for building document processing pipelines on AWS. It provides a composable API with built-in middlewares for common tasks, scalable architecture, cost efficiency, GPU and CPU support, and the ability to create custom transform middlewares. With ready-made examples and emphasis on modularity, Lakechain simplifies the deployment of scalable document pipelines for tasks like metadata extraction, NLP analysis, text summarization, translations, audio transcriptions, computer vision, and more.
draive
draive is an open-source Python library designed to simplify and accelerate the development of LLM-based applications. It offers abstract building blocks for connecting functionalities with large language models, flexible integration with various AI solutions, and a user-friendly framework for building scalable data processing pipelines. The library follows a function-oriented design, allowing users to represent complex programs as simple functions. It also provides tools for measuring and debugging functionalities, ensuring type safety and efficient asynchronous operations for modern Python apps.
swarms
Swarms provides simple, reliable, and agile tools to create your own Swarm tailored to your specific needs. Currently, Swarms is being used in production by RBC, John Deere, and many AI startups.
HybridAGI
HybridAGI is the first Programmable LLM-based Autonomous Agent that lets you program its behavior using a **graph-based prompt programming** approach. This state-of-the-art feature allows the AGI to efficiently use any tool while controlling the long-term behavior of the agent. Become the _first Prompt Programmers in history_ ; be a part of the AI revolution one node at a time! **Disclaimer: We are currently in the process of upgrading the codebase to integrate DSPy**
awesome-langchain
LangChain is an amazing framework to get LLM projects done in a matter of no time, and the ecosystem is growing fast. Here is an attempt to keep track of the initiatives around LangChain. Subscribe to the newsletter to stay informed about the Awesome LangChain. We send a couple of emails per month about the articles, videos, projects, and tools that grabbed our attention Contributions welcome. Add links through pull requests or create an issue to start a discussion. Please read the contribution guidelines before contributing.
awesome-mlops
Awesome MLOps is a curated list of tools related to Machine Learning Operations, covering areas such as AutoML, CI/CD for Machine Learning, Data Cataloging, Data Enrichment, Data Exploration, Data Management, Data Processing, Data Validation, Data Visualization, Drift Detection, Feature Engineering, Feature Store, Hyperparameter Tuning, Knowledge Sharing, Machine Learning Platforms, Model Fairness and Privacy, Model Interpretability, Model Lifecycle, Model Serving, Model Testing & Validation, Optimization Tools, Simplification Tools, Visual Analysis and Debugging, and Workflow Tools. The repository provides a comprehensive collection of tools and resources for individuals and teams working in the field of MLOps.
litlytics
LitLytics is an affordable analytics platform leveraging LLMs for automated data analysis. It simplifies analytics for teams without data scientists, generates custom pipelines, and allows customization. Cost-efficient with low data processing costs. Scalable and flexible, works with CSV, PDF, and plain text data formats.
sparrow
Sparrow is an innovative open-source solution for efficient data extraction and processing from various documents and images. It seamlessly handles forms, invoices, receipts, and other unstructured data sources. Sparrow stands out with its modular architecture, offering independent services and pipelines all optimized for robust performance. One of the critical functionalities of Sparrow - pluggable architecture. You can easily integrate and run data extraction pipelines using tools and frameworks like LlamaIndex, Haystack, or Unstructured. Sparrow enables local LLM data extraction pipelines through Ollama or Apple MLX. With Sparrow solution you get API, which helps to process and transform your data into structured output, ready to be integrated with custom workflows. Sparrow Agents - with Sparrow you can build independent LLM agents, and use API to invoke them from your system. **List of available agents:** * **llamaindex** - RAG pipeline with LlamaIndex for PDF processing * **vllamaindex** - RAG pipeline with LLamaIndex multimodal for image processing * **vprocessor** - RAG pipeline with OCR and LlamaIndex for image processing * **haystack** - RAG pipeline with Haystack for PDF processing * **fcall** - Function call pipeline * **unstructured-light** - RAG pipeline with Unstructured and LangChain, supports PDF and image processing * **unstructured** - RAG pipeline with Weaviate vector DB query, Unstructured and LangChain, supports PDF and image processing * **instructor** - RAG pipeline with Unstructured and Instructor libraries, supports PDF and image processing. Works great for JSON response generation
oci-data-science-ai-samples
The Oracle Cloud Infrastructure Data Science and AI services Examples repository provides demos, tutorials, and code examples showcasing various features of the OCI Data Science service and AI services. It offers tools for data scientists to develop and deploy machine learning models efficiently, with features like Accelerated Data Science SDK, distributed training, batch processing, and machine learning pipelines. Whether you're a beginner or an experienced practitioner, OCI Data Science Services provide the resources needed to build, train, and deploy models easily.
20 - OpenAI Gpts
Auto Custom Actions GPT
This GPT help you on one single task, generating valid OpenAI Schemas for Custom Actions in GPTs
3Commas API Expert
Python-focused expert on the 3Commas API, friendly and encouraging experimentation.
Apple Activity Kit Complete Code Expert
A detailed expert trained on all 1,337 pages of Apple ActivityKit, offering complete coding solutions. Saving time? https://www.buymeacoffee.com/parkerrex ☕️❤️
data trip
Dalle + custom corrupted data from every artist in the world. This is an experiment. (beta)
Custom GPT Builder
Create personalized GPTs with my simple builder. Click the conversation starter (starting with ###) to begin.
OpenAPI Wizard
Your guide for OpenAPI specs for helping make custom GPTs with reach easily!