Best AI tools for Document Processes
20 - AI tool Sites

Cradl AI
Cradl AI is an AI-powered tool designed to automate document workflows with no-code AI. It enables users to extract data from any document automatically, integrate with no-code tools, and build custom AI models through an easy-to-use interface. The tool empowers automation teams across industries by extracting data from complex document layouts, regardless of language or structure. Cradl AI offers features such as line item extraction, fine-tuning AI models, human-in-the-loop validation, and seamless integration with automation tools. It is trusted by organizations for business-critical document automation, providing enterprise-level features like encrypted transmission, GDPR compliance, secure data handling, and auto-scaling.

Cradl AI
Cradl AI is a no-code, AI-powered document workflow automation tool that helps organizations automate document-related tasks such as data extraction, processing, and validation. It uses AI to automatically extract data from complex documents, regardless of layout or language. Cradl AI also integrates with other no-code tools, making it easy to build and deploy custom AI models.

Hyperscience
Hyperscience is a leading enterprise AI platform that provides hyperautomation solutions for businesses. Its platform enables organizations to automate complex business processes with high accuracy and efficiency. Hyperscience offers a range of solutions across various industries and processes, leveraging technologies such as intelligent document processing, machine learning, and natural language processing. The platform is designed to help businesses transform their operations, improve decision-making, and gain a competitive advantage.

Whale
Whale is an AI-powered software designed to help businesses document their standard operating procedures, policies, and internal company knowledge. It streamlines the process of onboarding, training, and growing teams by leveraging AI technology to assist in creating and organizing documentation. Whale offers features such as AI-assisted SOP and process documentation, automated training flows, a single source of truth for knowledge management, and an AI assistant named Alice to help with various tasks. The platform aims to systemize and scale businesses by providing a user-friendly interface and dedicated support services.

Scribe
Scribe is a tool that allows users to create step-by-step guides for any process. It uses AI to automatically generate instructions and screenshots, and it can be used to document processes, train employees, and answer questions. Scribe is available as a Chrome extension and a desktop app.

Kopyst
Kopyst is an AI-powered documentation tool for creating videos and documents. It helps users streamline workflows and create user manuals, SOPs, and training documents accurately and efficiently. Kopyst offers features like instant documentation, versatile application for various document types, AI-powered intelligence, easy sharing and collaboration, and seamless integration with existing tools. The application empowers users to save time, reduce errors, optimize resources, and enhance productivity in documentation tasks.

DeepOpinion
DeepOpinion is an Agentic Automation platform designed for enterprises to automate business operations using cutting-edge GenAI and LLMs. It offers solutions for various industries like financial services, insurance, and manufacturing, enabling users to create enterprise-grade AI agent apps and powerful business apps in minutes. DeepOpinion excels in automating text and document processes, providing advanced features like Generative Automation and human validation within the Control Hub.

InclusiveDocs
InclusiveDocs is an innovative AI tool that is revolutionizing accessibility. It leverages cutting-edge artificial intelligence technology to enhance inclusivity in document management. By utilizing advanced algorithms, InclusiveDocs automates the process of making documents accessible to individuals with disabilities, ensuring that everyone can access and interact with information seamlessly. With a user-friendly interface and powerful features, InclusiveDocs is leading the way in promoting equal access to information for all.

TextMine
TextMine is an AI-powered knowledge base designed for businesses to manage and analyze critical documents efficiently. It offers features such as document analysis, smart-search capabilities, automated data extraction, and structured dataset transformation. TextMine helps businesses save time and money by streamlining document management processes and enabling informed decision-making. The application caters to various industries like Technology, Legal Services, and Financial Services, providing solutions for teams in Procurement, Finance, Compliance, CIOs, and CDOs.

Peslac AI
Peslac AI is an intelligent document processing and data extraction tool that streamlines document-heavy processes with advanced AI technology. It offers solutions for automating document processing, extracting data, verifying documents, processing forms, and workflow automation. Peslac serves industries such as insurance, finance, healthcare, legal, and others by providing tailored solutions to improve efficiency and accuracy in operations.

Law.co
Law.co is an advanced AI platform designed specifically for lawyers and law firms to streamline legal operations and enhance efficiency. The platform offers a semantic database search with access to over 1 million historical legal cases and 40,000 legal contracts, enabling users to perform detailed legal research, contract drafting, document review, and more. Law.co leverages custom-trained artificial intelligence and semantic search tools to deliver measurable results, revolutionizing legal research and document preparation processes for legal professionals.

Dubble
Dubble is a free tool that helps you create step-by-step guides, tutorials, and onboarding resources for your processes. It uses AI to watch how you work and translate your actions into written instructions and screenshots. This makes it easy to document your processes without having to write anything yourself.

Sendforsign
Sendforsign is an AI-powered platform that offers eSign automation for businesses, allowing them to streamline agreements, contracts, and document management processes. The platform provides a suite of products such as Legal AI co-pilot, Contract Builder, eSign APIs, and embeddable UIs to simplify the entire agreement process. With features like AI Automation, Embedded Contracts, Contract Management, and Modules, Sendforsign aims to make contract handling efficient and user-friendly.

Artsyl Technologies
Artsyl Technologies specializes in revolutionizing document processing through advanced AI-powered automation. Their flagship intelligent process automation platform, docAlpha, utilizes cutting-edge AI, RPA, and machine learning technologies to automate and optimize document workflows. By seamlessly integrating with organizations' ERP or Document Management Systems, docAlpha ensures enhanced efficiency, accuracy, and productivity across the entire business process.

Ocrolus
Ocrolus is an intelligent document automation software that leverages AI-driven document processing automation with Human-in-the-Loop. It offers capabilities such as classifying, capturing, detecting, and analyzing documents, with use cases in cash flow, income, address, employment, and identity verification. Ocrolus caters to various industries like small business lending, mortgage, consumer finance, and multifamily housing. The platform provides resources for developers, including guides on income verification, fraud detection, and business process automation. Users can explore the API to build innovative customer experiences and make faster and more accurate financial decisions.

ASSIST
ASSIST is an AI-driven document management software designed to streamline financial paperwork processing and data entry tasks. The application offers features such as SmartDoc Entry for extracting information from invoices and receipts, Polyglot Processing for multilingual support, One-Tap Integration with accounting platforms, ExportEase for data export in CSV format, and AutoFlow Revolution for automated workflows. ASSIST aims to simplify document management, enhance efficiency, and drive digital transformation in businesses by leveraging AI technology.

Jyotax.ai
Jyotax.ai is an AI-powered tax solution that revolutionizes tax compliance by simplifying the tax process with advanced AI solutions. It offers comprehensive bookkeeping, payroll processing, worldwide tax returns and filing automation, profit recovery, contract compliance, and financial modeling and budgeting services. The platform ensures accurate reporting, real-time compliance monitoring, global tax solutions, customizable tax tools, and seamless data integration. Jyotax.ai optimizes tax workflows, ensures compliance with precise AI tax calculations, and simplifies global tax operations through innovative AI solutions.

super.AI
Super.AI provides Intelligent Document Processing (IDP) solutions powered by Large Language Models (LLMs) and human-in-the-loop (HITL) capabilities. It automates document processing tasks such as data extraction, classification, and redaction, enabling businesses to streamline their workflows and improve accuracy. Super.AI's platform leverages cutting-edge AI models from providers like Amazon, Google, and OpenAI to handle complex documents, ensuring high-quality outputs. With its focus on accuracy, flexibility, and scalability, Super.AI caters to various industries, including financial services, insurance, logistics, and healthcare.

Infrrd
Infrrd is an intelligent document automation platform that offers advanced document extraction solutions. It leverages AI technology to enhance, classify, extract, and review documents with high accuracy, eliminating the need for human review. Infrrd provides effective process transformation solutions across various industries, such as mortgage, invoice, insurance, and audit QC. The platform is known for its world-class document extraction engine, supported by over 10 patents and award-winning algorithms. Infrrd's AI-powered automation streamlines document processing, improves data accuracy, and enhances operational efficiency for businesses.

Docugami
Docugami is an AI-powered document engineering platform that enables business users to extract, analyze, and automate data from various types of documents. It empowers users with immediate impact without the need for extensive machine learning investments or IT development. Docugami's proprietary Business Document Foundation Model and Generative AI technology transform unstructured text and tables into structured information, allowing users to unlock insights, increase productivity, and ensure compliance.
20 - Open Source AI Tools

erag
ERAG is an advanced system that combines lexical, semantic, text, and knowledge graph searches with conversation context to provide accurate and contextually relevant responses. This tool processes various document types, creates embeddings, builds knowledge graphs, and uses this information to answer user queries intelligently. It includes modules for interacting with web content, GitHub repositories, and performing exploratory data analysis using various language models.

aws-ai-intelligent-document-processing
This repository is part of Intelligent Document Processing with AWS AI Services workshop. It aims to automate the extraction of information from complex content in various document formats such as insurance claims, mortgages, healthcare claims, contracts, and legal contracts using AWS Machine Learning services like Amazon Textract and Amazon Comprehend. The repository provides hands-on labs to familiarize users with these AI services and build solutions to automate business processes that rely on manual inputs and intervention across different file types and formats.

paperless-ai
Paperless-AI is an automated document analyzer tool designed for Paperless-ngx users. It utilizes the OpenAI API and Ollama (Mistral, llama, phi 3, gemma 2) to automatically scan, analyze, and tag documents. The tool offers features such as automatic document scanning, AI-powered document analysis, automatic title and tag assignment, manual mode for analyzing documents, easy setup through a web interface, document processing dashboard, error handling, and Docker support. Users can configure the tool through a web interface and access a debug interface for monitoring and troubleshooting. Paperless-AI aims to streamline document organization and analysis processes for users with access to Paperless-ngx and AI capabilities.

rlama
RLAMA is a powerful AI-driven question-answering tool that seamlessly integrates with local Ollama models. It enables users to create, manage, and interact with Retrieval-Augmented Generation (RAG) systems tailored to their documentation needs. RLAMA follows a clean architecture pattern with clear separation of concerns, focusing on lightweight and portable RAG capabilities with minimal dependencies. The tool processes documents, generates embeddings, stores RAG systems locally, and provides contextually-informed responses to user queries. Supported document formats include text, code, and various document types, with troubleshooting steps available for common issues like Ollama accessibility, text extraction problems, and relevance of answers.

DB-GPT
DB-GPT is a personal database administrator that can solve database problems by reading documents, using various tools, and writing analysis reports. It is currently undergoing an upgrade.
**Features:**
* **Online Demo:**
  * Import documents into the knowledge base
  * Utilize the knowledge base for well-founded Q&A and diagnosis analysis of abnormal alarms
  * Send feedback to refine the intermediate diagnosis results
  * Edit the diagnosis result
  * Browse all historical diagnosis results, used metrics, and detailed diagnosis processes
* **Language Support:**
  * English (default)
  * Chinese (add "language: zh" in config.yaml)
* **New Frontend:**
  * Knowledgebase + Chat Q&A + Diagnosis + Report Replay
* **Extreme Speed Version for localized LLMs:**
  * 4-bit quantized LLM (reducing inference time by 1/3)
  * vLLM for fast inference (Qwen)
  * Tiny LLM
* **Multi-path extraction of document knowledge:**
  * Vector database (ChromaDB)
  * RESTful Search Engine (Elasticsearch)
* **Expert prompt generation using document knowledge**
* **Upgrade the LLM-based diagnosis mechanism:**
  * Task Dispatching -> Concurrent Diagnosis -> Cross Review -> Report Generation
  * Synchronous Concurrency Mechanism during LLM inference
* **Support monitoring and optimization tools in multiple levels:**
  * Monitoring metrics (Prometheus)
  * Flame graph in code level
  * Diagnosis knowledge retrieval (dbmind)
  * Logical query transformations (Calcite)
  * Index optimization algorithms (for PostgreSQL)
  * Physical operator hints (for PostgreSQL)
  * Backup and Point-in-time Recovery (Pigsty)
* **Continuously updated papers and experimental reports**
This project is constantly evolving with new features.

blinkid-ios
BlinkID iOS is a mobile SDK that enables developers to easily integrate ID scanning and data extraction capabilities into their iOS applications. The SDK supports scanning and processing various types of identity documents, such as passports, driver's licenses, and ID cards. It provides accurate and fast data extraction, including personal information and document details. With BlinkID iOS, developers can enhance their apps with secure and reliable ID verification functionality, improving user experience and streamlining identity verification processes.

ai-chunking
AI Chunking is a powerful Python library for semantic document chunking and enrichment using AI. It provides intelligent document chunking capabilities with various strategies to split text while preserving semantic meaning, particularly useful for processing markdown documentation. The library offers multiple chunking strategies such as Recursive Text Splitting, Section-based Semantic Chunking, and Base Chunking. Users can configure chunk sizes, overlap, and support various text formats. The tool is easy to extend with custom chunking strategies, making it versatile for different document processing needs.
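The chunk size and overlap settings mentioned above can be illustrated with a minimal sketch. This is not the ai-chunking API: `chunk_words` and its parameters are hypothetical names, and the sketch uses a plain word-based sliding window rather than any of the library's semantic strategies.

```python
def chunk_words(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into chunks of at most `chunk_size` words.

    Consecutive chunks share `overlap` words so that context spanning a
    chunk boundary is not lost. Hypothetical helper for illustration only.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the window has reached the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    for chunk in chunk_words("a b c d e f g h i j", chunk_size=4, overlap=1):
        print(chunk)
```

A larger overlap preserves more context across boundaries at the cost of more, partly redundant, chunks.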

ExtractThinker
ExtractThinker is a library designed for extracting data from files and documents using Large Language Models (LLMs). It offers ORM-style interaction between files and LLMs, supporting multiple document loaders such as Tesseract OCR, Azure Form Recognizer, AWS Textract, and Google Document AI. Users can customize extraction using contract definitions, process documents asynchronously, handle various document formats efficiently, and split and process documents. The project is inspired by the LangChain ecosystem and focuses on Intelligent Document Processing (IDP) using LLMs to achieve high accuracy in document extraction tasks.

nous
Nous is an open-source TypeScript platform for autonomous AI agents and LLM based workflows. It aims to automate processes, support requests, review code, assist with refactorings, and more. The platform supports various integrations, multiple LLMs/services, CLI and web interface, human-in-the-loop interactions, flexible deployment options, observability with OpenTelemetry tracing, and specific agents for code editing, software engineering, and code review. It offers advanced features like reasoning/planning, memory and function call history, hierarchical task decomposition, and control-loop function calling options. Nous is designed to be a flexible platform for the TypeScript community to expand and support different use cases and integrations.

sophia
Sophia is an open-source TypeScript platform designed for autonomous AI agents and LLM based workflows. It aims to automate processes, review code, assist with refactorings, and support various integrations. The platform offers features like advanced autonomous agents, reasoning/planning inspired by Google's Self-Discover paper, memory and function call history, adaptive iterative planning, and more. Sophia supports multiple LLMs/services, CLI and web interface, human-in-the-loop interactions, flexible deployment options, observability with OpenTelemetry tracing, and specific agents for code editing, software engineering, and code review. It provides a flexible platform for the TypeScript community to expand and support various use cases and integrations.

serverless-pdf-chat
The serverless-pdf-chat repository contains a sample application that allows users to ask natural language questions of any PDF document they upload. It leverages serverless services like Amazon Bedrock, AWS Lambda, and Amazon DynamoDB to provide text generation and analysis capabilities. The application architecture involves uploading a PDF document to an S3 bucket, extracting metadata, converting text to vectors, and using LangChain to search for information related to user prompts. The application is not intended for production use and serves as a demonstration and educational tool.
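The "convert text to vectors, then search" step can be sketched in a few lines. This is an illustration of the general idea only, not the repository's code: the real application relies on Amazon Bedrock and LangChain, whereas the hypothetical `embed`, `cosine`, and `top_k` helpers below use simple word counts as a stand-in for an embedding model.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(question: str, passages: list[str], k: int = 1) -> list[str]:
    """Return the k passages most similar to the question."""
    q = embed(question)
    ranked = sorted(passages, key=lambda p: cosine(q, embed(p)), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    passages = [
        "invoices are due in thirty days",
        "the warranty covers two years",
        "shipping takes five days",
    ]
    print(top_k("when are invoices due", passages))
```

In a retrieval-augmented setup, the passages returned by a step like `top_k` would be passed to a language model together with the user's question.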

OpenAdapt
OpenAdapt is an open-source software adapter between Large Multimodal Models (LMMs) and traditional desktop and web Graphical User Interfaces (GUIs). It aims to automate repetitive GUI workflows by leveraging the power of LMMs. OpenAdapt records user input and screenshots, converts them into tokenized format, and generates synthetic input via transformer model completions. It also analyzes recordings to generate task trees and replay synthetic input to complete tasks. OpenAdapt is model agnostic and generates prompts automatically by learning from human demonstration, ensuring that agents are grounded in existing processes and mitigating hallucinations. It works with all types of desktop GUIs, including virtualized and web, and is open source under the MIT license.

TypeGPT
TypeGPT is a Python application that enables users to interact with ChatGPT or Google Gemini from any text field in their operating system using keyboard shortcuts. It provides global accessibility, keyboard shortcuts for communication, and clipboard integration for larger text inputs. Users need to have Python 3.x installed along with specific packages and API keys from OpenAI for ChatGPT access. The tool allows users to run the program normally or in the background, manage processes, and stop the program. Users can use keyboard shortcuts like `/ask`, `/see`, `/stop`, `/chatgpt`, `/gemini`, `/check`, and `Shift + Cmd + Enter` to interact with the application in any text field. Customization options are available by modifying files like `keys.txt` and `system_prompt.txt`. Contributions are welcome, and future plans include adding support for other APIs and a user-friendly GUI.

GenAIExamples
This project provides a collection of Generative AI (GenAI) and Retrieval-Augmented Generation (RAG) examples, such as a question-answering chatbot (ChatQnA), code generation (CodeGen), and document summarization (DocSum).

panda-etl
PandaETL is an open-source, no-code ETL tool designed to extract and parse data from various document types including PDFs, emails, websites, audio files, and more. With an intuitive interface and powerful backend, PandaETL simplifies the process of data extraction and transformation, making it accessible to users without programming skills.

unstract
Unstract is a no-code platform that enables users to launch APIs and ETL pipelines to structure unstructured documents. With Unstract, users can go beyond co-pilots by enabling machine-to-machine automation. Unstract's Prompt Studio provides a simple, no-code approach to creating prompts for LLMs, vector databases, embedding models, and text extractors. Users can then configure Prompt Studio projects as API deployments or ETL pipelines to automate critical business processes that involve complex documents. Unstract supports a wide range of LLM providers, vector databases, embeddings, text extractors, ETL sources, and ETL destinations, providing users with the flexibility to choose the best tools for their needs.

learn-applied-generative-ai-fundamentals
This repository is part of the Certified Cloud Native Applied Generative AI Engineer program, focusing on Applied Generative AI Fundamentals. It covers prompt engineering, developing custom GPTs, and Multi AI Agent Systems. The course helps in building a strong understanding of generative AI, applying Large Language Models (LLMs) and diffusion models practically. It introduces principles of prompt engineering to work efficiently with AI, creating custom AI models and GPTs using OpenAI, Azure, and Google technologies. It also utilizes open source libraries like LangChain, CrewAI, and LangGraph to automate tasks and business processes.

EAGLE
Eagle is a family of Vision-Centric High-Resolution Multimodal LLMs that enhance multimodal LLM perception using a mix of vision encoders and various input resolutions. The model features a channel-concatenation-based fusion for vision experts with different architectures and knowledge, supporting input resolutions above 1K. It excels in resolution-sensitive tasks like optical character recognition and document understanding.

FinalRip
FinalRip is a distributed video processing tool based on FFmpeg and VapourSynth. It cuts the original video into multiple clips, processes each clip in parallel, and merges them into the final video. Users can deploy the system in a distributed way, configure settings via environment variables or remote config files, and develop/test scripts in the vs-playground environment. It supports Nvidia GPU, AMD GPU with ROCm support, and provides a dashboard for selecting compatible scripts to process videos.
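The cut, process-in-parallel, and merge pattern described above can be sketched as follows. This is not FinalRip's implementation and does not call FFmpeg or VapourSynth: a list of numbers stands in for video frames, and `split_into_clips`, `process_clip`, and `process_video` are hypothetical names used only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_clips(frames: list[int], clip_len: int) -> list[list[int]]:
    """Cut the frame sequence into consecutive clips of up to clip_len frames."""
    return [frames[i:i + clip_len] for i in range(0, len(frames), clip_len)]


def process_clip(clip: list[int]) -> list[int]:
    """Stand-in for a per-clip filter; here it simply doubles each value."""
    return [frame * 2 for frame in clip]


def process_video(frames: list[int], clip_len: int = 3, workers: int = 4) -> list[int]:
    """Split, process clips concurrently, then merge in the original order."""
    clips = split_into_clips(frames, clip_len)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map returns results in input order, so merging is a
        # simple concatenation even if clips finish out of order.
        processed = list(pool.map(process_clip, clips))
    return [frame for clip in processed for frame in clip]


if __name__ == "__main__":
    print(process_video(list(range(7))))
```

The key design point is that each clip is independent, so workers need no coordination beyond preserving clip order at merge time.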

vertex-ai-mlops
Vertex AI is a platform for end-to-end model development. It consists of core components that make MLOps processes possible for design patterns of all types.
20 - OpenAI GPTs

ChatUML
Expert in all UML diagrams: Requirements in, Diagram Out – your precise solution for every specification.

Conveyance AI
ConveyanceAI streamlines property conveyancing, offering automated legal document handling, compliance guidance, and efficient workflow management for UK and European lawyers and conveyancers.

Policy Communication Advisor
Communicates policy processes and changes effectively within the organization.

AI Legal Mediator
Facilitates mediation processes using AI to suggest fair compromises in legal disputes, informed by similar cases and outcomes.

Operations Department Assistant
An Operations Department Assistant aids the operations team by handling administrative tasks, process documentation, and data analysis, helping to streamline and optimize various operational processes within an organization.

Legal Insight Analyst "LIA"
Legal Insight Analyst-Lia is an advanced AI companion designed to empower legal professionals with unparalleled support in the world of law. By leveraging cutting-edge technologies, Lia simplifies and enhances legal research, analysis, and decision-making processes.

Form Filler
Expert in populating Word .docx forms with data from other documents, prioritizing accuracy and formal communication.

DocFlow
DocFlow is designed to assist in the creation and management of business-related documents. It leverages its knowledge base and language processing capabilities to provide detailed guidance, draft documents, and offer insights specific to business ventures.

Visionary Scholar
Assistant that helps researchers with thesis research and the documentation process.

PlanGPT
Formal, professional urban planning expert, skilled in document analysis and feedback interpretation.