Best AI tools for< Store Documents >
20 - AI tool Sites
Croduct PDF
Croduct PDF is an AI-powered tool that transforms static PDF documents into interactive conversations. Users can upload their PDFs and interact with a chatbot to ask questions, summarize content, and receive quick answers. The tool enhances productivity by providing lightning-fast responses, personalized experiences, and secure cloud storage for documents. With an intuitive viewer, users can engage with their PDFs on any device seamlessly.
Akeeva
Akeeva is a user-friendly online platform that simplifies end-of-life planning. It helps individuals organize and manage their affairs, such as wills, funeral arrangements, and legacy planning, in a convenient and efficient manner. Akeeva aims to alleviate the stress and burden associated with end-of-life decisions by providing a comprehensive toolkit for users to plan ahead and ensure their wishes are carried out.
Dart
Dart is the ultimate AI project management tool designed to save time and streamline project management processes. It offers features like task execution, subtask generation, project planning, duplicate detection, roadmaps, calendar views, document storage, meeting notes, integrations with workplace tools, and more. Dart is used by teams across various roles like engineering, product management, leadership, design, and sales to enhance productivity and efficiency in task management. The application leverages AI capabilities to automate tasks, generate reports, and assist in project ideation and execution.
SPUN
SPUN is a platform that helps foreigners relocate to or extend their stay in Indonesia. It provides a range of services, including visa and permit assistance, travel insurance, and accommodation options. SPUN is powered by a network of virtual assistants and AI, which helps to guide users through the relocation process and answer their questions.
DocAI
DocAI is an AI-driven document solution that transforms documents into interactive conversations. It streamlines document workflows, enhances productivity, and offers blazing fast responses to inquiries. The platform features an intelligent chatbot, interactive PDF viewer, affordable pricing, multilingual support, and advanced AI capabilities. DocAI is trusted by industry leaders for its transformative impact on document handling and analytics.
GPTs.Fan
GPTs.Fan is a comprehensive platform dedicated to GPT designers, providing a wealth of resources and support. It offers a vibrant community forum where designers can connect, share knowledge, and collaborate on projects. Additionally, GPTs.Fan features a curated collection of GPT-related tools, tutorials, and articles, empowering designers to stay up-to-date with the latest advancements in the field.
Copalot AI Copilot
Copalot is an AI copilot application designed to provide AI chat and visual video support for small businesses. It helps in reducing customer interaction and support costs by offering AI chat and video FAQ bots that can be embedded in websites or linked to products. Copalot allows users to create custom ChatGPT and FAQs based on their own content, supporting multiple file formats and webpages. The application is user-friendly and multilingual, catering to a global customer base.
Scanner Go
Scanner Go is a free PDF tool that offers easy and high-quality scanning capabilities. It allows users to quickly scan various types of documents, images, and books, and convert them to PDF format. The tool features powerful OCR technology for extracting text from PDFs and images, as well as options for managing, editing, printing, and sharing documents. Users can also access their scanned documents from any device and store them securely in the cloud. Scanner Go simplifies the process of digitizing documents and offers a range of popular tools for PDF conversion and optimization.
Zefort
Zefort is an AI-powered contract management solution that offers a zero-effort approach to managing contracts. It allows users to create, sign, and store contracts with ease, providing features like eSignatures, automated reminders, and secure storage. Zefort is designed to streamline contract processes for legal teams, procurement, HR teams, sales teams, and company administration. The platform integrates advanced AI technology to enhance contract management efficiency and accuracy, catering to organizations of all sizes. With bank-level security measures and a user-friendly interface, Zefort ensures a seamless contract management experience.
Loata
Loata is an AI-powered platform that serves as a learning orchestrator for adaptive text analyses. It allows users to store their notes and documents in the cloud, which are then ingested and transformed into knowledge bases. The platform features smart AI agents powered by LLMs to provide intelligent answers based on the content. With end-to-end encryption and controlled ingestion, Loata ensures the security and privacy of user data. Users can choose from different subscription plans to access varying levels of storage and query capacity, making it suitable for individuals and professionals alike.
Speak4Me
Speak4Me is a text-to-speech application that converts any text file, including PDFs and websites, into audible content. It enables users to listen to their documents or school materials anytime, anywhere. With features like scanning physical or digital text, reading web pages aloud, and a new ChatWithMe function, Speak4Me aims to enhance reading experiences and improve focus for individuals with reading issues. The application is trusted by over 15,000 people on the App Store and offers a free version for schools, making education more accessible for everyone.
H2O.ai
H2O.ai is an AI platform that offers a convergence of the world's best predictive and generative AI solutions. It provides end-to-end GenAI platform for air-gapped, on-premises, or cloud VPC deployments, allowing users to own every part of the stack. With features like h2oGPTe, h2oGPT, H2O Danube3, H2O Eval Studio, and GenAI App Store, H2O.ai empowers users to customize and deploy AI models, assess performance, develop safe applications, and more. The platform is known for democratizing AI with automated machine learning and open-source distributed machine learning.
Google Drive
Google Drive is a cloud-based storage and file-sharing service that allows users to store, share, and collaborate on files from any device. It is part of the Google Workspace suite of productivity tools, which also includes Gmail, Docs, Sheets, and Slides. Drive is designed to be easy to use and secure, and it offers a variety of features that make it a valuable tool for businesses and individuals alike.
SvectorDB
SvectorDB is a vector database built from the ground up for serverless applications. It is designed to be highly scalable, performant, and easy to use. SvectorDB can be used for a variety of applications, including recommendation engines, document search, and image search.
Knit
Knit is an AI playground for prompt designers. It provides professional prompt editors with various models, including GPT-4-turbo/vision, Claude-3, Gemini-pro, and more. Users can store, edit, and run their prompts in Knit. It also offers project management features, allowing users to organize prompts with projects, set up projects for different use cases, and collaborate with team members. Knit supports different kinds of models, including OpenAI, Claude, Azure OpenAI, and plans to support more in the future. It allows users to control API parameters, export code instantly, and provides security features such as encryption and version control.
Linguix
Linguix is a GPT-4 writing and productivity copilot for teams. It uses artificial intelligence to improve grammar, spelling, and style, and to help users write more clearly and effectively. Linguix is available as a browser extension and a web editor, and it can be used with a variety of online platforms, including Gmail, Google Docs, and OpenAI. Linguix is trusted by over 310,000 users, including Google Chrome Store Featured App, Edge Store Featured App, Product Hunt Top #1 writing assistant, G2 reviews website Top proofreading tool, and Linguix for Figma Featured App.
Opal
Opal is an AI-powered study tool designed to supercharge studying for students. It offers features such as AI-powered notes, flashcards, quizzes, and advanced performance tracking. Opal allows users to upload various document types, provides multilingual support, and ensures bank-level security for stored documents. The tool is built to help students summarize, review, and learn efficiently with the assistance of AI technology.
O.Translator
O.Translator is an online artificial intelligence translation website that offers unparalleled translation accuracy with AI while perfectly preserving the original format of documents. It is powered by a sophisticated AI engine for context-aware translations and is compatible with GPT-3.5, GPT-4, Gemini, and GeminiFlash. The tool provides post-editing and glossary features for precise noun translations and effortless content adjustments. Users can enjoy free previews before paying for translations, with affordable rates of $1 for up to 20,000 words. Privacy is a top priority, with all translated documents stored securely. O.Translator supports a wide range of document formats and over 60 languages worldwide, including the translation of scanned PDFs.
AI Writer
This website provides a suite of AI-powered writing tools, including an AI writer, text summarizer, story writer, and headline generator. These tools can help you generate high-quality content quickly and easily.
Bibit AI
Bibit AI is a real estate marketing AI designed to enhance the efficiency and effectiveness of real estate marketing and sales. It can help create listings, descriptions, and property content, and offers a host of other features. Bibit AI is the world's first AI for Real Estate. We are transforming the real estate industry by boosting efficiency and simplifying tasks like listing creation and content generation.
20 - Open Source AI Tools
ragtacts
Ragtacts is a Clojure library that allows users to easily interact with Large Language Models (LLMs) such as OpenAI's GPT-4. Users can ask questions to LLMs, create question templates, call Clojure functions in natural language, and utilize vector databases for more accurate answers. Ragtacts also supports RAG (Retrieval-Augmented Generation) method for enhancing LLM output by incorporating external data. Users can use Ragtacts as a CLI tool, API server, or through a RAG Playground for interactive querying.
llms
The 'llms' repository is a comprehensive guide on Large Language Models (LLMs), covering topics such as language modeling, applications of LLMs, statistical language modeling, neural language models, conditional language models, evaluation methods, transformer-based language models, practical LLMs like GPT and BERT, prompt engineering, fine-tuning LLMs, retrieval augmented generation, AI agents, and LLMs for computer vision. The repository provides detailed explanations, examples, and tools for working with LLMs.
chromem-go
chromem-go is an embeddable vector database for Go with a Chroma-like interface and zero third-party dependencies. It enables retrieval augmented generation (RAG) and similar embeddings-based features in Go apps without the need for a separate database. The focus is on simplicity and performance for common use cases, allowing querying of documents with minimal memory allocations. The project is in beta and may introduce breaking changes before v1.0.0.
RAGMeUp
RAG Me Up is a generic framework that enables users to perform Retrieve and Generate (RAG) on their own dataset easily. It consists of a small server and UIs for communication. Best run on GPU with 16GB vRAM. Users can combine RAG with fine-tuning using LLaMa2Lang repository. The tool allows configuration for LLM, data, LLM parameters, prompt, and document splitting. Funding is sought to democratize AI and advance its applications.
serverless-chat-langchainjs
This sample shows how to build a serverless chat experience with Retrieval-Augmented Generation using LangChain.js and Azure. The application is hosted on Azure Static Web Apps and Azure Functions, with Azure Cosmos DB for MongoDB vCore as the vector database. You can use it as a starting point for building more complex AI applications.
langchain_dart
LangChain.dart is a Dart port of the popular LangChain Python framework created by Harrison Chase. LangChain provides a set of ready-to-use components for working with language models and a standard interface for chaining them together to formulate more advanced use cases (e.g. chatbots, Q&A with RAG, agents, summarization, extraction, etc.). The components can be grouped into a few core modules: * **Model I/O:** LangChain offers a unified API for interacting with various LLM providers (e.g. OpenAI, Google, Mistral, Ollama, etc.), allowing developers to switch between them with ease. Additionally, it provides tools for managing model inputs (prompt templates and example selectors) and parsing the resulting model outputs (output parsers). * **Retrieval:** assists in loading user data (via document loaders), transforming it (with text splitters), extracting its meaning (using embedding models), storing (in vector stores) and retrieving it (through retrievers) so that it can be used to ground the model's responses (i.e. Retrieval-Augmented Generation or RAG). * **Agents:** "bots" that leverage LLMs to make informed decisions about which available tools (such as web search, calculators, database lookup, etc.) to use to accomplish the designated task. The different components can be composed together using the LangChain Expression Language (LCEL).
vectorflow
VectorFlow is an open source, high throughput, fault tolerant vector embedding pipeline. It provides a simple API endpoint for ingesting large volumes of raw data, processing, and storing or returning the vectors quickly and reliably. The tool supports text-based files like TXT, PDF, HTML, and DOCX, and can be run locally with Kubernetes in production. VectorFlow offers functionalities like embedding documents, running chunking schemas, custom chunking, and integrating with vector databases like Pinecone, Qdrant, and Weaviate. It enforces a standardized schema for uploading data to a vector store and supports features like raw embeddings webhook, chunk validation webhook, S3 endpoint, and telemetry. The tool can be used with the Python client and provides detailed instructions for running and testing the functionalities.
catalyst
Catalyst is a C# Natural Language Processing library designed for speed, inspired by spaCy's design. It provides pre-trained models, support for training word and document embeddings, and flexible entity recognition models. The library is fast, modern, and pure-C#, supporting .NET standard 2.0. It is cross-platform, running on Windows, Linux, macOS, and ARM. Catalyst offers non-destructive tokenization, named entity recognition, part-of-speech tagging, language detection, and efficient binary serialization. It includes pre-built models for language packages and lemmatization. Users can store and load models using streams. Getting started with Catalyst involves installing its NuGet Package and setting the storage to use the online repository. The library supports lazy loading of models from disk or online. Users can take advantage of C# lazy evaluation and native multi-threading support to process documents in parallel. Training a new FastText word2vec embedding model is straightforward, and Catalyst also provides algorithms for fast embedding search and dimensionality reduction.
lloco
LLoCO is a technique that learns documents offline through context compression and in-domain parameter-efficient finetuning using LoRA, which enables LLMs to handle long context efficiently.
document-ai-samples
The Google Cloud Document AI Samples repository contains code samples and Community Samples demonstrating how to analyze, classify, and search documents using Google Cloud Document AI. It includes various projects showcasing different functionalities such as integrating with Google Drive, processing documents using Python, content moderation with Dialogflow CX, fraud detection, language extraction, paper summarization, tax processing pipeline, and more. The repository also provides access to test document files stored in a publicly-accessible Google Cloud Storage Bucket. Additionally, there are codelabs available for optical character recognition (OCR), form parsing, specialized processors, and managing Document AI processors. Community samples, like the PDF Annotator Sample, are also included. Contributions are welcome, and users can seek help or report issues through the repository's issues page. Please note that this repository is not an officially supported Google product and is intended for demonstrative purposes only.
conversational-agent-langchain
This repository contains a Rest-Backend for a Conversational Agent that allows embedding documents, semantic search, QA based on documents, and document processing with Large Language Models. It uses Aleph Alpha and OpenAI Large Language Models to generate responses to user queries, includes a vector database, and provides a REST API built with FastAPI. The project also features semantic search, secret management for API keys, installation instructions, and development guidelines for both backend and frontend components.
chroma
Chroma is an open-source embedding database that provides a simple, scalable, and feature-rich way to build Python or JavaScript LLM apps with memory. It offers a fully-typed, fully-tested, and fully-documented API that makes it easy to get started and scale your applications. Chroma also integrates with popular tools like LangChain and LlamaIndex, and supports a variety of embedding models, including Sentence Transformers, OpenAI embeddings, and Cohere embeddings. With Chroma, you can easily add documents to your database, query relevant documents with natural language, and compose documents into the context window of an LLM like GPT3 for additional summarization or analysis.
banks
Banks is a linguist professor tool that helps generate meaningful LLM prompts using a template language. It provides a user-friendly way to create prompts for various tasks such as blog writing, summarizing documents, lemmatizing text, and generating text using a LLM. The tool supports async operations and comes with predefined filters for data processing. Banks leverages Jinja's macro system to create prompts and interact with OpenAI API for text generation. It also offers a cache mechanism to avoid regenerating text for the same template and context.
sparrow
Sparrow is an innovative open-source solution for efficient data extraction and processing from various documents and images. It seamlessly handles forms, invoices, receipts, and other unstructured data sources. Sparrow stands out with its modular architecture, offering independent services and pipelines all optimized for robust performance. One of the critical functionalities of Sparrow - pluggable architecture. You can easily integrate and run data extraction pipelines using tools and frameworks like LlamaIndex, Haystack, or Unstructured. Sparrow enables local LLM data extraction pipelines through Ollama or Apple MLX. With Sparrow solution you get API, which helps to process and transform your data into structured output, ready to be integrated with custom workflows. Sparrow Agents - with Sparrow you can build independent LLM agents, and use API to invoke them from your system. **List of available agents:** * **llamaindex** - RAG pipeline with LlamaIndex for PDF processing * **vllamaindex** - RAG pipeline with LLamaIndex multimodal for image processing * **vprocessor** - RAG pipeline with OCR and LlamaIndex for image processing * **haystack** - RAG pipeline with Haystack for PDF processing * **fcall** - Function call pipeline * **unstructured-light** - RAG pipeline with Unstructured and LangChain, supports PDF and image processing * **unstructured** - RAG pipeline with Weaviate vector DB query, Unstructured and LangChain, supports PDF and image processing * **instructor** - RAG pipeline with Unstructured and Instructor libraries, supports PDF and image processing. Works great for JSON response generation
memfree
MemFree is an open-source hybrid AI search engine that allows users to simultaneously search their personal knowledge base (bookmarks, notes, documents, etc.) and the Internet. It features a self-hosted super fast serverless vector database, local embedding and rerank service, one-click Chrome bookmarks index, and full code open source. Users can contribute by opening issues for bugs or making pull requests for new features or improvements.
LARS
LARS is an application that enables users to run Large Language Models (LLMs) locally on their devices, upload their own documents, and engage in conversations where the LLM grounds its responses with the uploaded content. The application focuses on Retrieval Augmented Generation (RAG) to increase accuracy and reduce AI-generated inaccuracies. LARS provides advanced citations, supports various file formats, allows follow-up questions, provides full chat history, and offers customization options for LLM settings. Users can force enable or disable RAG, change system prompts, and tweak advanced LLM settings. The application also supports GPU-accelerated inferencing, multiple embedding models, and text extraction methods. LARS is open-source and aims to be the ultimate RAG-centric LLM application.
h2ogpt
h2oGPT is an Apache V2 open-source project that allows users to query and summarize documents or chat with local private GPT LLMs. It features a private offline database of any documents (PDFs, Excel, Word, Images, Video Frames, Youtube, Audio, Code, Text, MarkDown, etc.), a persistent database (Chroma, Weaviate, or in-memory FAISS) using accurate embeddings (instructor-large, all-MiniLM-L6-v2, etc.), and efficient use of context using instruct-tuned LLMs (no need for LangChain's few-shot approach). h2oGPT also offers parallel summarization and extraction, reaching an output of 80 tokens per second with the 13B LLaMa2 model, HYDE (Hypothetical Document Embeddings) for enhanced retrieval based upon LLM responses, a variety of models supported (LLaMa2, Mistral, Falcon, Vicuna, WizardLM. With AutoGPTQ, 4-bit/8-bit, LORA, etc.), GPU support from HF and LLaMa.cpp GGML models, and CPU support using HF, LLaMa.cpp, and GPT4ALL models. Additionally, h2oGPT provides Attention Sinks for arbitrarily long generation (LLaMa-2, Mistral, MPT, Pythia, Falcon, etc.), a UI or CLI with streaming of all models, the ability to upload and view documents through the UI (control multiple collaborative or personal collections), Vision Models LLaVa, Claude-3, Gemini-Pro-Vision, GPT-4-Vision, Image Generation Stable Diffusion (sdxl-turbo, sdxl) and PlaygroundAI (playv2), Voice STT using Whisper with streaming audio conversion, Voice TTS using MIT-Licensed Microsoft Speech T5 with multiple voices and Streaming audio conversion, Voice TTS using MPL2-Licensed TTS including Voice Cloning and Streaming audio conversion, AI Assistant Voice Control Mode for hands-free control of h2oGPT chat, Bake-off UI mode against many models at the same time, Easy Download of model artifacts and control over models like LLaMa.cpp through the UI, Authentication in the UI by user/password via Native or Google OAuth, State Preservation in the UI by user/password, Linux, Docker, macOS, and Windows support, Easy Windows Installer for Windows 10 64-bit (CPU/CUDA), Easy macOS Installer for macOS (CPU/M1/M2), Inference Servers support (oLLaMa, HF TGI server, vLLM, Gradio, ExLLaMa, Replicate, OpenAI, Azure OpenAI, Anthropic), OpenAI-compliant, Server Proxy API (h2oGPT acts as drop-in-replacement to OpenAI server), Python client API (to talk to Gradio server), JSON Mode with any model via code block extraction. Also supports MistralAI JSON mode, Claude-3 via function calling with strict Schema, OpenAI via JSON mode, and vLLM via guided_json with strict Schema, Web-Search integration with Chat and Document Q/A, Agents for Search, Document Q/A, Python Code, CSV frames (Experimental, best with OpenAI currently), Evaluate performance using reward models, and Quality maintained with over 1000 unit and integration tests taking over 4 GPU-hours.
DB-GPT
DB-GPT is a personal database administrator that can solve database problems by reading documents, using various tools, and writing analysis reports. It is currently undergoing an upgrade. **Features:** * **Online Demo:** * Import documents into the knowledge base * Utilize the knowledge base for well-founded Q&A and diagnosis analysis of abnormal alarms * Send feedbacks to refine the intermediate diagnosis results * Edit the diagnosis result * Browse all historical diagnosis results, used metrics, and detailed diagnosis processes * **Language Support:** * English (default) * Chinese (add "language: zh" in config.yaml) * **New Frontend:** * Knowledgebase + Chat Q&A + Diagnosis + Report Replay * **Extreme Speed Version for localized llms:** * 4-bit quantized LLM (reducing inference time by 1/3) * vllm for fast inference (qwen) * Tiny LLM * **Multi-path extraction of document knowledge:** * Vector database (ChromaDB) * RESTful Search Engine (Elasticsearch) * **Expert prompt generation using document knowledge** * **Upgrade the LLM-based diagnosis mechanism:** * Task Dispatching -> Concurrent Diagnosis -> Cross Review -> Report Generation * Synchronous Concurrency Mechanism during LLM inference * **Support monitoring and optimization tools in multiple levels:** * Monitoring metrics (Prometheus) * Flame graph in code level * Diagnosis knowledge retrieval (dbmind) * Logical query transformations (Calcite) * Index optimization algorithms (for PostgreSQL) * Physical operator hints (for PostgreSQL) * Backup and Point-in-time Recovery (Pigsty) * **Continuously updated papers and experimental reports** This project is constantly evolving with new features. Don't forget to star ⭐ and watch 👀 to stay up to date.
blinkid-ios
BlinkID iOS is a mobile SDK that enables developers to easily integrate ID scanning and data extraction capabilities into their iOS applications. The SDK supports scanning and processing various types of identity documents, such as passports, driver's licenses, and ID cards. It provides accurate and fast data extraction, including personal information and document details. With BlinkID iOS, developers can enhance their apps with secure and reliable ID verification functionality, improving user experience and streamlining identity verification processes.
SummaryYou
Summary You is a tool that utilizes AI to summarize YouTube videos, articles, images, and documents. Users can set the length of the summary and have the option to listen to the summaries. The tool also includes a history section, intelligent paywall detection, OLED-Dark Mode, and a user-friendly Material Design 3 style UI with dynamic color themes. It uses GPT-3.5 OpenAI/Mixtral 8x7B Groq for summarization. The backend is implemented in Python with Chaquopy, and some UI designs and codes are borrowed from Seal Material color utilities.
20 - OpenAI Gpts
GPT Store
A GPT specialized in curating, documenting, and updating GPTs on Github at https://github.com/prajwalsouza/GPT-Store
Story book
Chinese-speaking assistant for weather, translation, and sequential Little Rabbit stories over 1000 words
GAPP STORE
Welcome to GAPP Store: Chat, create, customize—your all-in-one AI app universe
GPT store
Enthusiastic assistant showcasing the latest GPT technologies with a focus on security.
HaGiPT
Regele GPT ce încearcă să 'paseze' răspunsuri precise și să 'marcheze' puncte cu inteligența sa artificială.
Creative Writing Coach
I'm eager to read your work and give you feedback to improve your skills.
三国志bot
青空文庫の吉川英治『三国志』を学習したAI:吉川 英治 https://www.aozora.gr.jp/index_pages/person1562.html#sakuhin_list_1