Best AI Tools for Store Documents
20 - AI Tool Sites
Croduct PDF
Croduct PDF is an AI-powered tool that transforms static PDF documents into interactive conversations. Users can upload their PDFs and interact with a chatbot to ask questions, summarize content, and receive quick answers. The tool enhances productivity by providing lightning-fast responses, personalized experiences, and secure cloud storage for documents. With an intuitive viewer, users can engage with their PDFs on any device seamlessly.
Dart
Dart is an AI project management software designed to streamline task execution, subtask generation, project planning, and duplicate detection. It offers features like intelligent task management, sprints, roadmaps, calendar views, document storage, and meeting notes. Dart integrates with various workplace tools and is suitable for roles in engineering, product leadership, design, and sales. Users can benefit from AI-driven features such as automatic property filling, roadmap creation, and task assignment. Dart aims to enhance team productivity, project organization, and task management efficiency through its innovative AI capabilities.
SPUN
SPUN is a platform that helps foreigners relocate to or extend their stay in Indonesia. It provides a range of services, including visa and permit assistance, travel insurance, and accommodation options. SPUN is powered by a network of virtual assistants and AI, which helps to guide users through the relocation process and answer their questions.
DocAI
DocAI is an AI-driven document solution that transforms documents into interactive conversations. It streamlines document workflows, enhances productivity, and offers blazing fast responses to inquiries. The platform features an intelligent chatbot, interactive PDF viewer, affordable pricing, multilingual support, and advanced AI capabilities. DocAI is trusted by industry leaders for its transformative impact on document handling and analytics.
Bitrix24
Bitrix24 is an all-in-one online workspace platform that offers a wide range of tools and features for businesses, including CRM, task management, online meetings, collaboration, project management, automation, HR management, and more. It provides solutions for sales, marketing, customer service, and internal communication, all in a single platform. With features like CoPilot AI assistance, mobile apps, integrations, and customizable options, Bitrix24 aims to streamline business processes and improve productivity.
GPTs.Fan
GPTs.Fan is a comprehensive platform dedicated to GPT designers, providing a wealth of resources and support. It offers a vibrant community forum where designers can connect, share knowledge, and collaborate on projects. Additionally, GPTs.Fan features a curated collection of GPT-related tools, tutorials, and articles, empowering designers to stay up-to-date with the latest advancements in the field.
Copalot AI Copilot
Copalot is an AI copilot application designed to provide AI chat and visual video support for small businesses. It helps in reducing customer interaction and support costs by offering AI chat and video FAQ bots that can be embedded in websites or linked to products. Copalot allows users to create custom ChatGPT and FAQs based on their own content, supporting multiple file formats and webpages. The application is user-friendly and multilingual, catering to a global customer base.
Bitrix24
Bitrix24 is an all-in-one online workspace for businesses, offering tools for CRM, tasks, online meetings, and more. It provides features such as sales management, team collaboration, task and project management, online document storage, website creation, online store setup, HR and automation tools, and AI-powered assistance through CoPilot. Bitrix24 aims to streamline business operations and enhance team productivity by centralizing various business functions in one platform.
Scanner Go
Scanner Go is a free PDF tool that offers easy and high-quality scanning capabilities. It allows users to quickly scan various types of documents, images, and books, and convert them to PDF format. The tool features powerful OCR technology for extracting text from PDFs and images, as well as options for managing, editing, printing, and sharing documents. Users can also access their scanned documents from any device and store them securely in the cloud. Scanner Go simplifies the process of digitizing documents and offers a range of popular tools for PDF conversion and optimization.
Zefort
Zefort is an AI-powered contract management solution that offers a zero-effort approach to managing contracts. It allows users to create, sign, and store contracts with ease, providing features like eSignatures, automated reminders, and secure storage. Zefort is designed to streamline contract processes for legal teams, procurement, HR teams, sales teams, and company administration. The platform integrates advanced AI technology to enhance contract management efficiency and accuracy, catering to organizations of all sizes. With bank-level security measures and a user-friendly interface, Zefort ensures a seamless contract management experience.
Loata
Loata is an AI-powered platform that serves as a learning orchestrator for adaptive text analyses. It allows users to store their notes and documents in the cloud, which are then ingested and transformed into knowledge bases. The platform features smart AI agents powered by LLMs to provide intelligent answers based on the content. With end-to-end encryption and controlled ingestion, Loata ensures the security and privacy of user data. Users can choose from different subscription plans to access varying levels of storage and query capacity, making it suitable for individuals and professionals alike.
Speak4Me
Speak4Me is a text-to-speech application that converts any text file, including PDFs and websites, into audible content. It enables users to listen to their documents or school materials anytime, anywhere. With features like scanning physical or digital text, reading web pages aloud, and a new ChatWithMe function, Speak4Me aims to enhance reading experiences and improve focus for individuals with reading issues. The application is trusted by over 15,000 people on the App Store and offers a free version for schools, making education more accessible for everyone.
Frequently by Ecomtent
Frequently by Ecomtent is an AI-powered Ecommerce Automation tool that provides fast, accurate, and comprehensive answers to questions about selling on Amazon, Ebay, or any other ecommerce store. It offers features such as AI product image generation, infographics creation, optimized copywriting, and more. The tool is built with over 100 proprietary SOPs and documents containing expert knowledge and experiences from experienced sellers and former Amazon employees. Frequently aims to deliver ongoing updates and enhancements to ensure the best outcomes for businesses.
Google Drive
Google Drive is a cloud-based storage and file-sharing service that allows users to store, share, and collaborate on files from any device. It is part of the Google Workspace suite of productivity tools, which also includes Gmail, Docs, Sheets, and Slides. Drive is designed to be easy to use and secure, and it offers a variety of features that make it a valuable tool for businesses and individuals alike.
ThinkDiffusion
ThinkDiffusion is a cloud-based platform that offers Stable Diffusion services. It provides businesses and educational institutions with a reliable solution for managing and sharing information. With ThinkDiffusion, users can easily create, store, and distribute content in a secure and efficient manner. The platform is designed to streamline collaboration and enhance productivity by facilitating seamless communication and data sharing.
SvectorDB
SvectorDB is a vector database built from the ground up for serverless applications. It is designed to be highly scalable, performant, and easy to use. SvectorDB can be used for a variety of applications, including recommendation engines, document search, and image search.
The Drive AI
The Drive AI is an AI-powered storage and collaborative workspace platform designed for students, educators, researchers, and writers. It allows users to store class materials, ask questions, write content, and chat with friends and AI. The platform offers insights and analysis of class resources, collaborative document editing, chat channels, and auto-complete summarization features. Trusted by thousands of students from top universities, The Drive AI enhances productivity and efficiency in academic and professional settings.
Knit
Knit is an AI playground for prompt designers. It provides professional prompt editors with various models, including GPT-4-turbo/vision, Claude-3, Gemini-pro, and more. Users can store, edit, and run their prompts in Knit. It also offers project management features, allowing users to organize prompts with projects, set up projects for different use cases, and collaborate with team members. Knit supports several model providers, including OpenAI, Claude, and Azure OpenAI, and plans to support more in the future. It allows users to control API parameters and export code instantly, and it provides security features such as encryption and version control.
Linguix
Linguix is a GPT-4 writing and productivity copilot for teams. It uses artificial intelligence to improve grammar, spelling, and style, and to help users write more clearly and effectively. Linguix is available as a browser extension and a web editor, and it can be used with a variety of online platforms, including Gmail, Google Docs, and OpenAI. Linguix is trusted by over 310,000 users, and its recognitions include Google Chrome Store Featured App, Edge Store Featured App, Product Hunt Top #1 writing assistant, G2 reviews website Top proofreading tool, and Linguix for Figma Featured App.
Opal
Opal is an AI-powered study tool designed to supercharge studying for students. It offers features such as AI-powered notes, flashcards, quizzes, and advanced performance tracking. Opal allows users to upload various document types, provides multilingual support, and ensures bank-level security for stored documents. The tool is built to help students summarize, review, and learn efficiently with the assistance of AI technology.
20 - Open Source AI Tools
ragtacts
Ragtacts is a Clojure library that allows users to easily interact with Large Language Models (LLMs) such as OpenAI's GPT-4. Users can ask LLMs questions, create question templates, call Clojure functions in natural language, and utilize vector databases for more accurate answers. Ragtacts also supports the Retrieval-Augmented Generation (RAG) method for enhancing LLM output by incorporating external data. Users can use Ragtacts as a CLI tool, as an API server, or through a RAG Playground for interactive querying.
llms
The 'llms' repository is a comprehensive guide on Large Language Models (LLMs), covering topics such as language modeling, applications of LLMs, statistical language modeling, neural language models, conditional language models, evaluation methods, transformer-based language models, practical LLMs like GPT and BERT, prompt engineering, fine-tuning LLMs, retrieval augmented generation, AI agents, and LLMs for computer vision. The repository provides detailed explanations, examples, and tools for working with LLMs.
chromem-go
chromem-go is an embeddable vector database for Go with a Chroma-like interface and zero third-party dependencies. It enables retrieval augmented generation (RAG) and similar embeddings-based features in Go apps without the need for a separate database. The focus is on simplicity and performance for common use cases, allowing querying of documents with minimal memory allocations. The project is in beta and may introduce breaking changes before v1.0.0.
sample-apps
Vespa is an open-source search and AI engine that provides a unified platform for building and deploying search and AI applications. Vespa sample applications showcase various use cases and features of Vespa, including basic search, recommendation, semantic search, image search, text ranking, e-commerce search, question answering, search-as-you-type, and ML inference serving.
RAGMeUp
RAG Me Up is a generic framework that enables users to perform Retrieval-Augmented Generation (RAG) on their own dataset easily. It consists of a small server and UIs for communication, and it runs best on a GPU with 16GB of VRAM. Users can combine RAG with fine-tuning using the LLaMa2Lang repository. The tool allows configuration of the LLM, data, LLM parameters, prompt, and document splitting. Funding is sought to democratize AI and advance its applications.
serverless-chat-langchainjs
This sample shows how to build a serverless chat experience with Retrieval-Augmented Generation using LangChain.js and Azure. The application is hosted on Azure Static Web Apps and Azure Functions, with Azure Cosmos DB for MongoDB vCore as the vector database. You can use it as a starting point for building more complex AI applications.
langchain_dart
LangChain.dart is a Dart port of the popular LangChain Python framework created by Harrison Chase. LangChain provides a set of ready-to-use components for working with language models and a standard interface for chaining them together to formulate more advanced use cases (e.g. chatbots, Q&A with RAG, agents, summarization, extraction, etc.). The components can be grouped into a few core modules:
* **Model I/O:** LangChain offers a unified API for interacting with various LLM providers (e.g. OpenAI, Google, Mistral, Ollama, etc.), allowing developers to switch between them with ease. Additionally, it provides tools for managing model inputs (prompt templates and example selectors) and parsing the resulting model outputs (output parsers).
* **Retrieval:** assists in loading user data (via document loaders), transforming it (with text splitters), extracting its meaning (using embedding models), storing (in vector stores) and retrieving it (through retrievers) so that it can be used to ground the model's responses (i.e. Retrieval-Augmented Generation or RAG).
* **Agents:** "bots" that leverage LLMs to make informed decisions about which available tools (such as web search, calculators, database lookup, etc.) to use to accomplish the designated task.
The different components can be composed together using the LangChain Expression Language (LCEL).
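The retrieval flow described in the LangChain entry above (split, embed, store, retrieve, then ground a prompt) can be illustrated with a minimal, library-free Python sketch. It does not use the LangChain or LangChain.dart APIs. Every name here (`split_text`, `embed`, `ToyVectorStore`, `build_prompt`) is hypothetical, and the bag-of-words "embedding" is only a stand-in for a real embedding model.

```python
import math
import re
from collections import Counter


def split_text(text, chunk_size=8):
    """Toy text splitter: chunks of at most chunk_size whitespace-separated words."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]


def embed(text):
    """Toy embedding (assumption): word counts instead of a learned vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorStore:
    """In-memory store of (chunk, vector) pairs with brute-force retrieval."""

    def __init__(self):
        self.entries = []

    def add(self, chunks):
        for chunk in chunks:
            self.entries.append((chunk, embed(chunk)))

    def retrieve(self, query, k=1):
        query_vec = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(query_vec, e[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, store, k=1):
    """Ground the question in the k most similar stored chunks."""
    context = "\n".join(store.retrieve(question, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


if __name__ == "__main__":
    store = ToyVectorStore()
    text = ("Invoices are stored in the finance folder. "
            "Contracts are stored in the legal folder.")
    store.add(split_text(text, chunk_size=7))
    print(build_prompt("Where are contracts stored?", store))
```

In a real application the prompt returned by `build_prompt` would be sent to a language model, and the brute-force similarity scan would typically be replaced by a vector database.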
vectorflow
VectorFlow is an open-source, high-throughput, fault-tolerant vector embedding pipeline. It provides a simple API endpoint for ingesting large volumes of raw data, processing, and storing or returning the vectors quickly and reliably. The tool supports text-based files like TXT, PDF, HTML, and DOCX, and can be run locally or with Kubernetes in production. VectorFlow offers functionalities like embedding documents, running chunking schemas, custom chunking, and integrating with vector databases like Pinecone, Qdrant, and Weaviate. It enforces a standardized schema for uploading data to a vector store and supports features like a raw embeddings webhook, a chunk validation webhook, an S3 endpoint, and telemetry. The tool can be used with the Python client and provides detailed instructions for running and testing the functionalities.
catalyst
Catalyst is a C# Natural Language Processing library designed for speed, inspired by spaCy's design. It provides pre-trained models, support for training word and document embeddings, and flexible entity recognition models. The library is fast, modern, and pure-C#, supporting .NET standard 2.0. It is cross-platform, running on Windows, Linux, macOS, and ARM. Catalyst offers non-destructive tokenization, named entity recognition, part-of-speech tagging, language detection, and efficient binary serialization. It includes pre-built models for language packages and lemmatization. Users can store and load models using streams. Getting started with Catalyst involves installing its NuGet Package and setting the storage to use the online repository. The library supports lazy loading of models from disk or online. Users can take advantage of C# lazy evaluation and native multi-threading support to process documents in parallel. Training a new FastText word2vec embedding model is straightforward, and Catalyst also provides algorithms for fast embedding search and dimensionality reduction.
lloco
LLoCO is a technique that learns documents offline through context compression and in-domain parameter-efficient finetuning using LoRA, which enables LLMs to handle long context efficiently.
document-ai-samples
The Google Cloud Document AI Samples repository contains code samples and Community Samples demonstrating how to analyze, classify, and search documents using Google Cloud Document AI. It includes various projects showcasing different functionalities such as integrating with Google Drive, processing documents using Python, content moderation with Dialogflow CX, fraud detection, language extraction, paper summarization, tax processing pipeline, and more. The repository also provides access to test document files stored in a publicly-accessible Google Cloud Storage Bucket. Additionally, there are codelabs available for optical character recognition (OCR), form parsing, specialized processors, and managing Document AI processors. Community samples, like the PDF Annotator Sample, are also included. Contributions are welcome, and users can seek help or report issues through the repository's issues page. Please note that this repository is not an officially supported Google product and is intended for demonstrative purposes only.
conversational-agent-langchain
This repository contains a Rest-Backend for a Conversational Agent that allows embedding documents, semantic search, QA based on documents, and document processing with Large Language Models. It uses Aleph Alpha and OpenAI Large Language Models to generate responses to user queries, includes a vector database, and provides a REST API built with FastAPI. The project also features semantic search, secret management for API keys, installation instructions, and development guidelines for both backend and frontend components.
chroma
Chroma is an open-source embedding database that provides a simple, scalable, and feature-rich way to build Python or JavaScript LLM apps with memory. It offers a fully-typed, fully-tested, and fully-documented API that makes it easy to get started and scale your applications. Chroma also integrates with popular tools like LangChain and LlamaIndex, and supports a variety of embedding models, including Sentence Transformers, OpenAI embeddings, and Cohere embeddings. With Chroma, you can easily add documents to your database, query relevant documents with natural language, and compose documents into the context window of an LLM like GPT3 for additional summarization or analysis.
paperless-ai
Paperless-AI is an automated document analyzer tool designed for Paperless-ngx users. It utilizes the OpenAI API and Ollama (Mistral, llama, phi 3, gemma 2) to automatically scan, analyze, and tag documents. The tool offers features such as automatic document scanning, AI-powered document analysis, automatic title and tag assignment, manual mode for analyzing documents, easy setup through a web interface, document processing dashboard, error handling, and Docker support. Users can configure the tool through a web interface and access a debug interface for monitoring and troubleshooting. Paperless-AI aims to streamline document organization and analysis processes for users with access to Paperless-ngx and AI capabilities.
text-extract-api
The text-extract-api is a powerful tool that allows users to convert images, PDFs, or Office documents to Markdown text or JSON structured documents with high accuracy. It is built using FastAPI and utilizes Celery for asynchronous task processing, with Redis for caching OCR results. The tool provides features such as PDF/Office to Markdown and JSON conversion, improving OCR results with LLama, removing Personally Identifiable Information from documents, distributed queue processing, caching using Redis, switchable storage strategies, and a CLI tool for task management. Users can run the tool locally or on cloud services, with support for GPU processing. The tool also offers an online demo for testing purposes.
banks
Banks is a "linguist professor" tool that helps generate meaningful LLM prompts using a template language. It provides a user-friendly way to create prompts for various tasks such as blog writing, summarizing documents, lemmatizing text, and generating text using an LLM. The tool supports async operations and comes with predefined filters for data processing. Banks leverages Jinja's macro system to create prompts and interacts with the OpenAI API for text generation. It also offers a cache mechanism to avoid regenerating text for the same template and context.
ExtractThinker
ExtractThinker is a library designed for extracting data from files and documents using Large Language Models (LLMs). It offers ORM-style interaction between files and LLMs, supporting multiple document loaders such as Tesseract OCR, Azure Form Recognizer, AWS Textract, and Google Document AI. Users can customize extraction using contract definitions, process documents asynchronously, handle various document formats efficiently, and split and process documents. The project is inspired by the LangChain ecosystem and focuses on Intelligent Document Processing (IDP) using LLMs to achieve high accuracy in document extraction tasks.
memfree
MemFree is an open-source hybrid AI search engine that allows users to simultaneously search their personal knowledge base (bookmarks, notes, documents, etc.) and the Internet. It features a self-hosted, fast serverless vector database, a local embedding and rerank service, one-click Chrome bookmark indexing, and fully open-source code. Users can contribute by opening issues for bugs or making pull requests for new features or improvements.
LARS
LARS is an application that enables users to run Large Language Models (LLMs) locally on their devices, upload their own documents, and engage in conversations where the LLM grounds its responses with the uploaded content. The application focuses on Retrieval Augmented Generation (RAG) to increase accuracy and reduce AI-generated inaccuracies. LARS provides advanced citations, supports various file formats, allows follow-up questions, provides full chat history, and offers customization options for LLM settings. Users can force enable or disable RAG, change system prompts, and tweak advanced LLM settings. The application also supports GPU-accelerated inferencing, multiple embedding models, and text extraction methods. LARS is open-source and aims to be the ultimate RAG-centric LLM application.
raglite
RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite. It offers configurable options for choosing LLM providers, database types, and rerankers. The toolkit is fast and permissive, utilizing lightweight dependencies and hardware acceleration. RAGLite provides features like PDF to Markdown conversion, multi-vector chunk embedding, optimal semantic chunking, hybrid search capabilities, adaptive retrieval, and improved output quality. It is extensible with a built-in Model Context Protocol server, customizable ChatGPT-like frontend, document conversion to Markdown, and evaluation tools. Users can configure RAGLite for various tasks like configuring, inserting documents, running RAG pipelines, computing query adapters, evaluating performance, running MCP servers, and serving frontends.
20 - OpenAI GPTs
GPT Store
A GPT specialized in curating, documenting, and updating GPTs on Github at https://github.com/prajwalsouza/GPT-Store
Story book
Chinese-speaking assistant for weather, translation, and sequential Little Rabbit stories over 1000 words
GAPP STORE
Welcome to GAPP Store: Chat, create, customize—your all-in-one AI app universe
GPT store
Enthusiastic assistant showcasing the latest GPT technologies with a focus on security.
HaGiPT
The GPT King, who tries to 'pass' precise answers and 'score' points with his artificial intelligence.
Creative Writing Coach
I'm eager to read your work and give you feedback to improve your skills.
三国志bot
An AI trained on Eiji Yoshikawa's "Sangokushi" (Romance of the Three Kingdoms) from Aozora Bunko. Eiji Yoshikawa: https://www.aozora.gr.jp/index_pages/person1562.html#sakuhin_list_1