Best AI tools for< Retrieve From Corpus >
20 - AI tool Sites

Fluent
Fluent is an AI-powered data analytics platform that helps businesses explore their data and uncover insights. It uses natural language processing to understand user questions and generate SQL queries to retrieve data from a variety of sources. Fluent also provides visualizations and dashboards to help users understand their data and make informed decisions.

neurons.bio
neurons.bio is an AI application that offers a unique collection of over 100 AI agents designed for drug development, medicine, and life science research. These agents perform specific tasks efficiently, retrieve data from various sources, and provide insights to accelerate research processes. The platform aims to revolutionize drug discovery and development by integrating cutting-edge LLM technology with domain-specific agents, reducing research costs and time to clinic.

Reworkd
Reworkd is a web data extraction tool that uses AI to generate and repair web extractors on the fly. It allows users to retrieve data from hundreds of websites without the need for developers. Reworkd is used by businesses in a variety of industries, including manufacturing, e-commerce, recruiting, lead generation, and real estate.

Shieldbase
Shieldbase is an AI-powered enterprise search tool designed to provide secure and efficient search capabilities for businesses. It utilizes advanced artificial intelligence algorithms to index and retrieve information from various data sources within an organization, ensuring quick and accurate search results. With a focus on security, Shieldbase offers encryption and access control features to protect sensitive data. The platform is user-friendly and customizable, making it easy for businesses to implement and integrate into their existing systems. Shieldbase enhances productivity by enabling employees to quickly find the information they need, ultimately improving decision-making processes and overall operational efficiency.

ExplainDev
ExplainDev is a platform that allows users to ask and answer technical coding questions. It uses computer vision to retrieve technical context from images or videos. The platform is designed to help developers get the best answers to their technical questions and guide others to theirs.

Wondershare Help Center
Wondershare Help Center provides comprehensive support for Wondershare products, including video editing, video creation, diagramming, PDF solutions, and data management. It offers a wide range of resources such as tutorials, FAQs, troubleshooting guides, and access to customer support.

Extracta.ai
Extracta.ai is an AI data extraction tool for documents and images that automates data extraction processes with easy integration. It allows users to define custom templates for extracting structured data without the need for training. The platform can extract data from various document types, including invoices, resumes, contracts, receipts, and more, providing accurate and efficient results. Extracta.ai ensures data security, encryption, and GDPR compliance, making it a reliable solution for businesses looking to streamline document processing.

Callin.io
Callin.io is an innovative AI solution that offers AI-driven virtual phone agents and assistants to enhance customer engagement and support. The platform provides customizable AI voice agents tailored to meet the specific needs of businesses, handling inbound and outbound customer conversations efficiently. With features like answering missed calls, assisting with appointment bookings, and responding to FAQs, Callin.io aims to revolutionize customer service operations and improve overall customer experience. The AI technology is designed to seamlessly integrate with existing CRM solutions and call center technology, providing real-time call transcripts and valuable insights from every conversation.

Octoparse
Octoparse is an AI web scraping tool that offers a no-coding solution for turning web pages into structured data with just a few clicks. It provides users with the ability to build reliable web scrapers without any coding knowledge, thanks to its intuitive workflow designer. With features like AI assistance, automation, and template libraries, Octoparse is a powerful tool for data extraction and analysis across various industries.

ONERECOVERY
ONERECOVERY is a professional data recovery solution for Windows that offers comprehensive and expert solutions to recover lost data from various storage devices. The software is designed to handle data loss for over 1,000 scenarios, including accidental deletion, formatting errors, virus attacks, and more. ONERECOVERY provides features such as file recovery for Windows and Mac, file duplicate finder, photo and video recovery, hard drive recovery, SD card data recovery, and more. With a user-friendly interface, quick and efficient scanning, and compatibility with diverse operating systems and storage devices, ONERECOVERY is a reliable and secure data recovery tool trusted by millions of users worldwide.

Ubblu
Ubblu is an AI-driven note-taking application that aims to help users search less and create more by providing a seamless experience for capturing, organizing, and retrieving ideas and information. It offers features like note capture, card writing, tag categorization, instant knowledge retrieval, and 'Ask' functionality for quick access to stored information. Ubblu is designed to liberate users' minds from information retention, allowing them to focus on innovation and creativity. The application is desktop-based with a mobile version in development.

xPDF AI by PDFChat
xPDF AI by PDFChat is a personal AI assistant designed for PDF files. It offers advanced features to analyze tables, figures, and text from PDF documents, providing users with instant answers and insights. The AI assistant uses a chat interface for effortless interaction and is capable of summarizing PDF files, retrieving relevant figures, processing tables intelligently, and performing accurate calculations. Users can also benefit from voice chat, advanced search tools, performance analytics, report generation, and document assistance. With over 10,000 users trusting the platform, PDFChat aims to revolutionize document analysis and enhance productivity.

Knowledge Drive
Knowledge Drive is the world's only self-organizing, self-maintaining, and fully integrated work knowledge system. It utilizes AI technology to automatically build a knowledge base by extracting useful information from documents. The system ensures knowledge freshness, easy access to information, and seamless integration across various platforms like Microsoft Office 365, Google Workspace, and Slack. Knowledge Drive aims to revolutionize knowledge management and boost productivity in teams by providing a central source of truth and eliminating the need for manual documentation.

MemFree
MemFree is a hybrid AI search tool that allows users to search for information instantly and receive accurate answers from the internet, bookmarks, notes, and documents. With MemFree, users can easily index their bookmarks and web pages with just one click. The tool leverages GPT-4o mini for enhanced search capabilities, making it a powerful and efficient AI application for information retrieval.

NPI Lookup
NPI Lookup is an AI-powered platform that offers advanced search and validation services for National Provider Identifier (NPI) numbers of healthcare providers in the United States. The tool uses cutting-edge artificial intelligence technology, including Natural Language Processing (NLP) algorithms and GPT models, to provide comprehensive insights and answers related to NPI profiles. It allows users to search and validate NPI records of doctors, hospitals, and other healthcare providers using everyday language queries, ensuring accurate and up-to-date information from the NPPES NPI database.

OpenHands
OpenHands is an AI tool developed by All Hands AI, Inc. that empowers users to streamline their development process by leveraging artificial intelligence. The tool allows users to automate tasks typically performed by human developers, such as modifying code, running commands, browsing the web, calling APIs, and even copying code snippets from StackOverflow. OpenHands aims to help users save time and increase productivity by handling repetitive and time-consuming tasks efficiently.

Bloks
Bloks is an AI-powered application designed to help users prepare for meetings by providing personalized briefs, meeting minutes, and relationship insights. It acts as a personal sidekick, maximizing every interaction by generating context-aware summaries, templates, and meeting dynamics based on personality types. Bloks eliminates the need for manual note-taking by automatically capturing and summarizing conversations from various sources like emails, attachments, and CRM. Users can chat with Bloks to get detailed meeting insights, relationship progress summaries, and identify conversation trends. The application is used in leading organizations to enhance productivity and streamline meeting preparation.

LotseAI
LotseAI is an AI-powered program management tool designed to streamline projects, align teams, and enhance decision-making. It offers secure integrations with popular tools like Notion, Google Docs, Jira, Trello, Asana, Slack, and more. LotseAI provides features such as effortless information retrieval, seamless tool integration, AI-powered reporting & insights, and dynamic tools for dynamic leaders. It helps users free up time from manual tasks, stay ahead of potential roadblocks, and keep stakeholders informed and engaged.

Exa
Exa is a search engine that uses embeddings-based search to retrieve the best content on the web. It is trusted by companies and developers from all over the world. Exa is like Google, but it is better at understanding the meaning of your queries and returning results that are more relevant to your needs. Exa can be used for a variety of tasks, including finding information on the web, conducting research, and building AI applications.

Unlost
Unlost is a memory recall tool that allows users to instantly retrieve information with zero effort. It helps users never lose track or forget any details by recording and intelligently understanding their screen layout and content. Unlost operates privately and offline, respecting user space and copyright law. The tool offers quick access, powerful filtering, and familiar keyboard shortcuts for effortless searching. Users can search meeting transcripts, copy text from screenshots, and exclude capturing specific apps or websites. Unlost aims to delegate memory and enhance user capacity effortlessly.
20 - Open Source AI Tools

bocoel
BoCoEL is a tool that leverages Bayesian Optimization to efficiently evaluate large language models by selecting a subset of the corpus for evaluation. It encodes individual entries into embeddings, uses Bayesian optimization to select queries, retrieves from the corpus, and provides easily managed evaluations. The tool aims to reduce computation costs during evaluation with a dynamic budget, supporting models like GPT2, Pythia, and LLAMA through integration with Hugging Face transformers and datasets. BoCoEL offers a modular design and efficient representation of the corpus to enhance evaluation quality.

rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool that helps you conduct experiments and evaluations using Azure AI Search and RAG pattern. It offers a rich set of features, including experiment setup, integration with Azure AI Search, Azure Machine Learning, MLFlow, and Azure OpenAI, multiple document chunking strategies, query generation, multiple search types, sub-querying, re-ranking, metrics and evaluation, report generation, and multi-lingual support. The tool is designed to make it easier and faster to run experiments and evaluations of search queries and quality of response from OpenAI, and is useful for researchers, data scientists, and developers who want to test the performance of different search and OpenAI related hyperparameters, compare the effectiveness of various search strategies, fine-tune and optimize parameters, find the best combination of hyperparameters, and generate detailed reports and visualizations from experiment results.

ScholarCopilot
Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completion and citation suggestions. It aims to streamline academic writing while maintaining high scholarly standards. The tool provides features such as smart text generation with next-3-sentence suggestions, full section auto-completion, and context-aware writing. It also offers intelligent citation management with real-time citation suggestions, one-click citation insertion, and citation Bibtex generation. Scholar Copilot employs a unified model architecture that integrates retrieval and generation through a dynamic switching mechanism, ensuring coherent text generation with appropriate citation points.

Google_GenerativeAI
Google GenerativeAI (Gemini) is an unofficial C# .Net SDK based on REST APIs for accessing Google Gemini models. It offers a complete rewrite of the previous SDK with improved performance, flexibility, and ease of use. The SDK seamlessly integrates with LangChain.net, providing easy methods for JSON-based interactions and function calling with Google Gemini models. It includes features like enhanced JSON mode handling, function calling with code generator, multi-modal functionality, Vertex AI support, multimodal live API, image generation and captioning, retrieval-augmented generation with Vertex RAG Engine and Google AQA, easy JSON handling, Gemini tools and function calling, multimodal live API, and more.

awesome-transformer-nlp
This repository contains a hand-curated list of great machine (deep) learning resources for Natural Language Processing (NLP) with a focus on Generative Pre-trained Transformer (GPT), Bidirectional Encoder Representations from Transformers (BERT), attention mechanism, Transformer architectures/networks, Chatbot, and transfer learning in NLP.

ragoon
RAGoon is a high-level library designed for batched embeddings generation, fast web-based RAG (Retrieval-Augmented Generation) processing, and quantized indexes processing. It provides NLP utilities for multi-model embedding production, high-dimensional vector visualization, and enhancing language model performance through search-based querying, web scraping, and data augmentation techniques.

AutoRAG
AutoRAG is an AutoML tool designed to automatically find the optimal RAG pipeline for your data. It simplifies the process of evaluating various RAG modules to identify the best pipeline for your specific use-case. The tool supports easy evaluation of different module combinations, making it efficient to find the most suitable RAG pipeline for your needs. AutoRAG also offers a cloud beta version to assist users in running and optimizing the tool, along with building RAG evaluation datasets for a starting price of $9.99 per optimization.

awesome-hallucination-detection
This repository provides a curated list of papers, datasets, and resources related to the detection and mitigation of hallucinations in large language models (LLMs). Hallucinations refer to the generation of factually incorrect or nonsensical text by LLMs, which can be a significant challenge for their use in real-world applications. The resources in this repository aim to help researchers and practitioners better understand and address this issue.

gfm-rag
The GFM-RAG is a graph foundation model-powered pipeline that combines graph neural networks to reason over knowledge graphs and retrieve relevant documents for question answering. It features a knowledge graph index, efficiency in multi-hop reasoning, generalizability to unseen datasets, transferability for fine-tuning, compatibility with agent-based frameworks, and interpretability of reasoning paths. The tool can be used for conducting retrieval and question answering tasks using pre-trained models or fine-tuning on custom datasets.

graphrag-local-ollama
GraphRAG Local Ollama is a repository that offers an adaptation of Microsoft's GraphRAG, customized to support local models downloaded using Ollama. It enables users to leverage local models with Ollama for large language models (LLMs) and embeddings, eliminating the need for costly OpenAPI models. The repository provides a simple setup process and allows users to perform question answering over private text corpora by building a graph-based text index and generating community summaries for closely-related entities. GraphRAG Local Ollama aims to improve the comprehensiveness and diversity of generated answers for global sensemaking questions over datasets.

awesome-llm-attributions
This repository focuses on unraveling the sources that large language models tap into for attribution or citation. It delves into the origins of facts, their utilization by the models, the efficacy of attribution methodologies, and challenges tied to ambiguous knowledge reservoirs, biases, and pitfalls of excessive attribution.

LLPhant
LLPhant is a comprehensive PHP Generative AI Framework that provides a simple and powerful way to build apps. It supports Symfony and Laravel and offers a wide range of features, including text generation, chatbots, text summarization, and more. LLPhant is compatible with OpenAI and Ollama and can be used to perform a variety of tasks, including creating semantic search, chatbots, personalized content, and text summarization.

LLPhant
LLPhant is a comprehensive PHP Generative AI Framework designed to be simple yet powerful, compatible with Symfony and Laravel. It supports various LLMs like OpenAI, Anthropic, Mistral, Ollama, and services compatible with OpenAI API. The framework enables tasks such as semantic search, chatbots, personalized content creation, text summarization, personal shopper creation, autonomous AI agents, and coding tool assistance. It provides tools for generating text, images, speech-to-text transcription, and customizing system messages for question answering. LLPhant also offers features for embeddings, vector stores, document stores, and question answering with various query transformations and reranking techniques.

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.

HippoRAG
HippoRAG is a novel retrieval augmented generation (RAG) framework inspired by the neurobiology of human long-term memory that enables Large Language Models (LLMs) to continuously integrate knowledge across external documents. It provides RAG systems with capabilities that usually require a costly and high-latency iterative LLM pipeline for only a fraction of the computational cost. The tool facilitates setting up retrieval corpus, indexing, and retrieval processes for LLMs, offering flexibility in choosing different online LLM APIs or offline LLM deployments through LangChain integration. Users can run retrieval on pre-defined queries or integrate directly with the HippoRAG API. The tool also supports reproducibility of experiments and provides data, baselines, and hyperparameter tuning scripts for research purposes.

LLM-for-Healthcare
The repository 'LLM-for-Healthcare' provides a comprehensive survey of large language models (LLMs) for healthcare, covering data, technology, applications, and accountability and ethics. It includes information on various LLM models, training data, evaluation methods, and computation costs. The repository also discusses tasks such as NER, text classification, question answering, dialogue systems, and generation of medical reports from images in the healthcare domain.
18 - OpenAI Gpts

Downloader
Download data from the internet. Fetch the content of sites and make it available to the session, given a URL.

MagicUnprotect
This GPT allows to interact with the Unprotect DB to retrieve knowledge about malware evasion techniques

MemoryGPT
Never lose data again. Store entire conversations for later retrieve or sharing. Do not share sensible information, data is publicly available.

MyGoogle
Connect and interact with your Google accounts. Organize, retrieve, and manipulate data with A.I
AskYourPDF Research Assistantxxxx
Unlock the power of your research with the AskYourPDF Research Assistant. Bring information to your fingertips today.

Lambeth Planning Policy Bot
I search Lambeth's planning site to provide links to policies and documents.

Comprehensive Second Brain Assistant
Expert in Tiago Forte's Second Brain methodology for digital organization.

Efficient Assistant - Dr. Cho 😎
Efficient Assistant for task management, info retrieval, and scheduling. Offers dynamic, personalized support while ensuring user privacy and data security. Ideal for organizing tasks, setting reminders, and providing up-to-date information.

Help Me Think of That Thing
Can't quite remember that thought you had? Use this GPT to help guide you back to your memory.

RSS Finder | Find the RSS in any website
Finds and provides RSS feed URLs for given website links.

Golden Retriever Training Assistant and Consultant
Golden Retriever training expert providing advice and tips

Hunting Planner
Retrieves hunting-related data for each state. Providing insightful data analysis on trends in hunting statistics. (beta)

How to Train a Chessie
Comprehensive training and wellness guide for Chesapeake Bay Retrievers.