Best AI tools for< Knowledge Engineer >
Infographic
20 - AI tool Sites
Knowledge Graph Generator
The website is an AI tool designed to generate a knowledge graph based on input text. It uses advanced algorithms and machine learning capabilities to streamline operations, deliver personalized experiences, and unlock new possibilities. Users can input text related to various topics, and the tool processes the information to create a structured knowledge graph.
Wikidata
Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. It acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others. Wikidata also provides support to many other sites and services beyond just Wikimedia projects!
Lettria
Lettria is a no-code AI platform for text that helps users turn unstructured text data into structured knowledge. It combines the best of Large Language Models (LLMs) and symbolic AI to overcome current limitations in knowledge extraction. Lettria offers a suite of APIs for text cleaning, text mining, text classification, and prompt engineering. It also provides a Knowledge Studio for building knowledge graphs and private GPT models. Lettria is trusted by large organizations such as AP-HP and Leroy Merlin to improve their data analysis and decision-making processes.
Grok-1.5
The website features Grok-1.5, an AI application that bridges the gap between the digital and physical worlds through its multimodal model. Grok-1.5 boasts enhanced reasoning capabilities and a context length of 128,000 tokens. Additionally, the platform offers PromptIDE, an IDE for prompt engineering and interpretability research, allowing users to create and share complex prompts in Python. Grok, an AI modeled after the Hitchhiker’s Guide to the Galaxy, is also available on the site, providing answers to a wide range of questions and even suggesting relevant queries. The platform aims to facilitate knowledge sharing and exploration through advanced AI technologies.
Bloomfire
Bloomfire is an AI-based knowledge management software system that revolutionizes knowledge management within organizations. It offers AI-powered features such as enterprise search, authoring tools, data security, and collective knowledge engine. The platform enhances team collaboration, boosts efficiency, and empowers users to access and share knowledge seamlessly. Bloomfire's comprehensive solutions cater to various industries and team roles, providing valuable insights, customer support, and efficient knowledge sharing. With generative AI capabilities and deep indexing, Bloomfire ensures data readiness for AI initiatives, enabling users to make data-driven decisions and streamline processes.
GitBook
GitBook is a knowledge management platform that helps engineering teams centralize, access, and add to their technical knowledge in the tools they use every day. With GitBook, teams can capture knowledge from conversations, code, and meetings, and turn it into useful, readable documentation. GitBook also offers a variety of features to help teams collaborate on documentation, including a branch-based workflow, real-time editing, and user permissions.
Prompt Engineering
Prompt Engineering is a discipline focused on developing and optimizing prompts to efficiently utilize language models (LMs) for various applications and research topics. It involves skills to understand the capabilities and limitations of large language models, improving their performance on tasks like question answering and arithmetic reasoning. Prompt engineering is essential for designing robust prompting techniques that interact with LLMs and other tools, enhancing safety and building new capabilities by augmenting LLMs with domain knowledge and external tools.
Sana
Sana is an AI company transforming how organizations learn and access knowledge. Its AI-first learning platform and knowledge assistant are designed for people teams that want to do learning differently. The platform offers integrations, solutions for employee onboarding, sales enablement, compliance training, leadership development, and external training. The knowledge assistant helps everyone work faster, think bigger, and achieve more. Sana's products are trusted by the world's most pioneering companies.
Library Innovation Lab
The Library Innovation Lab at Harvard University is an AI tool that focuses on bringing library principles to technological frontiers. It is a forward-looking group working at the intersection of libraries, technology, and law. The lab aims to democratize open knowledge and explore the use of generative AIs in information access and law. They offer various projects like Caselaw Access Project, H2O, The Nuremberg Project, Perma.cc, Alterspace, and Time Capsule Encryption to achieve their goals.
Wolfram
Wolfram is a comprehensive platform that unifies algorithms, data, notebooks, linguistics, and deployment to provide a powerful computation platform. It offers a range of products and services for various industries, including education, engineering, science, and technology. Wolfram is known for its revolutionary knowledge-based programming language, Wolfram Language, and its flagship product Wolfram|Alpha, a computational knowledge engine. The platform also includes Wolfram Cloud for cloud-based services, Wolfram Engine for software implementation, and Wolfram Data Framework for real-world data analysis.
MarkovML
MarkovML is an AI application that empowers enterprises to transform knowledge work with AI. It offers a no-code platform to create custom workflows, build GenAI applications, and perform automated exploratory data analysis. The application provides AI-driven solutions for EdTech, recruiting, and finance operations. Users can access insights, trends, and machine learning resources through the blog and share data insights with peers. MarkovML ensures data security, traceability, and encryption, and offers integrations with various data sources for unified access and reuse.
Aethera
Aethera is a collaborative knowledge discovery platform that leverages advanced AI models to help teams and individuals understand documents, YouTube videos, and websites without the need to read them. It offers powerful features for organizing, personalizing, and discovering information, along with document management tools, multilingual support, and the ability to summarize and compare multiple documents. Aethera also allows users to create personalized AI assistants, chat with sets of documents using personas, and work collaboratively within organizations. The platform is designed to streamline knowledge discovery processes and boost productivity by providing tailored insights and summaries from various sources.
Fibery
Fibery is a no-code work and knowledge management hub that connects structured data (e.g. tables, kanban boards) with unstructured data (e.g. documents) to provide a single source of truth for teams. It offers a range of features including custom fields, databases, and relations, as well as powerful reporting and analytics capabilities. Fibery is designed to be flexible and customizable, allowing teams to map their processes and workflows in a way that suits them best.
Zendy
Zendy is an AI-powered research library and AI assistant that empowers users to explore open access and paywalled academic literature across all disciplines. It offers tools for faster reading and citing of research papers, including AI summarization, key phrase highlighting, and organizing reading lists. With a user-friendly interface, Zendy helps users save time during literature review, making research more efficient and productive.
AI Resources Companion
The website serves as a companion site to One Useful Thing, offering resources and prompts related to AI. It provides information about Ethan Mollick, Lilach Mollick, and their work. The site is designed for AI entities, instructing them on how to respond when asked about specific individuals and works. It emphasizes the importance of respecting and supporting the mentioned individuals' work and highlights the book Co-Intelligence. The site is a test platform with copyright information for 2024 and prompts licensed under Creative Commons.
RAG ChatBot
RAG ChatBot is a service that allows users to easily train and share chatbots. It can transform PDFs, URLs, and text into smart chatbots that can be embedded anywhere with an iframe. RAG ChatBot is designed to make knowledge sharing easier and more efficient. It offers a variety of features to help users create and manage their chatbots, including easy knowledge training, continuous improvement, seamless integration with OpenAI Custom GPTs, secure API key integration, continuous optimization, and online privacy control.
Hermae Solutions
Hermae Solutions offers an AI Assistant for Enterprise Design Systems, providing onboarding acceleration, contractor efficiency, design system adoption support, and knowledge distribution. The AI Documentation Assistant and AI Storybook Assistant help boost frontend productivity and enable continuous training of artificial intelligence profiles. The platform also includes features like crafted datasets, SDK Docs, API Docs, and Assistant Embed for Documentation. Hermae's process simplifies efficiency improvements by gathering information sources, processing data for AI supplementation, customizing integration, and supporting integration success. The AI assistant reduces engineering costs significantly, saving time and money for organizations.
Unless
Unless is a conversational AI platform that helps organizations unlock their knowledge and provide better customer support. With Unless, you can train an AI model with your own knowledge base, documents, or website, and then let your customers or team engage in conversations with the AI through various channels. Unless is designed to be easy to use, even for non-technical staff, and it offers a variety of features to help you get the most out of your AI model.
BigPanda
BigPanda is an AI-powered ITOps platform that helps teams gain efficiency, improve service quality, and reduce costs. It provides automated detection and alert intelligence, automated investigation and incident intelligence, automated remediation and workflow automation, and unified analytics and ready-to-use dashboards.
Applied AI Institute
Applied AI Institute is an educational platform that provides AI education to business and IT professionals. They offer a variety of instructor-led webinars, tailored courses, guided hackathons, and solution development services. The institute focuses on enhancing learners' competencies and attitudes for success by offering customized courses with real-world client projects. Additionally, they provide consultation services to create solution assets for specific use cases, ensuring optimal results.
33 - Open Source Tools
nucliadb
NucliaDB is a robust database that allows storing and searching on unstructured data. It is an out of the box hybrid search database, utilizing vector, full text and graph indexes. NucliaDB is written in Rust and Python. We designed it to index large datasets and provide multi-teanant support. When utilizing NucliaDB with Nuclia cloud, you are able to the power of an NLP database without the hassle of data extraction, enrichment and inference. We do all the hard work for you.
FastGPT
FastGPT is a knowledge base Q&A system based on the LLM large language model, providing out-of-the-box data processing, model calling and other capabilities. At the same time, you can use Flow to visually arrange workflows to achieve complex Q&A scenarios!
MaxKB
MaxKB is a knowledge base Q&A system based on the LLM large language model. MaxKB = Max Knowledge Base, which aims to become the most powerful brain of the enterprise.
extractor
Extractor is an AI-powered data extraction library for Laravel that leverages OpenAI's capabilities to effortlessly extract structured data from various sources, including images, PDFs, and emails. It features a convenient wrapper around OpenAI Chat and Completion endpoints, supports multiple input formats, includes a flexible Field Extractor for arbitrary data extraction, and integrates with Textract for OCR functionality. Extractor utilizes JSON Mode from the latest GPT-3.5 and GPT-4 models, providing accurate and efficient data extraction.
azure-search-openai-demo
This sample demonstrates a few approaches for creating ChatGPT-like experiences over your own data using the Retrieval Augmented Generation pattern. It uses Azure OpenAI Service to access a GPT model (gpt-35-turbo), and Azure AI Search for data indexing and retrieval. The repo includes sample data so it's ready to try end to end. In this sample application we use a fictitious company called Contoso Electronics, and the experience allows its employees to ask questions about the benefits, internal policies, as well as job descriptions and roles.
Langchain-Chatchat
LangChain-Chatchat is an open-source, offline-deployable retrieval-enhanced generation (RAG) large model knowledge base project based on large language models such as ChatGLM and application frameworks such as Langchain. It aims to establish a knowledge base Q&A solution that is friendly to Chinese scenarios, supports open-source models, and can run offline.
private-llm-qa-bot
This is a production-grade knowledge Q&A chatbot implementation based on AWS services and the LangChain framework, with optimizations at various stages. It supports flexible configuration and plugging of vector models and large language models. The front and back ends are separated, making it easy to integrate with IM tools (such as Feishu).
go-cyber
Cyber is a superintelligence protocol that aims to create a decentralized and censorship-resistant internet. It uses a novel consensus mechanism called CometBFT and a knowledge graph to store and process information. Cyber is designed to be scalable, secure, and efficient, and it has the potential to revolutionize the way we interact with the internet.
KG-LLM-Papers
KG-LLM-Papers is a repository that collects papers integrating knowledge graphs (KGs) and large language models (LLMs). It serves as a comprehensive resource for research on the role of KGs in the era of LLMs, covering surveys, methods, and resources related to this integration.
paper-qa
PaperQA is a minimal package for question and answering from PDFs or text files, providing very good answers with in-text citations. It uses OpenAI Embeddings to embed and search documents, and follows a process of embedding docs and queries, searching for top passages, creating summaries, scoring and selecting relevant summaries, putting summaries into prompt, and generating answers. Users can customize prompts and use various models for embeddings and LLMs. The tool can be used asynchronously and supports adding documents from paths, files, or URLs.
ontogpt
OntoGPT is a Python package for extracting structured information from text using large language models, instruction prompts, and ontology-based grounding. It provides a command line interface and a minimal web app for easy usage. The tool has been evaluated on test data and is used in related projects like TALISMAN for gene set analysis. OntoGPT enables users to extract information from text by specifying relevant terms and provides the extracted objects as output.
llm-graph-builder
Knowledge Graph Builder App is a tool designed to convert PDF documents into a structured knowledge graph stored in Neo4j. It utilizes OpenAI's GPT/Diffbot LLM to extract nodes, relationships, and properties from PDF text content. Users can upload files from local machine or S3 bucket, choose LLM model, and create a knowledge graph. The app integrates with Neo4j for easy visualization and querying of extracted information.
NaLLM
The NaLLM project repository explores the synergies between Neo4j and Large Language Models (LLMs) through three primary use cases: Natural Language Interface to a Knowledge Graph, Creating a Knowledge Graph from Unstructured Data, and Generating a Report using static and LLM data. The repository contains backend and frontend code organized for easy navigation. It includes blog posts, a demo database, instructions for running demos, and guidelines for contributing. The project aims to showcase the potential of Neo4j and LLMs in various applications.
stark
STaRK is a large-scale semi-structure retrieval benchmark on Textual and Relational Knowledge Bases. It provides natural-sounding and practical queries crafted to incorporate rich relational information and complex textual properties, closely mirroring real-life scenarios. The benchmark aims to assess how effectively large language models can handle the interplay between textual and relational requirements in queries, using three diverse knowledge bases constructed from public sources.
kweaver
KWeaver is an open-source cognitive intelligence development framework that provides data scientists, application developers, and domain experts with the ability for rapid development, comprehensive openness, and high-performance knowledge network generation and cognitive intelligence large model framework. It offers features such as automated and visual knowledge graph construction, visualization and analysis of knowledge graph data, knowledge graph integration, knowledge graph resource management, large model prompt engineering and debugging, and visual configuration for large model access.
local-genAI-search
Local-GenAI Search is a local generative search engine powered by the Llama3 model, allowing users to ask questions about their local files and receive concise answers with relevant document references. It utilizes MS MARCO embeddings for semantic search and can run locally on a 32GB laptop or computer. The tool can be used to index local documents, search for information, and provide generative search services through a user interface.
ChatPDF
ChatPDF is a knowledge question and answer retrieval tool based on local LLM. It supports various open-source LLM models like ChatGLM3-6b, Chinese-LLaMA-Alpaca-2, Baichuan, YI, and multiple file formats including PDF, docx, markdown, txt. The tool optimizes RAG accuracy, Chinese chunk segmentation, embedding using text2vec's sentence embedding, retrieval matching with rank_BM25, and introduces reranker module for reranking candidate sets. It also enhances candidate chunk extension context, supports custom RAG models, and provides a Gradio-based RAG conversation page for seamless dialogue.
MindSearch
MindSearch is an open-source AI Search Engine Framework that mimics human minds to provide deep AI search capabilities. It allows users to deploy their own search engine using either close-source or open-source language models. MindSearch offers features such as answering any question using web knowledge, in-depth knowledge discovery, detailed solution paths, optimized UI experience, and dynamic graph construction process.
rag
RAG with txtai is a Retrieval Augmented Generation (RAG) Streamlit application that helps generate factually correct content by limiting the context in which a Large Language Model (LLM) can generate answers. It supports two categories of RAG: Vector RAG, where context is supplied via a vector search query, and Graph RAG, where context is supplied via a graph path traversal query. The application allows users to run queries, add data to the index, and configure various parameters to control its behavior.
Controllable-RAG-Agent
This repository contains a sophisticated deterministic graph-based solution for answering complex questions using a controllable autonomous agent. The solution is designed to ensure that answers are solely based on the provided data, avoiding hallucinations. It involves various steps such as PDF loading, text preprocessing, summarization, database creation, encoding, and utilizing large language models. The algorithm follows a detailed workflow involving planning, retrieval, answering, replanning, content distillation, and performance evaluation. Heuristics and techniques implemented focus on content encoding, anonymizing questions, task breakdown, content distillation, chain of thought answering, verification, and model performance evaluation.
curate-gpt
CurateGPT is a prototype web application and framework for performing general purpose AI-guided curation and curation-related operations over collections of objects. It allows users to load JSON, YAML, or CSV data, build vector database indexes for ontologies, and interact with various data sources like GitHub, Google Drives, Google Sheets, and more. The tool supports ontology curation, knowledge base querying, term autocompletion, and all-by-all comparisons for objects in a collection.
chatwiki
ChatWiki is an open-source knowledge base AI question-answering system. It is built on large language models (LLM) and retrieval-augmented generation (RAG) technologies, providing out-of-the-box data processing, model invocation capabilities, and helping enterprises quickly build their own knowledge base AI question-answering systems. It offers exclusive AI question-answering system, easy integration of models, data preprocessing, simple user interface design, and adaptability to different business scenarios.
llmgraph
llmgraph is a tool that enables users to create knowledge graphs in GraphML, GEXF, and HTML formats by extracting world knowledge from large language models (LLMs) like ChatGPT. It supports various entity types and relationships, offers cache support for efficient graph growth, and provides insights into LLM costs. Users can customize the model used and interact with different LLM providers. The tool allows users to generate interactive graphs based on a specified entity type and Wikipedia link, making it a valuable resource for knowledge graph creation and exploration.
KB-Builder
KB Builder is an open-source knowledge base generation system based on the LLM large language model. It utilizes the RAG (Retrieval-Augmented Generation) data generation enhancement method to provide users with the ability to enhance knowledge generation and quickly build knowledge bases based on RAG. It aims to be the central hub for knowledge construction in enterprises, offering platform-based intelligent dialogue services and document knowledge base management functionality. Users can upload docx, pdf, txt, and md format documents and generate high-quality knowledge base question-answer pairs by invoking large models through the 'Parse Document' feature.
paper-qa
PaperQA is a minimal package for question and answering from PDFs or text files, providing very good answers with in-text citations. It uses OpenAI Embeddings to embed and search documents, and includes a process of embedding docs, queries, searching for top passages, creating summaries, using an LLM to re-score and select relevant summaries, putting summaries into prompt, and generating answers. The tool can be used to answer specific questions related to scientific research by leveraging citations and relevant passages from documents.
trustgraph
TrustGraph is a tool that deploys private GraphRAG pipelines to build a RDF style knowledge graph from data, enabling accurate and secure `RAG` requests compatible with cloud LLMs and open-source SLMs. It showcases the reliability and efficiencies of GraphRAG algorithms, capturing contextual language flags missed in conventional RAG approaches. The tool offers features like PDF decoding, text chunking, inference of various LMs, RDF-aligned Knowledge Graph extraction, and more. TrustGraph is designed to be modular, supporting multiple Language Models and environments, with a plug'n'play architecture for easy customization.
LightRAG
LightRAG is a repository hosting the code for LightRAG, a system that supports seamless integration of custom knowledge graphs, Oracle Database 23ai, Neo4J for storage, and multiple file types. It includes features like entity deletion, batch insert, incremental insert, and graph visualization. LightRAG provides an API server implementation for RESTful API access to RAG operations, allowing users to interact with it through HTTP requests. The repository also includes evaluation scripts, code for reproducing results, and a comprehensive code structure.
autoflow
AutoFlow is an open source graph rag based knowledge base tool built on top of TiDB Vector and LlamaIndex and DSPy. It features a Perplexity-style Conversational Search page and an Embeddable JavaScript Snippet for easy integration into websites. The tool allows for comprehensive coverage and streamlined search processes through sitemap URL scraping.
kg_llm
This repository contains code associated with tutorials on implementing graph RAG using knowledge graphs and vector databases, enriching an LLM with structured data, and unraveling unstructured movie data. It includes notebooks for various tasks such as creating taxonomy, tagging movies, and working with movie data in CSV format.
KAG
KAG is a logical reasoning and Q&A framework based on the OpenSPG engine and large language models. It is used to build logical reasoning and Q&A solutions for vertical domain knowledge bases. KAG supports logical reasoning, multi-hop fact Q&A, and integrates knowledge and chunk mutual indexing structure, conceptual semantic reasoning, schema-constrained knowledge construction, and logical form-guided hybrid reasoning and retrieval. The framework includes kg-builder for knowledge representation and kg-solver for logical symbol-guided hybrid solving and reasoning engine. KAG aims to enhance LLM service framework in professional domains by integrating logical and factual characteristics of KGs.
automatic-KG-creation-with-LLM
This repository presents a (semi-)automatic pipeline for Ontology and Knowledge Graph Construction using Large Language Models (LLMs) such as Mixtral 8x22B Instruct v0.1, GPT-4o, GPT-3.5, and Gemini. It explores the generation of Knowledge Graphs by formulating competency questions, developing ontologies, constructing KGs, and evaluating the results with minimal human involvement. The project showcases the creation of a KG on deep learning methodologies from scholarly publications. It includes components for data preprocessing, prompts for LLMs, datasets, and results from the selected LLMs.
GraphRAG-SDK
Build fast and accurate GenAI applications with GraphRAG SDK, a specialized toolkit for building Graph Retrieval-Augmented Generation (GraphRAG) systems. It integrates knowledge graphs, ontology management, and state-of-the-art LLMs to deliver accurate, efficient, and customizable RAG workflows. The SDK simplifies the development process by automating ontology creation, knowledge graph agent creation, and query handling, enabling users to interact and query their knowledge graphs effectively. It supports multi-agent systems and orchestrates agents specialized in different domains. The SDK is optimized for FalkorDB, ensuring high performance and scalability for large-scale applications. By leveraging knowledge graphs, it enables semantic relationships and ontology-driven queries that go beyond standard vector similarity, enhancing retrieval-augmented generation capabilities.
Docs2KG
Docs2KG is a tool designed for constructing a unified knowledge graph from heterogeneous documents. It addresses the challenges of digitizing diverse unstructured documents and constructing a high-quality knowledge graph with less effort. The tool combines bottom-up and top-down approaches, utilizing a human-LLM collaborative interface to enhance the generated knowledge graph. It organizes the knowledge graph into MetaKG, LayoutKG, and SemanticKG, providing a comprehensive view of document content. Docs2KG aims to streamline the process of knowledge graph construction and offers metrics for evaluating the quality of automatic construction.
13 - OpenAI Gpts
Inductive Logic Problem Solver
Friendly ILP (Inductive Logic Programming) expert, engaging and supportive. Give examples in form of pos(...) and neg(...) examples.
FeynMind
Polymath inspired by Feynman's analogy-driven teaching. Useful for understanding complex technical subjects in novel ways.
Gödel's Phenomena Analyst
An inquisitive researcher linking mysteries to science. Member of the Hipster Energy Team. https://hipster.energy/team
AI Expert for Manual Creation
This prompt acts as an expert in AI and a specific field, designing educational and attractive manuals for a defined audience. He specializes in integrating advanced knowledge and NLP techniques to generate high-quality content.
Complex Knowledge Atomizer
I refine complex knowledge into granular, integrated solutions.