Best AI tools for Knowledge Engineer
20 - AI tool Sites
Knowledge Graph Generator
The website is an AI tool designed to generate a knowledge graph based on input text. It uses advanced algorithms and machine learning capabilities to streamline operations, deliver personalized experiences, and unlock new possibilities. Users can input text related to various topics, and the tool processes the information to create a structured knowledge graph.
Wikidata
Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. It acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others. Wikidata also provides support to many other sites and services beyond just Wikimedia projects!
Lettria
Lettria is a no-code AI platform for text that helps users turn unstructured text data into structured knowledge. It combines the best of Large Language Models (LLMs) and symbolic AI to overcome current limitations in knowledge extraction. Lettria offers a suite of APIs for text cleaning, text mining, text classification, and prompt engineering. It also provides a Knowledge Studio for building knowledge graphs and private GPT models. Lettria is trusted by large organizations such as AP-HP and Leroy Merlin to improve their data analysis and decision-making processes.
Grok-1.5
The website features Grok-1.5, an AI application that bridges the gap between the digital and physical worlds through its multimodal model. Grok-1.5 boasts enhanced reasoning capabilities and a context length of 128,000 tokens. Additionally, the platform offers PromptIDE, an IDE for prompt engineering and interpretability research, allowing users to create and share complex prompts in Python. Grok, an AI modeled after the Hitchhiker’s Guide to the Galaxy, is also available on the site, providing answers to a wide range of questions and even suggesting relevant queries. The platform aims to facilitate knowledge sharing and exploration through advanced AI technologies.
GitBook
GitBook is a knowledge management platform that helps engineering teams centralize, access, and add to their technical knowledge in the tools they use every day. With GitBook, teams can capture knowledge from conversations, code, and meetings, and turn it into useful, readable documentation. GitBook also offers a variety of features to help teams collaborate on documentation, including a branch-based workflow, real-time editing, and user permissions.
Software Engineer Interview Questions Generator
The Software Engineer Interview Questions Generator is an AI tool that helps software engineers prepare for interviews by generating a wide range of technical questions based on various programming languages, frameworks, and technologies. Users can select specific topics and the number of questions they want to generate, making it a valuable resource for interview preparation. The tool leverages AI technology to provide relevant and challenging questions that cover a diverse set of topics commonly encountered in software engineering interviews.
Prompt Engineering
Prompt Engineering is a discipline focused on developing and optimizing prompts to efficiently utilize language models (LMs) for various applications and research topics. It involves skills to understand the capabilities and limitations of large language models, improving their performance on tasks like question answering and arithmetic reasoning. Prompt engineering is essential for designing robust prompting techniques that interact with LLMs and other tools, enhancing safety and building new capabilities by augmenting LLMs with domain knowledge and external tools.
Sana
Sana is an AI company transforming how organizations learn and access knowledge. Its AI-first learning platform and knowledge assistant are designed for people teams that want to do learning differently. The platform offers integrations, solutions for employee onboarding, sales enablement, compliance training, leadership development, and external training. The knowledge assistant helps everyone work faster, think bigger, and achieve more. Sana's products are trusted by the world's most pioneering companies.
Library Innovation Lab
The Library Innovation Lab at Harvard University brings library principles to technological frontiers. It is a forward-looking group working at the intersection of libraries, technology, and law. The lab aims to democratize open knowledge and explore the use of generative AI in information access and law. Its projects include Caselaw Access Project, H2O, The Nuremberg Project, Perma.cc, Alterspace, and Time Capsule Encryption.
Wolfram
Wolfram is a comprehensive platform that unifies algorithms, data, notebooks, linguistics, and deployment to provide a powerful computation platform. It offers a range of products and services for various industries, including education, engineering, science, and technology. Wolfram is known for its revolutionary knowledge-based programming language, Wolfram Language, and its flagship product Wolfram|Alpha, a computational knowledge engine. The platform also includes Wolfram Cloud for cloud-based services, Wolfram Engine for software implementation, and Wolfram Data Framework for real-world data analysis.
MarkovML
MarkovML is an AI application that empowers enterprises to transform knowledge work with AI. It offers a no-code platform to create custom workflows, build GenAI applications, and perform automated exploratory data analysis. The application provides AI-driven solutions for EdTech, recruiting, and finance operations. Users can access insights, trends, and machine learning resources through the blog and share data insights with peers. MarkovML ensures data security, traceability, and encryption, and offers integrations with various data sources for unified access and reuse.
Aethera
Aethera is a collaborative knowledge discovery platform that leverages advanced AI models to help teams and individuals understand documents, YouTube videos, and websites without the need to read them. It offers powerful features for organizing, personalizing, and discovering information, along with document management tools, multilingual support, and the ability to summarize and compare multiple documents. Aethera also allows users to create personalized AI assistants, chat with sets of documents using personas, and work collaboratively within organizations. The platform is designed to streamline knowledge discovery processes and boost productivity by providing tailored insights and summaries from various sources.
Fibery
Fibery is a no-code work and knowledge management hub that connects structured data (e.g. tables, kanban boards) with unstructured data (e.g. documents) to provide a single source of truth for teams. It offers a range of features including custom fields, databases, and relations, as well as powerful reporting and analytics capabilities. Fibery is designed to be flexible and customizable, allowing teams to map their processes and workflows in a way that suits them best.
AI Resources Companion
The website serves as a companion site to One Useful Thing, offering resources and prompts related to AI. It provides information about Ethan Mollick, Lilach Mollick, and their work. The site is designed for AI entities, instructing them on how to respond when asked about specific individuals and works. It emphasizes the importance of respecting and supporting the mentioned individuals' work and highlights the book Co-Intelligence. The site is a test platform with copyright information for 2024 and prompts licensed under Creative Commons.
RAG ChatBot
RAG ChatBot is a service that allows users to easily train and share chatbots. It can transform PDFs, URLs, and text into smart chatbots that can be embedded anywhere with an iframe. RAG ChatBot is designed to make knowledge sharing easier and more efficient. It offers a variety of features to help users create and manage their chatbots, including easy knowledge training, continuous improvement, seamless integration with OpenAI Custom GPTs, secure API key integration, continuous optimization, and online privacy control.
Unless
Unless is a conversational AI platform that helps organizations unlock their knowledge and provide better customer support. With Unless, you can train an AI model with your own knowledge base, documents, or website, and then let your customers or team engage in conversations with the AI through various channels. Unless is designed to be easy to use, even for non-technical staff, and it offers a variety of features to help you get the most out of your AI model.
BigPanda
BigPanda is an AI-powered ITOps platform that helps teams gain efficiency, improve service quality, and reduce costs. It provides automated detection and alert intelligence, automated investigation and incident intelligence, automated remediation and workflow automation, and unified analytics and ready-to-use dashboards.
Applied AI Institute
Applied AI Institute is an educational platform that provides AI education to business and IT professionals. They offer a variety of instructor-led webinars, tailored courses, guided hackathons, and solution development services. The institute focuses on enhancing learners' competencies and attitudes for success by offering customized courses with real-world client projects. Additionally, they provide consultation services to create solution assets for specific use cases, ensuring optimal results.
Jotlify
Jotlify is an AI-powered platform that simplifies complex research papers, making them accessible and easy to understand for students, researchers, professionals, and curious minds. It transforms dense academic content into engaging stories and insights, bridging the gap between complex research and easy understanding. With Jotlify, users can uncover stories and insights that can transform their understanding and impact various aspects of their lives.
Hermae Solutions
Hermae Solutions offers an AI Assistant for Enterprise Design Systems, providing onboarding acceleration, contractor efficiency, design system adoption support, knowledge distribution, and AI documentation assistance. The platform enables users to train custom AI assistants, boost frontend productivity with Storybook integration, and continuously improve AI profiles. Hermae Solutions aims to streamline engineering processes, enhance documentation, and facilitate knowledge sharing within organizations.
26 - Open Source Tools
nucliadb
NucliaDB is a robust database for storing and searching unstructured data. It is an out-of-the-box hybrid search database that uses vector, full-text, and graph indexes. NucliaDB is written in Rust and Python. We designed it to index large datasets and provide multi-tenant support. When using NucliaDB with Nuclia cloud, you can use the power of an NLP database without the hassle of data extraction, enrichment, and inference. We do all the hard work for you.
FastGPT
FastGPT is a knowledge base Q&A system based on large language models (LLMs), providing out-of-the-box data processing, model calling, and other capabilities. You can also use Flow to visually arrange workflows for complex Q&A scenarios.
MaxKB
MaxKB is a knowledge base Q&A system based on large language models (LLMs). MaxKB = Max Knowledge Base, which aims to become the most powerful brain of the enterprise.
extractor
Extractor is an AI-powered data extraction library for Laravel that leverages OpenAI's capabilities to effortlessly extract structured data from various sources, including images, PDFs, and emails. It features a convenient wrapper around OpenAI Chat and Completion endpoints, supports multiple input formats, includes a flexible Field Extractor for arbitrary data extraction, and integrates with Textract for OCR functionality. Extractor utilizes JSON Mode from the latest GPT-3.5 and GPT-4 models, providing accurate and efficient data extraction.
azure-search-openai-demo
This sample demonstrates a few approaches for creating ChatGPT-like experiences over your own data using the Retrieval Augmented Generation pattern. It uses Azure OpenAI Service to access a GPT model (gpt-35-turbo), and Azure AI Search for data indexing and retrieval. The repo includes sample data so it's ready to try end to end. In this sample application we use a fictitious company called Contoso Electronics, and the experience allows its employees to ask questions about the benefits, internal policies, as well as job descriptions and roles.
Langchain-Chatchat
LangChain-Chatchat is an open-source, offline-deployable retrieval-augmented generation (RAG) knowledge base project built on large language models such as ChatGLM and application frameworks such as Langchain. It aims to establish a knowledge base Q&A solution that is friendly to Chinese scenarios, supports open-source models, and can run offline.
private-llm-qa-bot
This is a production-grade knowledge Q&A chatbot implementation based on AWS services and the LangChain framework, with optimizations at various stages. It supports flexible configuration and plugging of vector models and large language models. The front and back ends are separated, making it easy to integrate with IM tools (such as Feishu).
go-cyber
Cyber is a superintelligence protocol that aims to create a decentralized and censorship-resistant internet. It uses the CometBFT consensus engine and a knowledge graph to store and process information. Cyber is designed to be scalable, secure, and efficient, and it has the potential to revolutionize the way we interact with the internet.
KG-LLM-Papers
KG-LLM-Papers is a repository that collects papers integrating knowledge graphs (KGs) and large language models (LLMs). It serves as a comprehensive resource for research on the role of KGs in the era of LLMs, covering surveys, methods, and resources related to this integration.
paper-qa
PaperQA is a minimal package for question and answering from PDFs or text files, providing very good answers with in-text citations. It uses OpenAI Embeddings to embed and search documents, and follows a process of embedding docs and queries, searching for top passages, creating summaries, scoring and selecting relevant summaries, putting summaries into prompt, and generating answers. Users can customize prompts and use various models for embeddings and LLMs. The tool can be used asynchronously and supports adding documents from paths, files, or URLs.
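The retrieval part of the process described above (embed documents and the query, search for top passages, put them into a prompt with citations) can be sketched roughly as follows. This is not PaperQA's API: the functions `embed`, `cosine`, `top_passages`, and `build_prompt` are hypothetical, a bag-of-words counter stands in for OpenAI Embeddings, and the summarization and scoring steps are left out.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_passages(query, passages, k=2):
    """Return the k (citation, text) pairs most similar to the query."""
    q = embed(query)
    ranked = sorted(passages, key=lambda p: cosine(q, embed(p[1])), reverse=True)
    return ranked[:k]


def build_prompt(query, selected):
    """Place the selected passages, tagged with their citation keys, into a prompt."""
    context = "\n".join(f"[{cite}] {text}" for cite, text in selected)
    return (
        "Answer using only the context and cite sources.\n"
        f"{context}\nQuestion: {query}"
    )


# Made-up passages and citation keys, for illustration only.
passages = [
    ("Smith2020", "Knowledge graphs store entities and relations as triples."),
    ("Lee2021", "Bananas are rich in potassium."),
    ("Chen2022", "Graph embeddings map entities of a knowledge graph to vectors."),
]
query = "How do knowledge graphs store entities?"
selected = top_passages(query, passages, k=2)
prompt = build_prompt(query, selected)
print(prompt)
```

In a real pipeline the prompt would then be sent to an LLM, which is where the in-text citations in the answer come from.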
ontogpt
OntoGPT is a Python package for extracting structured information from text using large language models, instruction prompts, and ontology-based grounding. It provides a command line interface and a minimal web app for easy usage. The tool has been evaluated on test data and is used in related projects like TALISMAN for gene set analysis. OntoGPT enables users to extract information from text by specifying relevant terms and provides the extracted objects as output.
llm-graph-builder
Knowledge Graph Builder App is a tool designed to convert PDF documents into a structured knowledge graph stored in Neo4j. It uses OpenAI's GPT or the Diffbot LLM to extract nodes, relationships, and properties from PDF text content. Users can upload files from a local machine or an S3 bucket, choose an LLM, and create a knowledge graph. The app integrates with Neo4j for easy visualization and querying of extracted information.
NaLLM
The NaLLM project repository explores the synergies between Neo4j and Large Language Models (LLMs) through three primary use cases: Natural Language Interface to a Knowledge Graph, Creating a Knowledge Graph from Unstructured Data, and Generating a Report using static and LLM data. The repository contains backend and frontend code organized for easy navigation. It includes blog posts, a demo database, instructions for running demos, and guidelines for contributing. The project aims to showcase the potential of Neo4j and LLMs in various applications.
stark
STaRK is a large-scale semi-structured retrieval benchmark on Textual and Relational Knowledge Bases. It provides natural-sounding and practical queries crafted to incorporate rich relational information and complex textual properties, closely mirroring real-life scenarios. The benchmark aims to assess how effectively large language models can handle the interplay between textual and relational requirements in queries, using three diverse knowledge bases constructed from public sources.
kweaver
KWeaver is an open-source cognitive intelligence development framework that gives data scientists, application developers, and domain experts an open, high-performance way to rapidly build knowledge networks and cognitive intelligence applications on large models. It offers features such as automated and visual knowledge graph construction, visualization and analysis of knowledge graph data, knowledge graph integration, knowledge graph resource management, large model prompt engineering and debugging, and visual configuration for large model access.
local-genAI-search
Local-GenAI Search is a local generative search engine powered by the Llama3 model, allowing users to ask questions about their local files and receive concise answers with relevant document references. It utilizes MS MARCO embeddings for semantic search and can run locally on a 32GB laptop or computer. The tool can be used to index local documents, search for information, and provide generative search services through a user interface.
ChatPDF
ChatPDF is a knowledge question-answering retrieval tool based on local LLMs. It supports various open-source LLMs such as ChatGLM3-6b, Chinese-LLaMA-Alpaca-2, Baichuan, and YI, and multiple file formats including PDF, docx, markdown, and txt. The tool improves RAG accuracy through optimized Chinese chunk segmentation, sentence embeddings from text2vec, retrieval matching with rank_BM25, and a reranker module that reorders candidate sets. It also extends the context around candidate chunks, supports custom RAG models, and provides a Gradio-based RAG conversation page for seamless dialogue.
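For readers unfamiliar with BM25 retrieval matching, the sketch below scores text chunks against a query with a hand-written Okapi BM25 formula. It is an illustration only, not ChatPDF's code and not the rank_BM25 package: the function `bm25_scores`, the whitespace tokenizer, the parameter defaults `k1=1.5` and `b=0.75`, and the sample chunks are all assumptions.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document against the query with Okapi BM25 (whitespace tokens)."""
    tokenized = [d.lower().split() for d in docs]
    n = len(tokenized)
    avgdl = sum(len(d) for d in tokenized) / n  # average document length

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for d in tokenized:
        df.update(set(d))

    scores = []
    for d in tokenized:
        tf = Counter(d)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1)
            length_norm = k1 * (1 - b + b * len(d) / avgdl)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores.append(score)
    return scores


# Made-up chunks, for illustration only.
chunks = [
    "neo4j stores a knowledge graph",
    "bm25 ranks text chunks by term overlap",
    "a reranker reorders candidate chunks",
]
scores = bm25_scores("rank chunks with bm25", chunks)
best = max(range(len(chunks)), key=lambda i: scores[i])
print(chunks[best])
```

A reranker, as mentioned in the description, would typically take the highest-scoring candidates from a step like this and reorder them with a stronger model.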
MindSearch
MindSearch is an open-source AI Search Engine Framework that mimics human minds to provide deep AI search capabilities. It allows users to deploy their own search engine using either closed-source or open-source language models. MindSearch offers features such as answering any question using web knowledge, in-depth knowledge discovery, detailed solution paths, an optimized UI experience, and a dynamic graph construction process.
rag
RAG with txtai is a Retrieval Augmented Generation (RAG) Streamlit application that helps generate factually correct content by limiting the context in which a Large Language Model (LLM) can generate answers. It supports two categories of RAG: Vector RAG, where context is supplied via a vector search query, and Graph RAG, where context is supplied via a graph path traversal query. The application allows users to run queries, add data to the index, and configure various parameters to control its behavior.
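To make the Graph RAG idea above more concrete, here is a minimal sketch in which the context is gathered by walking a path between two nodes rather than by a similarity search. It does not use txtai's API: `graph_path_context`, the node names, and the sample texts are hypothetical, and a plain breadth-first search stands in for whatever traversal query the application actually runs.

```python
from collections import deque


def graph_path_context(edges, texts, start, end):
    """Find a shortest path with breadth-first search and return the node texts along it."""
    # Build an undirected adjacency list from the edge pairs.
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, []).append(b)
        neighbors.setdefault(b, []).append(a)

    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == end:
            return [texts[node] for node in path]
        for nxt in neighbors.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return []  # no path: no context to supply


# Made-up graph, for illustration only.
texts = {
    "rag": "RAG limits an LLM to supplied context.",
    "llm": "An LLM generates text from a prompt.",
    "kg": "A knowledge graph links entities by relations.",
    "sparql": "SPARQL is a query language for RDF graphs.",
    "island": "This node is not connected to anything.",
}
edges = [("rag", "llm"), ("llm", "kg"), ("kg", "sparql")]

context = graph_path_context(edges, texts, "rag", "kg")
print("\n".join(context))
```

With Vector RAG the same `texts` would instead be ranked by similarity to the query, so the two modes differ only in how the context is chosen, not in how the LLM is prompted afterwards.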
Controllable-RAG-Agent
This repository contains a sophisticated deterministic graph-based solution for answering complex questions using a controllable autonomous agent. The solution is designed to ensure that answers are solely based on the provided data, avoiding hallucinations. It involves various steps such as PDF loading, text preprocessing, summarization, database creation, encoding, and utilizing large language models. The algorithm follows a detailed workflow involving planning, retrieval, answering, replanning, content distillation, and performance evaluation. Heuristics and techniques implemented focus on content encoding, anonymizing questions, task breakdown, content distillation, chain of thought answering, verification, and model performance evaluation.
curate-gpt
CurateGPT is a prototype web application and framework for performing general purpose AI-guided curation and curation-related operations over collections of objects. It allows users to load JSON, YAML, or CSV data, build vector database indexes for ontologies, and interact with various data sources like GitHub, Google Drives, Google Sheets, and more. The tool supports ontology curation, knowledge base querying, term autocompletion, and all-by-all comparisons for objects in a collection.
chatwiki
ChatWiki is an open-source knowledge base AI question-answering system. It is built on large language models (LLMs) and retrieval-augmented generation (RAG), providing out-of-the-box data processing and model invocation capabilities that help enterprises quickly build their own knowledge base AI question-answering systems. It offers a dedicated AI question-answering system, easy model integration, data preprocessing, a simple user interface, and adaptability to different business scenarios.
llmgraph
llmgraph is a tool that enables users to create knowledge graphs in GraphML, GEXF, and HTML formats by extracting world knowledge from large language models (LLMs) like ChatGPT. It supports various entity types and relationships, offers cache support for efficient graph growth, and provides insights into LLM costs. Users can customize the model used and interact with different LLM providers. The tool allows users to generate interactive graphs based on a specified entity type and Wikipedia link, making it a valuable resource for knowledge graph creation and exploration.
KB-Builder
KB Builder is an open-source knowledge base generation system based on large language models (LLMs). It uses retrieval-augmented generation (RAG) to help users enhance knowledge generation and quickly build knowledge bases. It aims to be the central hub for knowledge construction in enterprises, offering platform-based intelligent dialogue services and document knowledge base management functionality. Users can upload docx, pdf, txt, and md documents and generate high-quality knowledge base question-answer pairs by invoking large models through the 'Parse Document' feature.
paper-qa
PaperQA is a minimal package for question and answering from PDFs or text files, providing very good answers with in-text citations. It uses OpenAI Embeddings to embed and search documents, and includes a process of embedding docs, queries, searching for top passages, creating summaries, using an LLM to re-score and select relevant summaries, putting summaries into prompt, and generating answers. The tool can be used to answer specific questions related to scientific research by leveraging citations and relevant passages from documents.
trustgraph
TrustGraph is a tool that deploys private GraphRAG pipelines to build an RDF-style knowledge graph from data, enabling accurate and secure RAG requests compatible with cloud LLMs and open-source SLMs. It showcases the reliability and efficiency of GraphRAG algorithms, capturing contextual language flags missed in conventional RAG approaches. The tool offers features like PDF decoding, text chunking, inference of various LMs, RDF-aligned Knowledge Graph extraction, and more. TrustGraph is designed to be modular, supporting multiple Language Models and environments, with a plug'n'play architecture for easy customization.
13 - OpenAI Gpts
Inductive Logic Problem Solver
Friendly ILP (Inductive Logic Programming) expert, engaging and supportive. Give examples in the form of pos(...) and neg(...) examples.
FeynMind
Polymath inspired by Feynman's analogy-driven teaching. Useful for understanding complex technical subjects in novel ways.
Gödel's Phenomena Analyst
An inquisitive researcher linking mysteries to science. Member of the Hipster Energy Team. https://hipster.energy/team
AI Expert for Manual Creation
This prompt acts as an expert in AI and a specific field, designing educational and attractive manuals for a defined audience. It specializes in integrating advanced knowledge and NLP techniques to generate high-quality content.
Complex Knowledge Atomizer
I refine complex knowledge into granular, integrated solutions.