
Awesome-Neuro-Symbolic-Learning-with-LLM
✨✨ Latest Advances on Neuro-Symbolic Learning in the Era of Large Language Models
Stars: 53

The Awesome-Neuro-Symbolic-Learning-with-LLM repository is a curated collection of papers and resources focused on improving the reasoning and planning capabilities of Large Language Models (LLMs) and Multi-Modal Large Language Models (MLLMs) through neuro-symbolic learning. It covers a wide range of topics such as neuro-symbolic visual reasoning, program synthesis, logical reasoning, mathematical reasoning, code generation, geometric reasoning, classical planning, game AI planning, robotic planning, AI agent planning, and more. The repository provides a comprehensive overview of tutorials, workshops, talks, surveys, papers, datasets, and benchmarks related to neuro-symbolic learning with LLMs and MLLMs.
README:
✨✨ Curated collection of papers and resources on the latest advances in improving the reasoning and planning abilities of LLMs/MLLMs with neuro-symbolic learning
🗂️ Table of Contents
- Neuro-Symbolic Visual Reasoning and Program Synthesis (Tutorial, CVPR 2020)
- Neuro-Symbolic Methods for Language and Vision (Tutorial, AAAI 2022)
- AI Planning: Theory and Practice (Tutorial, AAAI 2022)
- Advances in Neuro Symbolic Reasoning and Learning (Tutorial, AAAI 2023)
- Neuro-Symbolic Approaches: Large Language Models + Tool Use (Tutorial, ACL 2023)
- Neuro-Symbolic Generative Models (Workshop, ICLR 2023)
- Neuro-Symbolic Learning and Reasoning in the Era of Large Language Models (Workshop, AAAI 2024)
- Neuro-Symbolic Concepts for Robotic Manipulation (Talk given by Jiayuan Mao) [Video]
- Building General-Purpose Robots with Compositional Action Abstractions (Talk given by Jiayuan Mao)
- Summer School on Neurosymbolic Programming
- MIT 6.S191: Neuro-Symbolic AI (Talk given by David Cox) [Video]
- NeuroSymbolic Programming [Slides]
- LLM Reasoning: Key Ideas and Limitations (Talk given by Denny Zhou)
- Inference-Time Techniques for LLM Reasoning (Talk given by Xinyun Chen)
- Neurosymbolic Reasoning for Large Language Models (Neuro-Symbolic AI Summer School at UCLA, 2024)
- Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models
- LLM Post-Training: A Deep Dive into Reasoning Large Language Models
- A Survey on Post-training of Large Language Models
- Reasoning Language Models: A Blueprint
- Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
- Logical Reasoning in Large Language Models: A Survey
- From System 1 to System 2: A Survey of Reasoning Large Language Models
- A Survey on LLM Inference-Time Self-Improvement
- Empowering LLMs with Logical Reasoning: A Comprehensive Survey
- Advancing Reasoning in Large Language Models: Promising Methods and Approaches
- A Survey on Deep Learning for Theorem Proving
- A Survey of Mathematical Reasoning in the Era of Multi-Modal Large Language Model: Benchmark, Method & Challenges
- Multi-Modal Chain-of-Thought Reasoning: A Comprehensive Survey
- Exploring the Reasoning Abilities of Multi-Modal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
- A Survey on Large Language Models for Automated Planning
- A Survey of Optimization-based Task and Motion Planning: From Classical To Learning Approaches
- A Survey on Large Language Model based Autonomous Agents
- Understanding the planning of LLM agents: A survey
- Introduction to AI Planning
- A Survey on Neural-symbolic Learning Systems
- Towards Cognitive AI Systems: a Survey and Prospective on Neuro-Symbolic AI
- Bridging the Gap: Representation Spaces in Neuro-Symbolic AI
- Neuro-Symbolic AI: The 3rd Wave
- Neuro-Symbolic AI and its Taxonomy: A Survey
- The third AI summer: AAAI Robert S. Engelmore Memorial Lecture
- NeuroSymbolic AI - Why, What, and How
- From Statistical Relational to Neuro-Symbolic Artificial Intelligence: a Survey
- Neuro-Symbolic Artificial Intelligence: Current Trends
- Neuro-Symbolic Reinforcement Learning and Planning: A Survey
- A Review on Neuro-symbolic AI Improvements to Natural Language Processing
- Survey on Applications of NeuroSymbolic Artificial Intelligence
- Overview of Neuro-Symbolic Integration Frameworks
Title | Venue | Date | Domain | Code |
---|---|---|---|---|
AMR-DA: Data Augmentation by Abstract Meaning Representation | ACL | 2022 | Logic Reasoning | Github |
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning | ACL | 2024 | Logic Reasoning | Github |
Neuro-Symbolic Data Generation for Math Reasoning | NeurIPS | 2024 | Math Reasoning | - |
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM | SCI-FM Workshop @ ICLR | 2025 | Legal Reasoning | Github |
AlphaIntegrator: Transformer Action Search for Symbolic Integration Proofs | Arxiv | 2024 | Theorem Proving | - |
Title | Venue | Date | Domain | Code |
---|---|---|---|---|
PAL: Program-aided Language Models | ICML | 2023 | Reasoning | Github |
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks | TMLR | 2023 | Math Reasoning | Github |
Binding Language Models in Symbolic Languages | ICLR | 2023 | Reasoning | Github |
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator | ICML | 2024 | Reasoning | Github |
CODE4STRUCT: Code Generation for Few-Shot Event Structure Prediction | ACL | 2023 | Reasoning | Github |
MathPrompter: Mathematical Reasoning using Large Language Models | ACL | 2023 | Math Reasoning | Github |
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning | ACL | 2024 | Reasoning | Github |
Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments | Arxiv | 2025 | Reasoning | - |
Code as Policies: Language Model Programs for Embodied Control | Arxiv | 2023 | Robotics | Github |
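The program-aided methods in this table (e.g., PAL, Program of Thoughts) share one core pattern: the model writes a short program for the computational part of the problem, and an interpreter, not the model, produces the final answer. The sketch below is a minimal illustration of that pattern under stated assumptions, not any paper's actual implementation; `fake_llm` is a hypothetical stand-in for a real model call that would return generated code.

```python
# Minimal sketch of program-aided reasoning: the "LLM" emits Python,
# and the Python interpreter does the arithmetic.

def fake_llm(question: str) -> str:
    # Hypothetical stub: a real system would prompt an LLM to emit
    # a `solution()` function for the given word problem.
    return (
        "def solution():\n"
        "    eggs_per_day = 16\n"
        "    eaten = 3\n"
        "    baked = 4\n"
        "    price = 2\n"
        "    return (eggs_per_day - eaten - baked) * price\n"
    )

def program_aided_answer(question: str):
    code = fake_llm(question)
    namespace = {}
    exec(code, namespace)           # offload computation to the interpreter
    return namespace["solution"]()  # the interpreter's result is the answer

print(program_aided_answer("How much does Janet make per day?"))  # 18
```

The key design choice, common to the listed methods, is that the model is never trusted to do the arithmetic itself; it only has to translate the problem into executable form.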
Title | Venue | Date | Domain | Code |
---|---|---|---|---|
Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning" | ICML | 2020 | Visual Reasoning | Github |
JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Arxiv | 2022 | Robotics | - |
What's Left? Concept Grounding with Logic-Enhanced Foundation Models | NeurIPS | 2023 | Visual Reasoning | Github |
Take A Step Back: Rethinking the Two Stages in Visual Reasoning | ECCV | 2024 | Visual Reasoning | Github |
DiLA: Enhancing LLM Tool Learning with Differential Logic Layer | Arxiv | 2024 | Reasoning | - |
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks | ICLR | 2024 | Reasoning | Github |
Empowering Language Models with Knowledge Graph Reasoning for Question Answering | EMNLP | 2022 | Reasoning | - |
Neuro-symbolic Training for Spatial Reasoning over Natural Language | Arxiv | 2025 | Spatial Reasoning | Github |
NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization | Arxiv | 2024 | Visual Reasoning | Github |
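Several entries above follow a two-stage recipe: a neural module maps raw input to symbolic facts, and a deterministic symbolic executor runs a program over those facts to answer the query. The toy sketch below illustrates that split; it is an invented example, not any listed system's code. `perceive` is a hypothetical stub standing in for a trained perception network, and the program format is made up for illustration.

```python
# Toy two-stage neuro-symbolic pipeline: neural perception (stubbed)
# produces symbolic scene facts; a symbolic executor answers queries.

def perceive(image):
    # Stand-in for a neural perception module; in a real system this
    # would be predicted from the image, not hard-coded.
    return [("cube", "red", "left"), ("sphere", "blue", "right")]

def execute(program, facts):
    # A tiny functional program: filter by attribute, then count.
    result = facts
    for op, arg in program:
        if op == "filter_color":
            result = [f for f in result if f[1] == arg]
        elif op == "filter_shape":
            result = [f for f in result if f[0] == arg]
        elif op == "count":
            result = len(result)
    return result

facts = perceive(None)
# "How many red objects are there?" compiled to a symbolic program:
print(execute([("filter_color", "red"), ("count", None)], facts))  # 1
```

Because the executor is deterministic and inspectable, the reasoning step can be verified independently of the neural perception, which is the main appeal of this decomposition.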
Title | Venue | Date | Domain | Code |
---|---|---|---|---|
CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution | ECAI | 2024 | Code Generation | - |
Position: LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks | ICML | 2024 | Planning | - |
RLSF: Reinforcement Learning via Symbolic Feedback | Arxiv | 2025 | Reasoning | Github |
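Entries such as CoTran and RLSF train models with feedback from symbolic tools rather than from human preferences alone: a compiler or checker scores the model's output, and that score serves as a reinforcement signal. The sketch below is a minimal, hedged illustration of the idea, not either paper's method; here Python's built-in `compile` plays the role of the symbolic checker, whereas real systems use full compilers, test suites, or symbolic execution engines.

```python
# Sketch of symbolic feedback: a symbolic tool (here Python's own
# compiler) converts a generated sample into a scalar reward.

def symbolic_reward(generated_code: str) -> float:
    try:
        compile(generated_code, "<candidate>", "exec")
        return 1.0   # sample parses: positive feedback for the policy
    except SyntaxError:
        return 0.0   # symbolic checker rejects the sample

print(symbolic_reward("x = 1 + 1"))   # 1.0
print(symbolic_reward("x = = 1"))     # 0.0
```

The appeal over learned reward models is that the signal is exact and cheap to query, though it only covers properties the symbolic tool can check.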
- Math Reasoning: GSM8K, MATH, AIME, OlympiadBench, MiniF2F, GSM Symbolic, MWPBench, AMC, AddSub, MathQA, FIMO, TRIGO, U-MATH, Mario, MultiArith, CHAMP, ARB, LeanDojo, LISA, PISA, TheoremQA, FrontierMath, Functional, TABMWP, SCIBENCH, MultiHiertt, ChartQA
- Logical Reasoning: LogicGame, LogiQA, LogiQA-v2.0, PrOntoQA, ProofWriter, BigBench, FOLIO, AbductionRules, ARC Challenge, WANLI, CLUTRR, Adversarial NLI, Adversarial ARCT
- Visual Reasoning: Visual Sudoku, CLEVR Dataset, GQA Dataset, VQA & VQA v2.0, Flickr30k entities, DAQUAR, Visual Genome, Visual7W, COCO-QA, TDIUC, SHAPES, VQA-Rephrasings, VQA P2, VQA-HAT, VQA-X, VQA-E, TallyQA, ST-VQA, Text-VQA, FVQA, OK-VQA
- Game AI Planning: Atari 100k, Procgen, Gym Retro, Malmö, Obstacle Tower, Torcs, DeepMind Lab, Hard Eight, DeepMind Control, VizDoom, Pommerman, Multiagent emergence, Google Research Football, Neural MMOs, StarCraft II, PySC2, Fever Basketball
- Robotic Planning: Mini-Behavior, CLIPort Dataset, ALFworld, VirtualHome, RocoBench, Behavior, SMART-LLM, PPNL, Robotouille
- AI Agent Planning: WebArena, OSWorld, API-Bank, TravelPlanner, ChinaTravel, TaskBench, WebShop, AgentBench, AgentGym, AgentBoard, GAIA, MINT
- RSbench: A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts
Alternative AI tools for Awesome-Neuro-Symbolic-Learning-with-LLM
Similar Open Source Tools


Awesome-Model-Merging-Methods-Theories-Applications
A comprehensive repository focusing on 'Model Merging in LLMs, MLLMs, and Beyond', providing an exhaustive overview of model merging methods, theories, applications, and future research directions. The repository covers various advanced methods, applications in foundation models, different machine learning subfields, and tasks like pre-merging methods, architecture transformation, weight alignment, basic merging methods, and more.

Awesome-LLM-Large-Language-Models-Notes
Awesome-LLM-Large-Language-Models-Notes is a repository that provides a comprehensive collection of information on various Large Language Models (LLMs) classified by year, size, and name. It includes details on known LLM models, their papers, implementations, and specific characteristics. The repository also covers LLM models classified by architecture, must-read papers, blog articles, tutorials, and implementations from scratch. It serves as a valuable resource for individuals interested in understanding and working with LLMs in the field of Natural Language Processing (NLP).

awesome-mobile-llm
Awesome Mobile LLMs is a curated list of Large Language Models (LLMs) and related studies focused on mobile and embedded hardware. The repository includes information on various LLM models, deployment frameworks, benchmarking efforts, applications, multimodal LLMs, surveys on efficient LLMs, training LLMs on device, mobile-related use-cases, industry announcements, and related repositories. It aims to be a valuable resource for researchers, engineers, and practitioners interested in mobile LLMs.

Awesome-LLM-Constrained-Decoding
Awesome-LLM-Constrained-Decoding is a curated list of papers, code, and resources related to constrained decoding of Large Language Models (LLMs). The repository aims to facilitate reliable, controllable, and efficient generation with LLMs by providing a comprehensive collection of materials in this domain.

Awesome-LLM-Safety
Welcome to our Awesome-llm-safety repository! We've curated a collection of the latest, most comprehensive, and most valuable resources on large language model safety (llm-safety). But we don't stop there; included are also relevant talks, tutorials, conferences, news, and articles. Our repository is constantly updated to ensure you have the most current information at your fingertips.

ZhiLight
ZhiLight is a highly optimized large language model (LLM) inference engine developed by Zhihu and ModelBest Inc. It accelerates the inference of models like Llama and its variants, especially on PCIe-based GPUs. ZhiLight offers significant performance advantages compared to mainstream open-source inference engines. It supports various features such as custom defined tensor and unified global memory management, optimized fused kernels, support for dynamic batch, flash attention prefill, prefix cache, and different quantization techniques like INT8, SmoothQuant, FP8, AWQ, and GPTQ. ZhiLight is compatible with OpenAI interface and provides high performance on mainstream NVIDIA GPUs with different model sizes and precisions.

Hands-On-LangChain-for-LLM-Applications-Development
Practical LangChain tutorials for developing LLM applications, including prompt templates, output parsing, chatbots memory, chains, evaluating applications, building agents using LangChain & OpenAI API, retrieval augmented generation with LangChain, documents loading, splitting, vector database & text embeddings, information retrieval, answering questions from documents, chat with files, and introduction to Open AI function calling.

TrustLLM
TrustLLM is a comprehensive study of trustworthiness in LLMs, covering principles for different dimensions of trustworthiness, an established benchmark, an evaluation and analysis of trustworthiness for mainstream LLMs, and a discussion of open challenges and future directions. Specifically, it first proposes a set of principles for trustworthy LLMs spanning eight dimensions. Based on these principles, it establishes a benchmark across six dimensions: truthfulness, safety, fairness, robustness, privacy, and machine ethics. It then presents a study evaluating 16 mainstream LLMs on over 30 datasets. The document explains how to use the trustllm Python package to quickly assess the trustworthiness of your LLM. For more details about TrustLLM, please refer to the project website.

YuLan-Mini
YuLan-Mini is a lightweight language model with 2.4 billion parameters that achieves performance comparable to industry-leading models despite being pre-trained on only 1.08T tokens. It excels in mathematics and code domains. The repository provides pre-training resources, including data pipeline, optimization methods, and annealing approaches. Users can pre-train their own language models, perform learning rate annealing, fine-tune the model, research training dynamics, and synthesize data. The team behind YuLan-Mini is AI Box at Renmin University of China. The code is released under the MIT License with future updates on model weights usage policies. Users are advised on potential safety concerns and ethical use of the model.

PredictorLLM
PredictorLLM is an advanced trading agent framework that utilizes large language models to automate trading in financial markets. It includes a profiling module to establish agent characteristics, a layered memory module for retaining and prioritizing financial data, and a decision-making module to convert insights into trading strategies. The framework mimics professional traders' behavior, surpassing human limitations in data processing and continuously evolving to adapt to market conditions for superior investment outcomes.

data-prep-kit
Data Prep Kit accelerates unstructured data preparation for LLM app developers. It allows developers to cleanse, transform, and enrich unstructured data for pre-training, fine-tuning, instruct-tuning LLMs, or building RAG applications. The kit provides modules for Python, Ray, and Spark runtimes, supporting Natural Language and Code data modalities. It offers a framework for custom transforms and uses Kubeflow Pipelines for workflow automation. Users can install the kit via PyPi and access a variety of transforms for data processing pipelines.

CogVLM2
CogVLM2 is a new generation of open source models that offer significant improvements in benchmarks such as TextVQA and DocVQA. It supports 8K content length, image resolution up to 1344 * 1344, and both Chinese and English languages. The project provides basic calling methods, fine-tuning examples, and OpenAI API format calling examples to help developers quickly get started with the model.

Prompt-Engineering-Holy-Grail
The Prompt Engineering Holy Grail repository is a curated resource for prompt engineering enthusiasts, providing essential resources, tools, templates, and best practices to support learning and working in prompt engineering. It covers a wide range of topics related to prompt engineering, from beginner fundamentals to advanced techniques, and includes sections on learning resources, online courses, books, prompt generation tools, prompt management platforms, prompt testing and experimentation, prompt crafting libraries, prompt libraries and datasets, prompt engineering communities, freelance and job opportunities, contributing guidelines, code of conduct, support for the project, and contact information.

LlamaV-o1
LlamaV-o1 is a Large Multimodal Model designed for spontaneous reasoning tasks. It outperforms various existing models on multimodal reasoning benchmarks. The project includes a Step-by-Step Visual Reasoning Benchmark, a novel evaluation metric, and a combined Multi-Step Curriculum Learning and Beam Search Approach. The model achieves superior performance in complex multi-step visual reasoning tasks in terms of accuracy and efficiency.
For similar tasks

byteir
The ByteIR Project is a ByteDance model compilation solution. ByteIR includes compiler, runtime, and frontends, and provides an end-to-end model compilation solution. Although all ByteIR components (compiler/runtime/frontends) are together to provide an end-to-end solution, and all under the same umbrella of this repository, each component technically can perform independently. The name, ByteIR, comes from a legacy purpose internally. The ByteIR project is NOT an IR spec definition project. Instead, in most scenarios, ByteIR directly uses several upstream MLIR dialects and Google Mhlo. Most of ByteIR compiler passes are compatible with the selected upstream MLIR dialects and Google Mhlo.

ScandEval
ScandEval is a framework for evaluating pretrained language models on mono- or multilingual language tasks. It provides a unified interface for benchmarking models on a variety of tasks, including sentiment analysis, question answering, and machine translation. ScandEval is designed to be easy to use and extensible, making it a valuable tool for researchers and practitioners alike.

opencompass
OpenCompass is a one-stop platform for large model evaluation, aiming to provide a fair, open, and reproducible benchmark. Its main features include:
- Comprehensive support for models and datasets: pre-support for 20+ HuggingFace and API models, and an evaluation scheme of 70+ datasets with about 400,000 questions, comprehensively evaluating model capabilities in five dimensions.
- Efficient distributed evaluation: one command to implement task division and distributed evaluation, completing the full evaluation of billion-scale models in just a few hours.
- Diversified evaluation paradigms: support for zero-shot, few-shot, and chain-of-thought evaluations, combined with standard or dialogue-type prompt templates, to easily elicit the maximum performance of various models.
- Modular design with high extensibility: new models, datasets, custom task-division strategies, and even new cluster management systems can all be easily added.
- Experiment management and reporting mechanism: config files fully record each experiment, with support for real-time reporting of results.

openvino.genai
The GenAI repository contains pipelines that implement image and text generation tasks. The implementation uses OpenVINO capabilities to optimize the pipelines. Each sample covers a family of models and suggests certain modifications to adapt the code to specific needs. It includes the following pipelines:
1. Benchmarking script for large language models
2. Text generation C++ samples that support most popular models like LLaMA 2
3. Stable Diffusion (with LoRA) C++ image generation pipeline
4. Latent Consistency Model (with LoRA) C++ image generation pipeline

GPT4Point
GPT4Point is a unified framework for point-language understanding and generation. It aligns 3D point clouds with language, providing a comprehensive solution for tasks such as 3D captioning and controlled 3D generation. The project includes an automated point-language dataset annotation engine, a novel object-level point cloud benchmark, and a 3D multi-modality model. Users can train and evaluate models using the provided code and datasets, with a focus on improving models' understanding capabilities and facilitating the generation of 3D objects.

octopus-v4
The Octopus-v4 project aims to build the world's largest graph of language models, integrating specialized models and training Octopus models to connect nodes efficiently. The project focuses on identifying, training, and connecting specialized models. The repository includes scripts for running the Octopus v4 model, methods for managing the graph, training code for specialized models, and inference code. Environment setup instructions are provided for Linux with NVIDIA GPU. The Octopus v4 model helps users find suitable models for tasks and reformats queries for effective processing. The project leverages Language Large Models for various domains and provides benchmark results. Users are encouraged to train and add specialized models following recommended procedures.

Awesome-LLM-RAG
This repository, Awesome-LLM-RAG, aims to record advanced papers on Retrieval Augmented Generation (RAG) in Large Language Models (LLMs). It serves as a resource hub for researchers interested in promoting their work related to LLM RAG by updating paper information through pull requests. The repository covers various topics such as workshops, tutorials, papers, surveys, benchmarks, retrieval-enhanced LLMs, RAG instruction tuning, RAG in-context learning, RAG embeddings, RAG simulators, RAG search, RAG long-text and memory, RAG evaluation, RAG optimization, and RAG applications.

stm32ai-modelzoo
The STM32 AI model zoo is a collection of reference machine learning models optimized to run on STM32 microcontrollers. It provides a large collection of application-oriented models ready for re-training, scripts for easy retraining from user datasets, pre-trained models on reference datasets, and application code examples generated from user AI models. The project offers training scripts for transfer learning or training custom models from scratch. It includes performances on reference STM32 MCU and MPU for float and quantized models. The project is organized by application, providing step-by-step guides for training and deploying models.
For similar jobs

weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

VisionCraft
The VisionCraft API is a free API providing access to over 100 different AI models, from image generation to sound.

kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features:
- Self-contained, with no need for a DBMS or cloud service.
- OpenAPI interface, easy to integrate with existing infrastructure (e.g., a cloud IDE).
- Supports consumer-grade GPUs.

spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.