awesome-llm-understanding-mechanism

awesome papers in LLM interpretability

Stars: 376

This repository collects papers on understanding the internal mechanisms of large language models (LLMs). It covers topics such as how LLMs handle multilingualism and factual associations and how they learn in context. Through a curated list of papers and surveys, it aims to provide insight into the inner workings of transformer-based language models.

README:

Awesome Papers for Understanding LLM Mechanism

This list focuses on understanding the internal mechanisms of large language models (LLMs). Works in this list have been accepted at top conferences (e.g., ICML, NeurIPS, ICLR, ACL, EMNLP, NAACL) or written by top research institutions.

Other paper lists focus on SAEs and neurons.

To recommend a paper (accepted by a conference), please contact me.

Papers

2024

2023

2022

2021

Survey

Other good LLM repos

Why mechanistic interpretability?

From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP

Interpretability Dreams

A Longlist of Theories of Impact for Interpretability

Recommended blogs
