LLM-from-scratch

LLM-from-scratch

一些 LLM 方面的从零复现笔记

Stars: 108

Visit
 screenshot

This repository contains notes on re-implementing some LLM models from scratch. It includes steps to pre-train a super mini LLaMA 3 model, implement LoRA from scratch using PyTorch, and work on implementing the 'generate' method.

README:

LLM-from-scratch

一些 LLM 的从零复现笔记。

  • [x] 1. 从头预训练一只超迷你 LLaMA 3——复现 TinyStories
  • [x] 2. 用 PyTorch 从零实现 LoRA
  • [ ] 3. 从零实现 generate 方法

知乎链接

  1. 从头预训练一只超迷你 LLaMA 3——复现 TinyStories
  2. 用 PyTorch 从零实现 LoRA

For Tasks:

Click tags to check more tools for each tasks

For Jobs:

Alternative AI tools for LLM-from-scratch

Similar Open Source Tools

For similar tasks

For similar jobs