AI-LLM-ML-CS-Quant-Review

In-depth review of industry trends in AI, LLMs, Machine Learning, Computer Science, and Quantitative Finance.

Stars: 380

Visit

This repository provides an in-depth review of industry trends in AI, Large Language Models (LLMs), Machine Learning, Computer Science, and Quantitative Finance. It covers various topics such as NVIDIA GTC conferences, DeepSeek theory and applications, LangGraph & Cursor AI, LLM essentials, system design, computer systems, big data and AI in finance, C++ design patterns, high-frequency finance, machine learning for algorithmic trading, stochastic volatility modeling, and quant job interview questions.

README:

Views:

• Followers:

• Stars:

AI-LLM-ML-CS-Quant-Review

In-depth review of industry trends in AI, LLMs, Machine Learning, Computer Science, and Quantitative Finance.

1. NVIDIA GTC | AI Conference for Developers
2. Agentic AI Summit
3. LLM Essentials
4. DeepSeek & Kimi
5. 2025 Paper Reading
6. LangGraph & Cursor AI Projects
7. System Design
- ByteByteGo - GenAI/ML/Modern System Design Interview
- Educative - GenAI/Modern System Design Interview
8. Computer Systems
9. Big Data and AI in Finance, Econometrics and Statistics Conference, UChicago 2024
10. C++ Design Patterns and Derivatives Pricing
11. High-Frequency Finance
12. Machine Learning for Algorithmic Trading
13. Stochastic Volatility Modeling
14. Quant Job Interview Questions
- Star History

1. NVIDIA GTC | AI Conference for Developers

2025 NVIDIA GTC Conference − Technical & Industrial Insight

GTC 2024 Notes-Chinese

2. Agentic AI Summit

2025 Agentic AI Summit Berkeley − Technical & Industrial Insight

3. LLM Essentials

LLM Theory

Dive into DeepSeek LLM, by Xiaojing Ding, 2025

Notes-Chinese

DeepSeek Large Model High-Performance Core Technology and Multimodal Fusion Development, by Xiaohua Wang, 2025

Notes-Chinese

Efficient Training in PyTorch, by Ailing Zhang, 2024

Notes-Chinese

Generative AI on AWS, by Chris Fregly, 2024

Notes-Chinese

LLM from Theory to Practice, by Qi Zhang, 2024

Notes-Chinese

LangChain Scalable LLM Apps, by Teli Li, 2024

Notes-Chinese

Foundations of LLMs - by Yuren Mao, Zhejiang University, 2024

Course Github | Course Video | Textbook | PDF Notes-Chinese

30 Essential Questions and Answers on Machine Learning and AI - by Sebastian Raschka, 2025

Notes-Chinese

Unveiling Large Model, by Liang Wen, 2025

Notes-Chinese

GeekBang: AI LLM Practice | Notes-Chinese

GeekBang: AI LLM System | Notes-Chinese

LLM Applications

GeekBang: AI LLM Project Implementation | Notes-Chinese

GeekBang: LLM App Developmenmt | Notes-Chinese

RAG

Educative: Advanced RAG Techniques - Choosing the Right Approach | Notes

GeekBang: RAG Development | Notes-Chinese

Multi-Agent

Educative: Build AI Agents and Multi-Agent Systems with CrewAI | Notes

GeekBang: AI Agents | Notes-Chinese

4. DeepSeek & Kimi

Research Implementation

Github: Mixture-of-Experts (MoE) Implementation in PyTorch

Github: MiniGPT-and-DeepSeek-MLA-Multi-Head-Latent-Attention

DeepSeek Theory

Educative: Everything You Need to Know About DeepSeek | Notes

Zomi-Bilibili | Github | Notes-Chinese

DeepSeek Applications

GeekBang: DeepSeek HandsOn | Notes-Chinese

GeekBang: DeepSeek App Development | Notes-Chinese

Kimi K2

Kimi K2 | 🚀 Understand Kimi K2 in 10 Minutes

5. 2025 Paper Reading

World Models | LinkedIn: World Model: 5 Debates Between Eric Xing's PAN & Yann LeCun’s JEPA

6. LangGraph & Cursor AI Projects

Notes

GitHub Projects

7. System Design

ByteByteGo - GenAI/ML/Modern System Design Interview

System Design Interview, An Insider's Guide, Second Edition - by Alex Xu, 2020

Book Link | PDF Notes-Chinese

Generative AI System Design Interview - by Ali Aminian, Hao Sheng, 2024

Book Link | Markdown Notes

Machine Learning System Design Interview - by Ali Aminian, Alex Xu, 2023

Book Link | Markdown Notes

Educative - GenAI/Modern System Design Interview

Educative - Grokking System Design Interview | PDF Notes | Markdown Notes

Educative - Grokking the Modern System Design Interview | Markdown Notes

Educative - GenAI System Design | Notes

8. Computer Systems

计算机底层的秘密，陆小风 - 2023，电子工业出版社

Book Link | PDF Notes-Chinese

9. Big Data and AI in Finance, Econometrics and Statistics Conference, UChicago 2024

BDAI Conference, 2024 Oct 3-5, UChicago

Abstract PDF | Agenda PDF | High Level Overview Notes PDF | Conference Review Notes PDF

10. C++ Design Patterns and Derivatives Pricing

C++ Design Patterns and Derivatives Pricing (Mathematics, Finance and Risk, Series Number 2) 2nd Edition, by M. S. Joshi

Book Link | PDF Notes | Markdown Notes

11. High-Frequency Finance

An Introduction to High-Frequency Finance, by Ramazan Gençay, et al.

Book Link | PDF Notes | Markdown Notes

12. Machine Learning for Algorithmic Trading

Machine Learning for Algorithmic Trading: Predictive models to extract signals from market and alternative data for systematic trading strategies with Python, 2nd Edition Paperback – by Stefan Jansen 2020

Book Link | PDF Notes | Markdown Notes

13. Stochastic Volatility Modeling

Stochastic Volatility Modeling (Chapman and Hall/CRC Financial Mathematics Series) 1st Edition, by Lorenzo Bergomi

Book Link | PDF Char 1 Intro | Markdown Char 1 Intro | PDF Char 2 Local Vol | Markdown Char 2 Local Vol

14. Quant Job Interview Questions

Quant Job Interview Questions and Answers (Second Edition) – by Mark Joshi 2013

Book Link | Markdown Notes

Cloud Platform Notes | Quant Notes | FX Exotic Derivatives Notes | Risk Methodologies Notes

Connect with me: GitHub • Resume • LinkedIn • X • Email • Instagram • Facebook • Douban • WeChat

Star History

For Tasks:

Click tags to check more tools for each tasks

analyze trends implement llms design ai systems develop financial models prepare for quant job interviews

For Jobs:

data scientist machine learning engineer quantitative analyst ai researcher financial analyst

Alternative AI tools for AI-LLM-ML-CS-Quant-Review

Similar Open Source Tools

AI-LLM-ML-CS-Quant-Review

github

: 380

AI-LLM-ML-CS-Quant-Overview

AI-LLM-ML-CS-Quant-Overview is a repository providing overview notes on AI, Large Language Models (LLM), Machine Learning (ML), Computer Science (CS), and Quantitative Finance. It covers various topics such as LangGraph & Cursor AI, DeepSeek, MoE (Mixture of Experts), NVIDIA GTC, LLM Essentials, System Design, Computer Systems, Big Data and AI in Finance, Econometrics and Statistics Conference, C++ Design Patterns and Derivatives Pricing, High-Frequency Finance, Machine Learning for Algorithmic Trading, Stochastic Volatility Modeling, Quant Job Interview Questions, Distributed Systems, Language Models, Designing Machine Learning Systems, Designing Data-Intensive Applications (DDIA), Distributed Machine Learning, and The Elements of Quantitative Investing.

github

: 52

AI-LLM-ML-CS-Quant-Readings

AI-LLM-ML-CS-Quant-Readings is a repository dedicated to taking notes on Artificial Intelligence, Large Language Models, Machine Learning, Computer Science, and Quantitative Finance. It contains a wide range of resources, including theory, applications, conferences, essentials, foundations, system design, computer systems, finance, and job interview questions. The repository covers topics such as AI systems, multi-agent systems, deep learning theory and applications, system design interviews, C++ design patterns, high-frequency finance, algorithmic trading, stochastic volatility modeling, and quantitative investing. It is a comprehensive collection of materials for individuals interested in these fields.

github

: 51

Embodied-AI-Guide

Embodied-AI-Guide is a comprehensive guide for beginners to understand Embodied AI, focusing on the path of entry and useful information in the field. It covers topics such as Reinforcement Learning, Imitation Learning, Large Language Model for Robotics, 3D Vision, Control, Benchmarks, and provides resources for building cognitive understanding. The repository aims to help newcomers quickly establish knowledge in the field of Embodied AI.

github

: 4.1k

Awesome-LLM-RAG-Application

Awesome-LLM-RAG-Application is a repository that provides resources and information about applications based on Large Language Models (LLM) with Retrieval-Augmented Generation (RAG) pattern. It includes a survey paper, GitHub repo, and guides on advanced RAG techniques. The repository covers various aspects of RAG, including academic papers, evaluation benchmarks, downstream tasks, tools, and technologies. It also explores different frameworks, preprocessing tools, routing mechanisms, evaluation frameworks, embeddings, security guardrails, prompting tools, SQL enhancements, LLM deployment, observability tools, and more. The repository aims to offer comprehensive knowledge on RAG for readers interested in exploring and implementing LLM-based systems and products.

github

: 1.5k

awesome-ai-efficiency

Awesome AI Efficiency is a curated list of resources dedicated to enhancing efficiency in AI systems. The repository covers various topics essential for optimizing AI models and processes, aiming to make AI faster, cheaper, smaller, and greener. It includes topics like quantization, pruning, caching, distillation, factorization, compilation, parameter-efficient fine-tuning, speculative decoding, hardware optimization, training techniques, inference optimization, sustainability strategies, and scalability approaches.

github

: 115

cia

CIA is a powerful open-source tool designed for data analysis and visualization. It provides a user-friendly interface for processing large datasets and generating insightful reports. With CIA, users can easily explore data, perform statistical analysis, and create interactive visualizations to communicate findings effectively. Whether you are a data scientist, analyst, or researcher, CIA offers a comprehensive set of features to streamline your data analysis workflow and uncover valuable insights.

github

: 192

Video-ChatGPT

Video-ChatGPT is a video conversation model that aims to generate meaningful conversations about videos by combining large language models with a pretrained visual encoder adapted for spatiotemporal video representation. It introduces high-quality video-instruction pairs, a quantitative evaluation framework for video conversation models, and a unique multimodal capability for video understanding and language generation. The tool is designed to excel in tasks related to video reasoning, creativity, spatial and temporal understanding, and action recognition.

github

: 1.3k

happy-llm

Happy-LLM is a systematic learning tutorial for Large Language Models (LLM) that covers NLP research methods, LLM architecture, training process, and practical applications. It aims to help readers understand the principles and training processes of large language models. The tutorial delves into Transformer architecture, attention mechanisms, pre-training language models, building LLMs, training processes, and practical applications like RAG and Agent technologies. It is suitable for students, researchers, and LLM enthusiasts with programming experience, Python knowledge, and familiarity with deep learning and NLP concepts. The tutorial encourages hands-on practice and participation in LLM projects and competitions to deepen understanding and contribute to the open-source LLM community.

github

: 17.4k

PyTorch-Tutorial-2nd

The second edition of "PyTorch Practical Tutorial" was completed after 5 years, 4 years, and 2 years. On the basis of the essence of the first edition, rich and detailed deep learning application cases and reasoning deployment frameworks have been added, so that this book can more systematically cover the knowledge involved in deep learning engineers. As the development of artificial intelligence technology continues to emerge, the second edition of "PyTorch Practical Tutorial" is not the end, but the beginning, opening up new technologies, new fields, and new chapters. I hope to continue learning and making progress in artificial intelligence technology with you in the future.

github

: 2.8k

anylabeling

AnyLabeling is a tool for effortless data labeling with AI support from YOLO and Segment Anything. It combines features from LabelImg and Labelme with an improved UI and auto-labeling capabilities. Users can annotate images with polygons, rectangles, circles, lines, and points, as well as perform auto-labeling using YOLOv5 and Segment Anything. The tool also supports text detection, recognition, and Key Information Extraction (KIE) labeling, with multiple language options available such as English, Vietnamese, and Chinese.

github

: 2.6k

Code-Review-GPT-Gitlab

A project that utilizes large models to help with Code Review on Gitlab, aimed at improving development efficiency. The project is customized for Gitlab and is developing a Multi-Agent plugin for collaborative review. It integrates various large models for code security issues and stays updated with the latest Code Review trends. The project architecture is designed to be powerful, flexible, and efficient, with easy integration of different models and high customization for developers.

github

: 715

agenta

Agenta is an open-source LLM developer platform for prompt engineering, evaluation, human feedback, and deployment of complex LLM applications. It provides tools for prompt engineering and management, evaluation, human annotation, and deployment, all without imposing any restrictions on your choice of framework, library, or model. Agenta allows developers and product teams to collaborate in building production-grade LLM-powered applications in less time.

github

: 3.2k

CVPR2024-Papers-with-Code-Demo

This repository contains a collection of papers and code for the CVPR 2024 conference. The papers cover a wide range of topics in computer vision, including object detection, image segmentation, image generation, and video analysis. The code provides implementations of the algorithms described in the papers, making it easy for researchers and practitioners to reproduce the results and build upon the work of others. The repository is maintained by a team of researchers at the University of California, Berkeley.

github

: 1.2k

timeline-studio

Timeline Studio is a next-generation professional video editor with AI integration that automates content creation for social media. It combines the power of desktop applications with the convenience of web interfaces. With 257 AI tools, GPU acceleration, plugin system, multi-language interface, and local processing, Timeline Studio offers complete video production automation. Users can create videos for various social media platforms like TikTok, YouTube, Vimeo, Telegram, and Instagram with optimized versions. The tool saves time, understands trends, provides professional quality, and allows for easy feature extension through plugins. Timeline Studio is open source, transparent, and offers significant time savings and quality improvements for video editing tasks.

github

: 56

Open-dLLM

Open-dLLM is the most open release of a diffusion-based large language model, providing pretraining, evaluation, inference, and checkpoints. It introduces Open-dCoder, the code-generation variant of Open-dLLM. The repo offers a complete stack for diffusion LLMs, enabling users to go from raw data to training, checkpoints, evaluation, and inference in one place. It includes pretraining pipeline with open datasets, inference scripts for easy sampling and generation, evaluation suite with various metrics, weights and checkpoints on Hugging Face, and transparent configs for full reproducibility.

github

: 237

For similar tasks

Awesome-Segment-Anything

Awesome-Segment-Anything is a powerful tool for segmenting and extracting information from various types of data. It provides a user-friendly interface to easily define segmentation rules and apply them to text, images, and other data formats. The tool supports both supervised and unsupervised segmentation methods, allowing users to customize the segmentation process based on their specific needs. With its versatile functionality and intuitive design, Awesome-Segment-Anything is ideal for data analysts, researchers, content creators, and anyone looking to efficiently extract valuable insights from complex datasets.

github

: 926

Time-LLM

Time-LLM is a reprogramming framework that repurposes large language models (LLMs) for time series forecasting. It allows users to treat time series analysis as a 'language task' and effectively leverage pre-trained LLMs for forecasting. The framework involves reprogramming time series data into text representations and providing declarative prompts to guide the LLM reasoning process. Time-LLM supports various backbone models such as Llama-7B, GPT-2, and BERT, offering flexibility in model selection. The tool provides a general framework for repurposing language models for time series forecasting tasks.

github

: 764

crewAI

CrewAI is a cutting-edge framework designed to orchestrate role-playing autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks. It enables AI agents to assume roles, share goals, and operate in a cohesive unit, much like a well-oiled crew. Whether you're building a smart assistant platform, an automated customer service ensemble, or a multi-agent research team, CrewAI provides the backbone for sophisticated multi-agent interactions. With features like role-based agent design, autonomous inter-agent delegation, flexible task management, and support for various LLMs, CrewAI offers a dynamic and adaptable solution for both development and production workflows.

github

: 38.6k

Transformers_And_LLM_Are_What_You_Dont_Need

Transformers_And_LLM_Are_What_You_Dont_Need is a repository that explores the limitations of transformers in time series forecasting. It contains a collection of papers, articles, and theses discussing the effectiveness of transformers and LLMs in this domain. The repository aims to provide insights into why transformers may not be the best choice for time series forecasting tasks.

github

: 644

pytorch-forecasting

PyTorch Forecasting is a PyTorch-based package for time series forecasting with state-of-the-art network architectures. It offers a high-level API for training networks on pandas data frames and utilizes PyTorch Lightning for scalable training on GPUs and CPUs. The package aims to simplify time series forecasting with neural networks by providing a flexible API for professionals and default settings for beginners. It includes a timeseries dataset class, base model class, multiple neural network architectures, multi-horizon timeseries metrics, and hyperparameter tuning with optuna. PyTorch Forecasting is built on pytorch-lightning for easy training on various hardware configurations.

github

: 3.8k

spider

Spider is a high-performance web crawler and indexer designed to handle data curation workloads efficiently. It offers features such as concurrency, streaming, decentralization, headless Chrome rendering, HTTP proxies, cron jobs, subscriptions, smart mode, blacklisting, whitelisting, budgeting depth, dynamic AI prompt scripting, CSS scraping, and more. Users can easily get started with the Spider Cloud hosted service or set up local installations with spider-cli. The tool supports integration with Node.js and Python for additional flexibility. With a focus on speed and scalability, Spider is ideal for extracting and organizing data from the web.

github

: 946

AI_for_Science_paper_collection

AI for Science paper collection is an initiative by AI for Science Community to collect and categorize papers in AI for Science areas by subjects, years, venues, and keywords. The repository contains `.csv` files with paper lists labeled by keys such as `Title`, `Conference`, `Type`, `Application`, `MLTech`, `OpenReviewLink`. It covers top conferences like ICML, NeurIPS, and ICLR. Volunteers can contribute by updating existing `.csv` files or adding new ones for uncovered conferences/years. The initiative aims to track the increasing trend of AI for Science papers and analyze trends in different applications.

github

: 55

pytorch-forecasting

PyTorch Forecasting is a PyTorch-based package designed for state-of-the-art timeseries forecasting using deep learning architectures. It offers a high-level API and leverages PyTorch Lightning for efficient training on GPU or CPU with automatic logging. The package aims to simplify timeseries forecasting tasks by providing a flexible API for professionals and user-friendly defaults for beginners. It includes features such as a timeseries dataset class for handling data transformations, missing values, and subsampling, various neural network architectures optimized for real-world deployment, multi-horizon timeseries metrics, and hyperparameter tuning with optuna. Built on pytorch-lightning, it supports training on CPUs, single GPUs, and multiple GPUs out-of-the-box.

github

: 4.0k

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

github

: 980

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

github

: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

github

: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

github

: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

github

: 2.9k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

github

: 32.1k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

github

: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

github

: 675

AI-LLM-ML-CS-Quant-Review

README:

AI-LLM-ML-CS-Quant-Review

Contents

1. NVIDIA GTC | AI Conference for Developers

2. Agentic AI Summit

3. LLM Essentials

LLM Theory

LLM Applications

RAG

Multi-Agent

4. DeepSeek & Kimi

Research Implementation

DeepSeek Theory

DeepSeek Applications

Kimi K2

5. 2025 Paper Reading

6. LangGraph & Cursor AI Projects

7. System Design

ByteByteGo - GenAI/ML/Modern System Design Interview

Educative - GenAI/Modern System Design Interview

8. Computer Systems

9. Big Data and AI in Finance, Econometrics and Statistics Conference, UChicago 2024

10. C++ Design Patterns and Derivatives Pricing

11. High-Frequency Finance

12. Machine Learning for Algorithmic Trading

13. Stochastic Volatility Modeling

14. Quant Job Interview Questions

Star History

For Tasks:

For Jobs:

Alternative AI tools for AI-LLM-ML-CS-Quant-Review

Similar Open Source Tools

AI-LLM-ML-CS-Quant-Review

AI-LLM-ML-CS-Quant-Overview

AI-LLM-ML-CS-Quant-Readings

Embodied-AI-Guide

Awesome-LLM-RAG-Application

awesome-ai-efficiency

cia

Video-ChatGPT

happy-llm

PyTorch-Tutorial-2nd

anylabeling

Code-Review-GPT-Gitlab

agenta

CVPR2024-Papers-with-Code-Demo

timeline-studio

Open-dLLM

For similar tasks

Awesome-Segment-Anything

Time-LLM

crewAI

Transformers_And_LLM_Are_What_You_Dont_Need

pytorch-forecasting

spider

AI_for_Science_paper_collection

pytorch-forecasting

For similar jobs

weave

LLMStack

VisionCraft

kaito

PyRIT

tabby

spear

Magick