crazyai-ml

全民瘋AI系列 [經典機器學習]

Stars: 184

Visit

The 'crazyai-ml' repository is a collection of resources related to machine learning, specifically focusing on explaining artificial intelligence models. It includes articles, code snippets, and tutorials covering various machine learning algorithms, data analysis, model training, and deployment. The content aims to provide a comprehensive guide for beginners in the field of AI, offering practical implementations and insights into popular machine learning packages and model tuning techniques. The repository also addresses the integration of AI models and frontend-backend concepts, making it a valuable resource for individuals interested in AI applications.

README:

全民瘋AI系列 [經典機器學習]

第13屆iT邦幫忙鐵人賽

2025 新書預購中！

📢 新書推播 7.9 折優惠！ 🎉

本書改寫自 「第12屆iT邦幫忙鐵人賽全民瘋AI系列」，在原有的內容基礎上，增添 十三種經典實務範例，並強化 模型部署實作，讓讀者能更全面掌握 AI 技術的實際應用。

從基礎機器學習演算法入門，本書循序漸進解析 AI 技術，涵蓋 資料處理、模型構建、優化與部署，並提供豐富的 Python 範例 與 實務應用，讓讀者不僅理解理論，更能將 AI 技術應用於實際場景。

如果你喜歡我的創作 ❤️，歡迎 購買書籍 作為支持！🙏你的支持將成為我持續開源與分享更多 AI 相關內容的動力！

新書主題曲🎶

公告

📢 [2025/02] ✨此系列出版實體書籍囉！

2/19 前預購 7.9 折優惠！ 🎉

如果你喜歡我的創作 ❤️，歡迎購買書籍作為支持！🙏 你的支持將成為我持續開源與分享更多 AI 相關內容的動力！

📢 [2025/01] 電子書新增ChatBot🤖學習小助手

點選網頁右下角 icon 即可免費快速詢問此系列電子書內容。

📢 [2024/09] 此系列新增英文版Podcast！

無論是上學或上班途中，讓我們陪伴你一起展開新的學習旅程！我們很高興宣布，此系列節目已新增英文版Podcast，適合想加強英文聽力或喜歡用英文學習的朋友們！

此Podcast內容由生成式AI產生，因此在某些情況下，可能會提供不完全準確的資訊。

快速收聽 ⬇

Soptify平台

📢 [2024/08] 此系列新增電子書版本～全民瘋AI系列 [經典機器學習]

提供方便的學習平台，匯集影片與文章形式。

傳送門 ⬇

電子書

📢 [2023/09] 新內容連載！ 2023 iThome 鐵人賽揭開黑箱模型：探索可解釋人工智慧

大家好！我有個好消息要告訴大家。今年我參加了2023年第15屆iT幫鐵人賽的AI&Data組，我的主題是「揭開黑箱模型：探索可解釋人工智慧」，這是全民瘋AI系列的進階篇。在新的系列本系列將從 XAI 的基礎知識出發，深入探討可解釋人工智慧在機器學習和深度學習中的應用、案例和挑戰，以及未來發展方向。有興趣朋友歡迎點選下面連結前來iT幫支持與訂閱。

傳送門 ⬇

2023 iThome 鐵人賽揭開黑箱模型：探索可解釋人工智慧

電子書： https://andy6804tw.github.io/2021-13th-ironman/

全民瘋AI系列電子書

全民瘋AI系列是一個專為 AI 學習資源打造的開源平台，由一群熱愛資料科學的工程師所創立。這個平台的宗旨是提供一個開放、協作的環境，讓更多人能夠方便地學習 AI 和機器學習相關技術，無論是初學者還是進階使用者，都可以在這裡找到適合的學習資源和工具。透過社群的力量，平台上的內容持續更新，涵蓋從基礎理論到實務應用，滿足不同層次的學習需求。

書名	簡介	完成進度	討論區連結
Python從零開始	適合初學者，詳細介紹Python語言的基本概念與程式設計技巧。	30%	加入討論
經典機器學習	涵蓋各種經典的機器學習模型與演算法，從理論到實踐。	100%	加入討論
探索可解釋人工智慧	介紹解釋AI模型的最新技術與方法，幫助讀者理解AI決策的背後原因。	100%	加入討論
深度學習與神經網路	深入介紹深度學習與神經網路的概念與實作，適合進階讀者。	20%	加入討論
深度強化學習	涵蓋深度強化學習的理論與應用，適合對最佳化有深入興趣的讀者。	10%	加入討論
大語言模型應用與實戰	探討 LLM 基礎、微調與應用，透過實作輕鬆上手打造專屬 AI 機器人。	10%	加入討論

鐵人賽列表

文章	程式
[Day 1] 全民瘋AI系列2.0-機器學習實戰手冊	-
[Day 2] 快來探索AI的世界	-
[Day 3] 你真了解資料嗎?試試看視覺化分析吧!	Code
[Day 4] 咱們一起做資料清理和前處理	Code
[Day 5] 機器學習大補帖	-
[Day 6] 非監督式學習 K-means 分群	Code
[Day 7] 非監督式學習-降維	Code
[Day 8] 線性迴歸 (Linear Regression)	Code
[Day 9] 邏輯迴歸 (Logistic Regression)	Code
[Day 10] 近朱者赤，近墨者黑 - KNN	KNN(Classification)、KNN(Regression)
[Day 11] 核模型 - 支持向量機 (SVM)	SVM(Classification)、SVR(Regression)
[Day 12] 決策樹 (Decision tree)	決策樹(Classification)、決策樹(Regression)
[Day 13] 整體學習 (Ensemble Learning)	-
[Day 14] 多棵決策樹更厲害：隨機森林 (Random forest)	隨機森林(Classification)、隨機森林(Regression)
[Day 15] 機器學習常勝軍 - XGBoost	XGBoost(Classification)、XGBoost(Regression)
[Day 16] 每個模型我全都要 - 堆疊法 (Stacking)	Code
[Day 17] 輕量化的梯度提升機 - LightGBM	Code
[Day 18] 機器學習 boosting 神器 - CatBoost	Code
[Day 19] 自動化機器學習 - AutoML	-
[Day 20] 機器學習金手指 - Auto-sklearn	Code
[Day 21] 調整模型超參數利器 - Optuna	Code
[Day 22] Python 視覺化解釋數據 - Plotly Express	Code
[Day 23] 資料分布與離群值處理	Code
[Day 24] 機器學習 - 不能忽視的過擬合與欠擬合	-
[Day 25] 交叉驗證 Cross-Validation 簡介	-
[Day 26] 交叉驗證 K-Fold Cross-Validation	-
[Day 27] 機器學習常犯錯的十件事	-
[Day 28] 儲存訓練好的模型	Code
[Day 29] 使用 Python Flask 架設 API 吧！	Code
[Day 30] 使用 Heroku 部署機器學習 API	Code

前言

哈囉大家好我是10程式中的10！我是上一屆鐵人賽影片教學組全民瘋AI系列的作者，當時講解了人工智慧的基礎以及常見的機器學習演算法與手把手教學。由於大家反應很熱烈，讓我看到了大家對於AI的學習熱忱。也因為上一屆獲得了影片教學組優選，收到了許多書商的出版邀請，由於我沒有時間與動力將這些大量知識寫成文章因此都婉拒了。因此我想藉由這一次鐵人賽將上一屆的影片內容整理成電子書版本，提供大家影片教學與文字版的筆記內容(唷呼書商快看過來～)當然內容會以之前影片教學為基底，並加入一些新的元素讓文章內容變得更紮實。在全新的全民瘋AI系列2.0中我會介紹實用的機器學習演算法並含有程式手把手實作，以及近年來熱門的機器學習套件與模型調參技巧。除此之外我還會提到大家最感興趣的 AI 模型落地與整合。希望在這次的鐵人賽能夠將AI的資源整理得更詳細並分享給各位。

此系列教學適合誰?

如果您是之前的舊讀者，歡迎回來為自己充電～新的系列文章保證讓你收穫滿滿！若您是新來的讀者歡迎加入人工智慧的世界，此系列文章正適合初學者閱讀。另外建議可以搭配我上一屆鐵人賽的影片教學進行學習。

系列文章內容規劃

在本次鐵人賽預計新增了許多新內容，特別是近年來比較新的演算法套件，以及在模型訓練中必須注意的大小事。本系列要在短短30天內講完所有 AI 領域相關應用是不太可能的事情，因此我的規劃是從認識人工智慧開始切入主題。先讓大家知道何謂人工智慧以及相關應用有哪些。接著帶各位了解成為資料科學家的第一步，就是資料分析與視覺化，再來會有一系列經典的機器學習演算法介紹。最後也是大家可能會有興趣的整合部分，會以實際的帶大家手把手部署我們的AI模型以及前後端串接的概念。

前置作業資源

本系列教學將有大量的程式實作，並採用 Google Colab 做為程式雲端運行的編輯執行環境。各位可以直接利用 Colab 開啟本系列文章的範例程式。在使用此平台之前每個人都必須要有自己的 Google 帳號，才能順利的開啟並執行程式碼。Colab 可讓你輕鬆地在瀏覽器上撰寫並執行 Python 程式語言，它可以說是機器學習新手的入門工具。此外 Colab 具備了以下幾個優點：

不必進行任何設定與安裝
免費額度使用 GPU、TPU 資源
輕鬆共用與分享檔案

因此讀者必須先熟悉 Colab 的操作模式，想了解該如何操作的朋友們可以先來看這部影片教學。

回報錯誤與建議

本系列文章若有問題或是內容建議都可以來 GitHub 中的 issue 提出。歡迎大家一同貢獻為這系列文章有更好的閱讀品質。

關於作者

曾任職於台灣人工智慧學校，擔任AI工程師，擁有豐富的教學經驗，熱衷於網頁前後端整合與AI演算法的開發。希望藉由鐵人賽，將所學貢獻出來，為AI領域提供更多資源。

@andy6804tw

歡迎大家訂閱我的 YouTube 頻道。

本系列教學內容都可以從我的 GitHub 取得！

For Tasks:

Click tags to check more tools for each tasks

explore ai world visualize data clean data train models deploy ai models

For Jobs:

data scientist machine learning engineer ai researcher data analyst ai solutions architect

Alternative AI tools for crazyai-ml

Similar Open Source Tools

crazyai-ml

github

: 184

2021-13th-ironman

This repository is a part of the 13th iT Help Ironman competition, focusing on exploring explainable artificial intelligence (XAI) in machine learning and deep learning. The content covers the basics of XAI, its applications, cases, challenges, and future directions. It also includes practical machine learning algorithms, model deployment, and integration concepts. The author aims to provide detailed resources on AI and share knowledge with the audience through this competition.

github

: 154

TigerBot

TigerBot is a cutting-edge foundation for your very own LLM, providing a world-class large model for innovative Chinese-style contributions. It offers various upgrades and features, such as search mode enhancements, support for large context lengths, and the ability to play text-based games. TigerBot is suitable for prompt-based game engine development, interactive game design, and real-time feedback for playable games.

github

: 2.2k

gpt_server

The GPT Server project leverages the basic capabilities of FastChat to provide the capabilities of an openai server. It perfectly adapts more models, optimizes models with poor compatibility in FastChat, and supports loading vllm, LMDeploy, and hf in various ways. It also supports all sentence_transformers compatible semantic vector models, including Chat templates with function roles, Function Calling (Tools) capability, and multi-modal large models. The project aims to reduce the difficulty of model adaptation and project usage, making it easier to deploy the latest models with minimal code changes.

github

: 163

AstrBot

AstrBot is a powerful and versatile tool that leverages the capabilities of large language models (LLMs) like GPT-3, GPT-3.5, and GPT-4 to enhance communication and automate tasks. It seamlessly integrates with popular messaging platforms such as QQ, QQ Channel, and Telegram, enabling users to harness the power of AI within their daily conversations and workflows.

github

: 6.6k

2020-12th-ironman

This repository contains tutorial content for the 12th iT Help Ironman competition, focusing on machine learning algorithms and their practical applications. The tutorials cover topics such as AI model integration, API server deployment techniques, and hands-on programming exercises. The series is presented in video format and will be compiled into an e-book in the future. Suitable for those familiar with Python, interested in implementing AI prediction models, data analysis, and backend integration and deployment of AI models.

github

: 199

AstrBot

github

: 7.0k

Chinese-LLaMA-Alpaca-2

Chinese-LLaMA-Alpaca-2 is a large Chinese language model developed by Meta AI. It is based on the Llama-2 model and has been further trained on a large dataset of Chinese text. Chinese-LLaMA-Alpaca-2 can be used for a variety of natural language processing tasks, including text generation, question answering, and machine translation. Here are some of the key features of Chinese-LLaMA-Alpaca-2: * It is the largest Chinese language model ever trained, with 13 billion parameters. * It is trained on a massive dataset of Chinese text, including books, news articles, and social media posts. * It can be used for a variety of natural language processing tasks, including text generation, question answering, and machine translation. * It is open-source and available for anyone to use. Chinese-LLaMA-Alpaca-2 is a powerful tool that can be used to improve the performance of a wide range of natural language processing tasks. It is a valuable resource for researchers and developers working in the field of artificial intelligence.

github

: 6.8k

MEGREZ

MEGREZ is a modern and elegant open-source high-performance computing platform that efficiently manages GPU resources. It allows for easy container instance creation, supports multiple nodes/multiple GPUs, modern UI environment isolation, customizable performance configurations, and user data isolation. The platform also comes with pre-installed deep learning environments, supports multiple users, features a VSCode web version, resource performance monitoring dashboard, and Jupyter Notebook support.

github

: 77

pmhub

PmHub is a smart project management system based on SpringCloud, SpringCloud Alibaba, and LLM. It aims to help students quickly grasp the architecture design and development process of microservices/distributed projects. PmHub provides a platform for students to experience the transformation from monolithic to microservices architecture, understand the pros and cons of both architectures, and prepare for job interviews. It offers popular technologies like SpringCloud-Gateway, Nacos, Sentinel, and provides high-quality code, continuous integration, product design documents, and an enterprise workflow system. PmHub is suitable for beginners and advanced learners who want to master core knowledge of microservices/distributed projects.

github

: 280

DISC-LawLLM

DISC-LawLLM is a legal domain large model that aims to provide professional, intelligent, and comprehensive **legal services** to users. It is developed and open-sourced by the Data Intelligence and Social Computing Lab (Fudan-DISC) at Fudan University.

github

: 590

Chinese-LLaMA-Alpaca

This project open sources the **Chinese LLaMA model and the Alpaca large model fine-tuned with instructions**, to further promote the open research of large models in the Chinese NLP community. These models **extend the Chinese vocabulary based on the original LLaMA** and use Chinese data for secondary pre-training, further enhancing the basic Chinese semantic understanding ability. At the same time, the Chinese Alpaca model further uses Chinese instruction data for fine-tuning, significantly improving the model's understanding and execution of instructions.

github

: 17.2k

Awesome-LWMs

Awesome Large Weather Models (LWMs) is a curated collection of articles and resources related to large weather models used in AI for Earth and AI for Science. It includes information on various cutting-edge weather forecasting models, benchmark datasets, and research papers. The repository serves as a hub for researchers and enthusiasts to explore the latest advancements in weather modeling and forecasting.

github

: 188

Awesome-LLM-for-RecSys

github

: 1.2k

MedicalGPT

MedicalGPT is a training medical GPT model with ChatGPT training pipeline, implement of Pretraining, Supervised Finetuning, RLHF(Reward Modeling and Reinforcement Learning) and DPO(Direct Preference Optimization).

github

: 3.6k

JiwuChat

JiwuChat is a lightweight multi-platform chat application built on Tauri2 and Nuxt3, with various real-time messaging features, AI group chat bots (such as 'iFlytek Spark', 'KimiAI' etc.), WebRTC audio-video calling, screen sharing, and AI shopping functions. It supports seamless cross-device communication, covering text, images, files, and voice messages, also supporting group chats and customizable settings. It provides light/dark mode for efficient social networking.

github

: 400

For similar tasks

opendataeditor

The Open Data Editor (ODE) is a no-code application to explore, validate and publish data in a simple way. It is an open source project powered by the Frictionless Framework. The ODE is currently available for download and testing in beta.

github

: 148

data-juicer

Data-Juicer is a one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs. It is a systematic & reusable library of 80+ core OPs, 20+ reusable config recipes, and 20+ feature-rich dedicated toolkits, designed to function independently of specific LLM datasets and processing pipelines. Data-Juicer allows detailed data analyses with an automated report generation feature for a deeper understanding of your dataset. Coupled with multi-dimension automatic evaluation capabilities, it supports a timely feedback loop at multiple stages in the LLM development process. Data-Juicer offers tens of pre-built data processing recipes for pre-training, fine-tuning, en, zh, and more scenarios. It provides a speedy data processing pipeline requiring less memory and CPU usage, optimized for maximum productivity. Data-Juicer is flexible & extensible, accommodating most types of data formats and allowing flexible combinations of OPs. It is designed for simplicity, with comprehensive documentation, easy start guides and demo configs, and intuitive configuration with simple adding/removing OPs from existing configs.

github

: 4.1k

OAD

OAD is a powerful open-source tool for analyzing and visualizing data. It provides a user-friendly interface for exploring datasets, generating insights, and creating interactive visualizations. With OAD, users can easily import data from various sources, clean and preprocess data, perform statistical analysis, and create customizable visualizations to communicate findings effectively. Whether you are a data scientist, analyst, or researcher, OAD can help you streamline your data analysis workflow and uncover valuable insights from your data.

github

: 132

Streamline-Analyst

Streamline Analyst is a cutting-edge, open-source application powered by Large Language Models (LLMs) designed to revolutionize data analysis. This Data Analysis Agent effortlessly automates tasks such as data cleaning, preprocessing, and complex operations like identifying target objects, partitioning test sets, and selecting the best-fit models based on your data. With Streamline Analyst, results visualization and evaluation become seamless. It aims to expedite the data analysis process, making it accessible to all, regardless of their expertise in data analysis. The tool is built to empower users to process data and achieve high-quality visualizations with unparalleled efficiency, and to execute high-performance modeling with the best strategies. Future enhancements include Natural Language Processing (NLP), neural networks, and object detection utilizing YOLO, broadening its capabilities to meet diverse data analysis needs.

github

: 301

2021-13th-ironman

github

: 154

crazyai-ml

github

: 184

ProX

ProX is a lm-based data refinement framework that automates the process of cleaning and improving data used in pre-training large language models. It offers better performance, domain flexibility, efficiency, and cost-effectiveness compared to traditional methods. The framework has been shown to improve model performance by over 2% and boost accuracy by up to 20% in tasks like math. ProX is designed to refine data at scale without the need for manual adjustments, making it a valuable tool for data preprocessing in natural language processing tasks.

github

: 164

LLM4DB

LLM4DB is a repository focused on the intersection of Large Language Models (LLMs) and Database technologies. It covers various aspects such as data processing, data analysis, database optimization, and data management for LLMs. The repository includes research papers, tools, and techniques related to leveraging LLMs for tasks like data cleaning, entity matching, schema matching, data discovery, NL2SQL, data exploration, data visualization, knob tuning, query optimization, and database diagnosis.

github

: 89

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

github

: 855

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

github

: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

github

: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

github

: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

github

: 2.3k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

github

: 30.6k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

github

: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

github

: 675