AcademicForge

A curated skill collection for academic writing and research

Stars: 66

Visit

Academic Forge is a collection of skills integrated for academic writing workflows. It provides a curated set of skills related to academic writing and research, allowing for precise skill calls, avoiding confusion between similar skills, maintaining focus on research workflows, and receiving timely updates from original authors. The forge integrates carefully selected skills covering various areas such as bioinformatics, clinical research, data analysis, scientific writing, laboratory automation, machine learning, databases, AI research, model architectures, fine-tuning, post-training, distributed training, optimization, inference, evaluation, agents, multimodal tasks, and machine learning paper writing. It is designed to streamline the academic writing and AI research processes by providing a cohesive and community-driven collection of skills.

README:

🎓 Academic Forge

为学术写作整合的Skills集合

English | 简体中文

📖 什么是 Forge（熔炉）？

"Forge" 这个名字灵感来自 Minecraft 的模组加载器系统（如 Forge 或 Fabric），它允许玩家无缝运行多个模组。就像 Minecraft Forge 提供的整合包为特定游戏体验集成各种模组一样，Academic Forge 为专注的学术写作工作流程集成多个 Skills。

为什么叫 "Forge"？

🔧 集成优于安装 - 就像 Minecraft 整合包，你得到的是一个精心策划、协同工作的集合
🎯 专门构建 - 每个 forge 针对特定领域（学术写作、Web 开发、数据科学等）
🔄 自动更新 - Skills通过 git submodules 保持与原始仓库的链接
🤝 社区驱动 - 建立在多个Skills创作者的优秀工作之上

🎯 用途

Academic Forge 解决了一个常见问题：太多Skills会导致 AI agent准确性下降。通过只精选与学术写作和研究相关的Skills，可以：

✅ 做出更精准的Skills调用
✅ 避免类似Skills之间的混淆
✅ 保持对研究工作流程的专注
✅ 及时获得原始作者的改进更新

📦 包含的Skills

本 forge 整合了以下精心挑选的Skills：

claude-scientific-skills (140 Skills)

作者: @k-dense-ai - By K-Dense Inc.
许可证: MIT
覆盖范围: 140 个即用型科学skills，涵盖15+领域
包含内容:
- 🧬 生物信息学与基因组学 - BioPython, Scanpy, 单细胞RNA-seq, 变异注释
- 🧪 化学信息学与药物发现 - RDKit, DeepChem, 分子对接, 虚拟筛选
- 🏥 临床研究 - ClinicalTrials.gov, ClinVar, FDA数据库, 药物基因组学
- 📊 数据分析 - 统计分析, matplotlib, seaborn, 出版级图表
- 📚 科学写作 - LaTeX格式化, 引用管理, 同行评审
- 🔬 实验室自动化 - PyLabRobot, Benchling, Opentrons集成
- 🤖 机器学习 - PyTorch Lightning, scikit-learn, 深度学习工作流
- 📚 数据库 - 28+ 科学数据库 (PubMed, OpenAlex, ChEMBL, UniProt等)
最适合: 从文献综述到论文发表的多步骤科学工作流程

AI-research-SKILLs (82 Skills)

作者: @zechenzhangAGI - By Orchestra Research
许可证: MIT
覆盖范围: 82 个专家级AI研究工程skills，涵盖20个类别
包含内容:
- 🏗️ 模型架构 - LitGPT, Mamba, RWKV, NanoGPT, TorchTitan (5个skills)
- 🎯 微调 - Axolotl, LLaMA-Factory, PEFT, Unsloth (4个skills)
- 🎓 后训练 - TRL, GRPO, OpenRLHF, SimPO, verl (8个RLHF/DPO skills)
- ⚡ 分布式训练 - DeepSpeed, FSDP, Megatron-Core, Accelerate (6个skills)
- 🚀 优化 - Flash Attention, bitsandbytes, GPTQ, AWQ (6个skills)
- 🔥 推理 - vLLM, TensorRT-LLM, SGLang, llama.cpp (4个skills)
- 📊 评估 - lm-eval-harness, BigCode, NeMo Evaluator (3个skills)
- 🤖 Agents与RAG - LangChain, LlamaIndex, Chroma, FAISS (9个skills)
- 🎨 多模态 - CLIP, Whisper, LLaVA, Stable Diffusion (7个skills)
- 📝 机器学习论文写作 - NeurIPS, ICML, ICLR, ACL的LaTeX模板 (1个skill)
文档质量: 每个skill约420行 + 300KB+参考资料
最适合: 从假设到论文发表的AI研究工作流程

humanizer

作者: @blader
许可证: 查看原始仓库
用途: 优化学术语气、提高可读性、避免 AI 检测特征
最适合: 润色草稿、保持学术声调、同行评审准备

注意: 所有Skills保留其原始许可证和作者身份。本 forge 仅提供便捷的集成。详细归属请查看 ATTRIBUTIONS.md。

🚀 快速开始

安装

直接将 Academic Forge 安装到你的 Claude Code/OpenCode 项目中：

macOS/Linux:

cd your-project
curl -sSL https://raw.githubusercontent.com/HughYau/AcademicForge/main/scripts/install.sh | bash

Windows (PowerShell):

cd your-project
irm https://raw.githubusercontent.com/HughYau/AcademicForge/main/scripts/install.ps1 | iex

或手动安装：

# 克隆包含所有 submodules
git clone --recursive https://github.com/HughYau/AcademicForge .opencode/skills/academic-forge

下载 Skills Submodules

如果你只想下载 skills 文件夹中的子模块（不包含整个仓库）：

Windows (PowerShell):

.\scripts\download-skills.ps1

Linux/macOS:

bash scripts/download-skills.sh

这些脚本将自动下载所有 skills 子模块到本地 skills/ 文件夹。

更新 Skills

保持所有 Skills 与最新改进同步：

cd .opencode/skills/academic-forge
./scripts/update.sh  # 或在 Windows 上使用 update.ps1

🔄 自动更新

本仓库配置了自动化工作流程，每周一 09:00 UTC 自动更新所有 submodules 到最新版本。这意味着：

✅ Skills 始终保持最新状态
✅ 自动获取原作者的改进和bug修复
✅ 无需手动运行更新脚本
📅 更新时间：每周一 09:00 UTC（北京时间 17:00）

🎓 使用案例

Academic Forge 非常适合：

📝 撰写研究论文 - 从大纲到提交就绪的手稿
🔬 实验设计 - 规划和记录研究方法
📊 数据分析 - 统计分析和结果解释
📚 文献综述 - 组织和综合学术资源
✍️ 学位论文写作 - 长篇学术文档管理
👥 协作研究 - 在团队成员之间保持一致的风格

📄 文档

快速入门指南 - 5 分钟上手
使用示例 - 真实工作流程示例
Skills归属 - 详细的作者信息和许可证
贡献指南 - 如何贡献或创建你自己的 forge

🤝 贡献

发现了一个非常适合学术写作的Skills？请查看 CONTRIBUTING.md 了解如何：

建议新Skills
报告问题
改进文档
创建你自己领域的 forge

📄 许可证

forge 结构（脚本、配置、文档）采用 MIT 许可证。

单个Skills保留其原始许可证 - 详见 ATTRIBUTIONS.md 和每个Skills的仓库。

为学术研究社区用 💙 构建

⭐ 如果这个 forge 对你的研究有帮助，请给本仓库和各个Skills仓库点星！

For Tasks:

Click tags to check more tools for each tasks

write research papers design experiments analyze data review literature collaborate on research

For Jobs:

research assistant academic writer data analyst machine learning engineer scientific researcher

Alternative AI tools for AcademicForge

Similar Open Source Tools

AcademicForge

github

: 66

Daily-DeepLearning

Daily-DeepLearning is a repository that covers various computer science topics such as data structures, operating systems, computer networks, Python programming, data science packages like numpy, pandas, matplotlib, machine learning theories, deep learning theories, NLP concepts, machine learning practical applications, deep learning practical applications, and big data technologies like Hadoop and Hive. It also includes coding exercises related to '剑指offer'. The repository provides detailed explanations and examples for each topic, making it a comprehensive resource for learning and practicing different aspects of computer science and data-related fields.

github

: 666

LLM_book

LLM_book is a learning record and roadmap for programmers with a certain AI foundation to learn Large Language Models (LLM). It covers topics such as PyTorch basics, Transformer architecture, langchain basics, foundational concepts of large models, fine-tuning methods, RAG (Retrieval-Augmented Generation), and building intelligent agents using LLM. The repository provides learning materials, code implementations, and documentation to help users progress in understanding and implementing LLM technologies.

github

: 89

chatwiki

ChatWiki is an open-source knowledge base AI question-answering system. It is built on large language models (LLM) and retrieval-augmented generation (RAG) technologies, providing out-of-the-box data processing, model invocation capabilities, and helping enterprises quickly build their own knowledge base AI question-answering systems. It offers exclusive AI question-answering system, easy integration of models, data preprocessing, simple user interface design, and adaptability to different business scenarios.

github

: 415

kcores-llm-arena

KCORES LLM Arena is a large model evaluation tool that focuses on real-world scenarios, using human scoring and benchmark testing to assess performance. It aims to provide an unbiased evaluation of large models in real-world applications. The tool includes programming ability tests and specific benchmarks like Mandelbrot Set, Mars Mission, Solar System, and Ball Bouncing Inside Spinning Heptagon. It supports various programming languages and emphasizes performance optimization, rendering, animations, physics simulations, and creative implementations.

github

: 344

ai_wiki

This repository provides a comprehensive collection of resources, open-source tools, and knowledge related to quantitative analysis. It serves as a valuable knowledge base and navigation guide for individuals interested in various aspects of quantitative investing, including platforms, programming languages, mathematical foundations, machine learning, deep learning, and practical applications. The repository is well-structured and organized, with clear sections covering different topics. It includes resources on system platforms, programming codes, mathematical foundations, algorithm principles, machine learning, deep learning, reinforcement learning, graph networks, model deployment, and practical applications. Additionally, there are dedicated sections on quantitative trading and investment, as well as large models. The repository is actively maintained and updated, ensuring that users have access to the latest information and resources.

github

: 346

LLM-Navigation

LLM-Navigation is a repository dedicated to documenting learning records related to large models, including basic knowledge, prompt engineering, building effective agents, model expansion capabilities, security measures against prompt injection, and applications in various fields such as AI agent control, browser automation, financial analysis, 3D modeling, and tool navigation using MCP servers. The repository aims to organize and collect information for personal learning and self-improvement through AI exploration.

github

: 110

FastDeploy

FastDeploy is an inference and deployment toolkit for large language models and visual language models based on PaddlePaddle. It provides production-ready deployment solutions with core acceleration technologies such as load-balanced PD disaggregation, unified KV cache transmission, OpenAI API server compatibility, comprehensive quantization format support, advanced acceleration techniques, and multi-hardware support. The toolkit supports various hardware platforms like NVIDIA GPUs, Kunlunxin XPUs, Iluvatar GPUs, Enflame GCUs, and Hygon DCUs, with plans for expanding support to Ascend NPU and MetaX GPU. FastDeploy aims to optimize resource utilization, throughput, and performance for inference and deployment tasks.

github

: 3.6k

llm_interview_note

This repository provides a comprehensive overview of large language models (LLMs), covering various aspects such as their history, types, underlying architecture, training techniques, and applications. It includes detailed explanations of key concepts like Transformer models, distributed training, fine-tuning, and reinforcement learning. The repository also discusses the evaluation and limitations of LLMs, including the phenomenon of hallucinations. Additionally, it provides a list of related courses and references for further exploration.

github

: 2.1k

hongbomiao.com

hongbomiao.com is a personal research and development (R&D) lab that facilitates the sharing of knowledge. The repository covers a wide range of topics including web development, mobile development, desktop applications, API servers, cloud native technologies, data processing, machine learning, computer vision, embedded systems, simulation, database management, data cleaning, data orchestration, testing, ops, authentication, authorization, security, system tools, reverse engineering, Ethereum, hardware, network, guidelines, design, bots, and more. It provides detailed information on various tools, frameworks, libraries, and platforms used in these domains.

github

: 253

DeepBattler

DeepBattler is a tool designed for Hearthstone Battlegrounds players, providing real-time strategic advice and insights to improve gameplay experience. It integrates with the Hearthstone Deck Tracker plugin and offers voice-assisted guidance. The tool is powered by a large language model (LLM) and can match the strength of top players on EU servers. Users can set up the tool by adding dependencies, configuring the plugin path, and launching the LLM agent. DeepBattler is licensed for personal, educational, and non-commercial use, with guidelines on non-commercial distribution and acknowledgment of external contributions.

github

: 88

free-llm-collect

This repository is a collection of free large language models (LLMs) that can be used for various natural language processing tasks. It includes information on different free LLM APIs and projects that can be deployed without cost. Users can find details on the performance, login requirements, function calling capabilities, and deployment environments of each listed LLM source.

github

: 132

LogChat

LogChat is an open-source and free AI chat client that supports various chat models and technologies such as ChatGPT, 讯飞星火, DeepSeek, LLM, TTS, STT, and Live2D. The tool provides a user-friendly interface designed using Qt Creator and can be used on Windows systems without any additional environment requirements. Users can interact with different AI models, perform voice synthesis and recognition, and customize Live2D character models. LogChat also offers features like language translation, AI platform integration, and menu items like screenshot editing, clock, and application launcher.

github

: 53

nix-ai-tools

Exploring the integration between Nix and AI coding agents, this repository serves as a testbed for packaging, sandboxing, and enhancing AI-powered development tools within the Nix ecosystem. It provides a collection of AI tools with descriptions, versions, sources, licenses, homepages, and usage instructions. The repository also supports daily updates using GitHub Actions and offers a platform for experimental features like sandboxed execution, provider abstraction, and tool composition in Nix environments. Contributions are welcome, and the Nix packaging code in this repository is licensed under MIT.

github

: 112

bk-lite

Blueking Lite is an AI First lightweight operation product with low deployment resource requirements, low usage costs, and progressive experience, providing essential tools for operation administrators.

github

: 119

AI-Catalog

AI-Catalog is a curated list of AI tools, platforms, and resources across various domains. It serves as a comprehensive repository for users to discover and explore a wide range of AI applications. The catalog includes tools for tasks such as text-to-image generation, summarization, prompt generation, writing assistance, code assistance, developer tools, low code/no code tools, audio editing, video generation, 3D modeling, search engines, chatbots, email assistants, fun tools, gaming, music generation, presentation tools, website builders, education assistants, autonomous AI agents, photo editing, AI extensions, deep face/deep fake detection, text-to-speech, startup tools, SQL-related AI tools, education tools, and text-to-video conversion.

github

: 361

For similar tasks

Azure-Analytics-and-AI-Engagement

The Azure-Analytics-and-AI-Engagement repository provides packaged Industry Scenario DREAM Demos with ARM templates (Containing a demo web application, Power BI reports, Synapse resources, AML Notebooks etc.) that can be deployed in a customer’s subscription using the CAPE tool within a matter of few hours. Partners can also deploy DREAM Demos in their own subscriptions using DPoC.

github

: 136

sorrentum

Sorrentum is an open-source project that aims to combine open-source development, startups, and brilliant students to build machine learning, AI, and Web3 / DeFi protocols geared towards finance and economics. The project provides opportunities for internships, research assistantships, and development grants, as well as the chance to work on cutting-edge problems, learn about startups, write academic papers, and get internships and full-time positions at companies working on Sorrentum applications.

github

: 89

tidb

TiDB is an open-source distributed SQL database that supports Hybrid Transactional and Analytical Processing (HTAP) workloads. It is MySQL compatible and features horizontal scalability, strong consistency, and high availability.

github

: 37.1k

zep-python

Zep is an open-source platform for building and deploying large language model (LLM) applications. It provides a suite of tools and services that make it easy to integrate LLMs into your applications, including chat history memory, embedding, vector search, and data enrichment. Zep is designed to be scalable, reliable, and easy to use, making it a great choice for developers who want to build LLM-powered applications quickly and easily.

github

: 60

telemetry-airflow

This repository codifies the Airflow cluster that is deployed at workflow.telemetry.mozilla.org (behind SSO) and commonly referred to as "WTMO" or simply "Airflow". Some links relevant to users and developers of WTMO: * The `dags` directory in this repository contains some custom DAG definitions * Many of the DAGs registered with WTMO don't live in this repository, but are instead generated from ETL task definitions in bigquery-etl * The Data SRE team maintains a WTMO Developer Guide (behind SSO)

github

: 185

mojo

Mojo is a new programming language that bridges the gap between research and production by combining Python syntax and ecosystem with systems programming and metaprogramming features. Mojo is still young, but it is designed to become a superset of Python over time.

github

: 23.0k

pandas-ai

PandasAI is a Python library that makes it easy to ask questions to your data in natural language. It helps you to explore, clean, and analyze your data using generative AI.

github

: 14.0k

databend

Databend is an open-source cloud data warehouse that serves as a cost-effective alternative to Snowflake. With its focus on fast query execution and data ingestion, it's designed for complex analysis of the world's largest datasets.

github

: 7.7k

For similar jobs

SLR-FC

This repository provides a comprehensive collection of AI tools and resources to enhance literature reviews. It includes a curated list of AI tools for various tasks, such as identifying research gaps, discovering relevant papers, visualizing paper content, and summarizing text. Additionally, the repository offers materials on generative AI, effective prompts, copywriting, image creation, and showcases of AI capabilities. By leveraging these tools and resources, researchers can streamline their literature review process, gain deeper insights from scholarly literature, and improve the quality of their research outputs.

github

: 131

paper-ai

Paper-ai is a tool that helps you write papers using artificial intelligence. It provides features such as AI writing assistance, reference searching, and editing and formatting tools. With Paper-ai, you can quickly and easily create high-quality papers.

github

: 664

paper-qa

PaperQA is a minimal package for question and answering from PDFs or text files, providing very good answers with in-text citations. It uses OpenAI Embeddings to embed and search documents, and follows a process of embedding docs and queries, searching for top passages, creating summaries, scoring and selecting relevant summaries, putting summaries into prompt, and generating answers. Users can customize prompts and use various models for embeddings and LLMs. The tool can be used asynchronously and supports adding documents from paths, files, or URLs.

github

: 3.6k

ChatData

ChatData is a robust chat-with-documents application designed to extract information and provide answers by querying the MyScale free knowledge base or uploaded documents. It leverages the Retrieval Augmented Generation (RAG) framework, millions of Wikipedia pages, and arXiv papers. Features include self-querying retriever, VectorSQL, session management, and building a personalized knowledge base. Users can effortlessly navigate vast data, explore academic papers, and research documents. ChatData empowers researchers, students, and knowledge enthusiasts to unlock the true potential of information retrieval.

github

: 135

noScribe

noScribe is an AI-based software designed for automated audio transcription, specifically tailored for transcribing interviews for qualitative social research or journalistic purposes. It is a free and open-source tool that runs locally on the user's computer, ensuring data privacy. The software can differentiate between speakers and supports transcription in 99 languages. It includes a user-friendly editor for reviewing and correcting transcripts. Developed by Kai Dröge, a PhD in sociology with a background in computer science, noScribe aims to streamline the transcription process and enhance the efficiency of qualitative analysis.

github

: 1.4k

AIStudyAssistant

AI Study Assistant is an app designed to enhance learning experience and boost academic performance. It serves as a personal tutor, lecture summarizer, writer, and question generator powered by Google PaLM 2. Features include interacting with an AI chatbot, summarizing lectures, generating essays, and creating practice questions. The app is built using 100% Kotlin, Jetpack Compose, Clean Architecture, and MVVM design pattern, with technologies like Ktor, Room DB, Hilt, and Kotlin coroutines. AI Study Assistant aims to provide comprehensive AI-powered assistance for students in various academic tasks.

github

: 69

data-to-paper

Data-to-paper is an AI-driven framework designed to guide users through the process of conducting end-to-end scientific research, starting from raw data to the creation of comprehensive and human-verifiable research papers. The framework leverages a combination of LLM and rule-based agents to assist in tasks such as hypothesis generation, literature search, data analysis, result interpretation, and paper writing. It aims to accelerate research while maintaining key scientific values like transparency, traceability, and verifiability. The framework is field-agnostic, supports both open-goal and fixed-goal research, creates data-chained manuscripts, involves human-in-the-loop interaction, and allows for transparent replay of the research process.

github

: 553

k2

K2 (GeoLLaMA) is a large language model for geoscience, trained on geoscience literature and fine-tuned with knowledge-intensive instruction data. It outperforms baseline models on objective and subjective tasks. The repository provides K2 weights, core data of GeoSignal, GeoBench benchmark, and code for further pretraining and instruction tuning. The model is available on Hugging Face for use. The project aims to create larger and more powerful geoscience language models in the future.

github

: 153