SakuraLLM

适配轻小说/Galgame的日中翻译大模型

Stars: 2717

Visit

SakuraLLM is a project focused on building large language models for Japanese to Chinese translation in the light novel and galgame domain. The models are based on open-source large models and are pre-trained and fine-tuned on general Japanese corpora and specific domains. The project aims to provide high-performance language models for galgame/light novel translation that are comparable to GPT3.5 and can be used offline. It also offers an API backend for running the models, compatible with the OpenAI API format. The project is experimental, with version 0.9 showing improvements in style, fluency, and accuracy over GPT-3.5.

README:

SakuraLLM

Sakura: SFT And RLHF models using Knowledge of Universal Character and Relationship Attributes for Japanese to Chinese Translation in Light Novel & Galgame Domain.

🤗 Hugging Face • 🤖 ModelScope

目前Sakura发布的所有模型均采用CC BY-NC-SA 4.0协议，Sakura所有模型与其衍生模型均禁止任何形式的商用！Sakura系列所有模型皆仅供学习交流使用，开发者对使用Sakura模型造成的问题不负任何责任。

介绍

基于一系列开源大模型构建，在通用日文语料与轻小说/Galgame等领域的中日语料上进行继续预训练与微调，旨在提供开源可控可离线自部署的、ACGN风格的日中翻译模型。
新建了TG交流群，欢迎交流讨论。

对于其他适配本模型的项目如使用非本项目提供的prompt格式进行翻译，不保证会获得与README中的说明一致的质量！

如果使用模型翻译并发布，请在最显眼的位置标注机翻！！！！！开发者对于滥用本模型造成的一切后果不负任何责任。

由于模型一直在更新，请同时注明使用的模型版本等信息，方便进行质量评估和更新翻译。

对于模型翻译的人称代词问题（错用，乱加，主宾混淆，男女不分等）和上下文理解问题，如果有好的想法或建议，欢迎提issue！

TODO：见https://github.com/SakuraLLM/Sakura-13B-Galgame/issues/42

快速开始

教程：

详见本仓库Wiki.

部分使用方法：usage.md

请注意，如果给轻小说机翻站使用，请参见机翻站站内教程，本 repo 不适用。

如果是翻译字幕等文本，可以考虑直接使用 AiNiee

模型下载：

参数量	发布时间-底模-版本	模型
32B	20240508-Qwen1.5-32B-v0.9	🤗 Sakura-32B-Qwen2beta-v0.9-GGUF
14B	20241008-Qwen2.5-14B-v1.0	🤗 Sakura-14B-Qwen2.5-v1.0-GGUF
7B	20241123-Qwen2.5-7B-v1.0	🤗 Sakura-7B-Qwen2.5-v1.0-GGUF
	20240531-Qwen1.5-7B-Galtransl-v2.6	🤗 Galtransl-v2.6
~2B	20241012-Qwen2.5-1.5B-v1.0	🤗 Sakura-1.5B-Qwen2.5-v1.0-GGUF

p.s. 如果无法连接到HuggingFace服务器，可将链接中的huggingface.co改成hf-mirror.com，使用hf镜像站下载。

News

更新了基于Qwen2.5的v1.0正式版14B模型Sakura-14B-Qwen2.5-v1.0、1.5B模型Qwen2.5-1.5B-v1.0和7B模型Sakura-7B-Qwen2.5-v1.0-GGUF。prompt格式参见下方说明。主要改进：
- 改善翻译质量，提高翻译准确率，尤其是人称的准确率。
- 支持术语表(GPT字典)，以保持专有名词和人称的一致性。
- 提高部分简单控制符的保留能力，尤其是单行内存在\n的情况下保留\n的能力。降低行数与原文不一致的概率。
- 由于底模使用GQA，推理速度和显存占用显著改善，可实现更快的多线程推理。关于多线程推理，可参考Sakura启动器GUI使用教程或SakuraLLMServer。
更新了基于Qwen1.5-7B的Galtransl模型，为视觉小说翻译任务专项优化。对视觉小说脚本中的行内换行、控制符、ruby注音等符号具有较好的保留能力。适配GalTransl视觉小说翻译工具并调优，支持GPT字典（字典写法见此）。
增加了vllm模型后端的支持，详见#40
感谢Isotr0py提供运行模型的NoteBook仓库SakuraLLM-Notebooks，可在Colab(免费T4*1)与Kaggle(免费P100*1或T4*2)平台使用。已经更新Kaggle平台的使用教程，可以白嫖一定时间的T4*2。 警告，Kaggle 官方已经采取措施封禁 SakuraLLM 所有模型，参见，在 Kaggle 上使用 SakuraLLM 将会导致永久性封号。请转移至租卡或者利用机翻站算力共享工具（为防止滥用，请自行搜索）。
Sakura API已经支持OpenAI格式，现在可以通过OpenAI库或者OpenAI API Reference上的请求形式与Server交互。 一个使用OpenAI库与Sakura模型交互的例子详见openai_example.py。

已经接入模型的工具

网站：轻小说机翻机器人已接入Sakura模型(v0.8-4bit)，站内有大量模型翻译结果可供参考。你也可以自行部署模型并使用该网站生成机翻，目前已经支持v0.8与v0.9模型，且提供了llama.cpp一键包。

轻小说机翻机器人网站是一个自动生成轻小说机翻并分享的网站。你可以浏览日文网络小说，或者上传Epub/Txt文件，并生成机翻。
LunaTranslator已经支持Sakura API，可以通过本地部署API后端，并在LunaTranslator中配置Sakura API来使用Sakura模型进行Galgame实时翻译。
~~使用KurikoMoe的版本可以支持流式输出。~~ 目前官方版本已经支持流式输出，只需在翻译设置界面勾选流式输出即可。

LunaTranslator是一个Galgame翻译工具，支持剪贴板、OCR、HOOK，支持40余种翻译引擎。
GalTransl已经支持Sakura API，可以通过本地部署API后端，在GalTransl中配置使用Sakura模型来翻译Galgame，制作内嵌式翻译补丁。

GalTransl是一个galgame自动化翻译工具，用于制作内嵌式翻译补丁。一个使用GalTransl和Sakura模型翻译的示例
翻译Unity引擎游戏的工具SakuraTranslator。感谢fkiliver提供。
翻译RPGMaker引擎游戏的工具RPGMaker_LLaMA_Translator。感谢fkiliver提供。
AiNiee已经支持Sakura API，可以通过本地部署API后端，在AiNiee中使用Sakura模型进行翻译。

AiNiee是一款基于【mtool】或【Translator++】，chatgpt自动批量翻译工具，主要是用来翻译各种RPG游戏。
manga-image-translator已经支持Sakura API，可以通过本地部署API后端，使用Sakura自动翻译漫画。
BallonsTranslator已经支持Sakura API，可以通过本地部署API后端，使用Sakura翻译漫画。

显卡需求与显存需求

目前Sakura Launcher GUI可以支持NVIDIA独立显卡，部分AMD独立显卡和一部分核显的一键安装与启动。

下面的表格显示了部分模型推荐的显卡显存大小。如果你的显卡显存不满足上述需求，可以尝试同时使用CPU与GPU进行推理。

模型	显存大小	模型规模
sakura-7b-qwen2.5-v1.0-iq4xs.gguf	8G/10G	7B
sakura-14b-qwen2.5-v1.0-iq4xs.gguf	11G/12G/16G	14B
sakura-14b-qwen2.5-v1.0-q6k.gguf	24G	14B

模型详情

描述

Finetuned by SakuraUmi
Finetuned on Baichuan2-13B-Chat
Continual Pre-trained on Qwen model series
Continual Pre-trained on Qwen1.5 model series
Finetuned on Sakura-Base model series
Languages: Chinese/Japanese

效果

Galgame

一个例子
轻小说

网站：轻小说机翻机器人已接入Sakura模型(v0.9)，站内有大量模型翻译的轻小说可供参考。
PPL

Sakura-14B-Qwen2beta-v0.9-iq4_xs_ver2: 4.43

Sakura-32B-Qwen2beta-v0.9-iq4xs: 3.28

推理

openai api messages格式：

v0.9 使用代码处理如下：

input_text_list = ['a', 'bb', 'ccc', ...] # 一系列上下文文本，每个元素代表一行的文本
raw_text = "\n".join(input_text_list)
messages=[
    {
        "role": "system",
        "content": "你是一个轻小说翻译模型，可以流畅通顺地以日本轻小说的风格将日文翻译成简体中文，并联系上下文正确使用人称代词，不擅自添加原文中没有的代词。"
    },
    {
        "role": "user",
        "content": "将下面的日文文本翻译成中文：" + raw_text
    }
]

prompt格式：

v1.0pre1 代码处理如下：

        gpt_dict = [{
          "src": "原文1",
          "dst": "译文1",
          "info": "注释信息1",
        },]
        gpt_dict_text_list = []
        for gpt in gpt_dict:
            src = gpt['src']
            dst = gpt['dst']
            info = gpt['info'] if "info" in gpt.keys() else None
            if info:
                single = f"{src}->{dst} #{info}"
            else:
                single = f"{src}->{dst}"
            gpt_dict_text_list.append(single)

        gpt_dict_raw_text = "\n".join(gpt_dict_text_list)

        user_prompt = "根据以下术语表（可以为空）：\n" + gpt_dict_raw_text + "\n" + "将下面的日文文本根据对应关系和备注翻译成中文：" + japanese
        prompt = "<|im_start|>system\n你是一个轻小说翻译模型，可以流畅通顺地以日本轻小说的风格将日文翻译成简体中文，并联系上下文正确使用人称代词，不擅自添加原文中没有的代词。<|im_end|>\n" \ # system prompt
        + "<|im_start|>user\n" + user_prompt + "<|im_end|>\n" \ # user prompt
        + "<|im_start|>assistant\n" # assistant prompt start

        # 如果术语表为空，也可以使用如下prompt（在术语表为空时更加推荐）
        user_prompt = "将下面的日文文本翻译成中文：" + japanese
        prompt = "<|im_start|>system\n你是一个轻小说翻译模型，可以流畅通顺地以日本轻小说的风格将日文翻译成简体中文，并联系上下文正确使用人称代词，不擅自添加原文中没有的代词。<|im_end|>\n" \ # system prompt
        + "<|im_start|>user\n" + user_prompt + "<|im_end|>\n" \ # user prompt
        + "<|im_start|>assistant\n" # assistant prompt start

v0.9 文本格式如下：

<|im_start|>system
你是一个轻小说翻译模型，可以流畅通顺地以日本轻小说的风格将日文翻译成简体中文，并联系上下文正确使用人称代词，不擅自添加原文中没有的代词。<|im_end|>
<|im_start|>user
将下面的日文文本翻译成中文：日文第一行
日文第二行
日文第三行
...
日文第n行<|im_end|>
<|im_start|>assistant

使用代码处理如下：

input_text_list = ['a', 'bb', 'ccc', ...] # 一系列上下文文本，每个元素代表一行的文本
raw_text = "\n".join(input_text_list)
prompt = "<|im_start|>system\n你是一个轻小说翻译模型，可以流畅通顺地以日本轻小说的风格将日文翻译成简体中文，并联系上下文正确使用人称代词，不擅自添加原文中没有的代词。<|im_end|>\n" \ # system prompt
        + "<|im_start|>user\n将下面的日文文本翻译成中文：" + raw_text + "<|im_end|>\n" \ # user prompt
        + "<|im_start|>assistant\n" # assistant prompt start

prompt构建：

v0.8

input_text = "" # 要翻译的日文
query = "将下面的日文文本翻译成中文：" + input_text
prompt = "<reserved_106>" + query + "<reserved_107>"

注意 0.8 虽然还可以使用，但是本仓库不再支持调用基于 transformer 的 0.8 版本模型。

v0.9

input_text = "" # 要翻译的日文
query = "将下面的日文文本翻译成中文：" + input_text
prompt = "<|im_start|>system\n你是一个轻小说翻译模型，可以流畅通顺地以日本轻小说的风格将日文翻译成简体中文，并联系上下文正确使用人称代词，不擅自添加原文中没有的代词。<|im_end|>\n<|im_start|>user\n" + query + "<|im_end|>\n<|im_start|>assistant\n"

推理与解码参数：

参数	值
temperature	0.1
top p	0.3
do sample	True
beams number	1
repetition penalty	1
max new token	512
min new token	1

如出现退化（退化的例子可参见#35与#36），可增加frequency_penalty参数，并设置为大于0的某值，一般设置0.1~0.2即可。

微调

模型微调框架参考BELLE或LLaMA-Factory，prompt构造参考推理部分。

Legacy Models

参数量	发布时间-底模-版本	模型
32B	20240508-Qwen1.5-32B-v0.9	🤗 Sakura-32B-Qwen2beta-v0.9-GGUF
	20240508-Qwen1.5-32B-v0.10pre1	🤗 Sakura-32B-Qwen2beta-v0.10pre1-GGUF
14B	20240111-Qwen-14B-v0.9	🤗 Sakura-13B-LNovel-v0.9b-GGUF
	20240213-Qwen1.5-14B-v0.9	🤗 Sakura-14B-Qwen2beta-v0.9-GGUF
	20240516-Qwen1.5-14B-v0.9.2	🤗 Sakura-14B-Qwen2beta-v0.9.2-GGUF
7B	20240116-Qwen-7B-v0.9	🤗 Sakura-7B-LNovel-v0.9-GGUF
	20240531-Qwen1.5-7B-Galtransl-v2.6	🤗 Galtransl-v2.6
~2B	20240214-Qwen1.5-1.8B-v0.9.1	🤗 Sakura-1B8-Qwen2beta-v0.9.1-GGUF

致谢

Copyright Notice

v0.8版本模型的使用须遵守Apache 2.0、《Baichuan 2 模型社区许可协议》和CC BY-NC-SA 4.0协议。

v0.9版本模型的使用须遵守Qwen模型许可协议和CC BY-NC-SA 4.0协议。

v1.0版本模型的使用须遵守CC BY-NC-SA 4.0协议。

For Tasks:

Click tags to check more tools for each tasks

translate text create game content research ai models analyze language data develop linguistic tools

For Jobs:

translator game developer content creator ai researcher linguist

Alternative AI tools for SakuraLLM

Similar Open Source Tools

SakuraLLM

github

: 2.7k

kirara-ai

Kirara AI is a chatbot that supports mainstream large language models and chat platforms. It provides features such as image sending, keyword-triggered replies, multi-account support, personality settings, and support for various chat platforms like QQ, Telegram, Discord, and WeChat. The tool also supports HTTP server for Web API, popular large models like OpenAI and DeepSeek, plugin mechanism, conditional triggers, admin commands, drawing models, voice replies, multi-turn conversations, cross-platform message sending, custom workflows, web management interface, and built-in Frpc intranet penetration.

github

: 14.9k

app-builder

AppBuilder SDK is a one-stop development tool for AI native applications, providing basic cloud resources, AI capability engine, Qianfan large model, and related capability components to improve the development efficiency of AI native applications.

github

: 503

build_MiniLLM_from_scratch

This repository aims to build a low-parameter LLM model through pretraining, fine-tuning, model rewarding, and reinforcement learning stages to create a chat model capable of simple conversation tasks. It features using the bert4torch training framework, seamless integration with transformers package for inference, optimized file reading during training to reduce memory usage, providing complete training logs for reproducibility, and the ability to customize robot attributes. The chat model supports multi-turn conversations. The trained model currently only supports basic chat functionality due to limitations in corpus size, model scale, SFT corpus size, and quality.

github

: 397

grps_trtllm

The grps-trtllm repository is a C++ implementation of a high-performance OpenAI LLM service, combining GRPS and TensorRT-LLM. It supports functionalities like Chat, Ai-agent, and Multi-modal. The repository offers advantages over triton-trtllm, including a complete LLM service implemented in pure C++, integrated tokenizer supporting huggingface and sentencepiece, custom HTTP functionality for OpenAI interface, support for different LLM prompt styles and result parsing styles, integration with tensorrt backend and opencv library for multi-modal LLM, and stable performance improvement compared to triton-trtllm.

github

: 122

speechless

Speechless.AI is committed to integrating the superior language processing and deep reasoning capabilities of large language models into practical business applications. By enhancing the model's language understanding, knowledge accumulation, and text creation abilities, and introducing long-term memory, external tool integration, and local deployment, our aim is to establish an intelligent collaborative partner that can independently interact, continuously evolve, and closely align with various business scenarios.

github

: 100

DeepAI

DeepAI is a proxy server that enhances the interaction experience of large language models (LLMs) by integrating the 'thinking chain' process. It acts as an intermediary layer, receiving standard OpenAI API compatible requests, using independent 'thinking services' to generate reasoning processes, and then forwarding the enhanced requests to the LLM backend of your choice. This ensures that responses are not only generated by the LLM but also based on pre-inference analysis, resulting in more insightful and coherent answers. DeepAI supports seamless integration with applications designed for the OpenAI API, providing endpoints for '/v1/chat/completions' and '/v1/models', making it easy to integrate into existing applications. It offers features such as reasoning chain enhancement, flexible backend support, API key routing, weighted random selection, proxy support, comprehensive logging, and graceful shutdown.

github

: 121

Step-DPO

Step-DPO is a method for enhancing long-chain reasoning ability of LLMs with a data construction pipeline creating a high-quality dataset. It significantly improves performance on math and GSM8K tasks with minimal data and training steps. The tool fine-tunes pre-trained models like Qwen2-7B-Instruct with Step-DPO, achieving superior results compared to other models. It provides scripts for training, evaluation, and deployment, along with examples and acknowledgements.

github

: 155

chatgpt-mirai-qq-bot

Kirara AI is a chatbot that supports mainstream language models and chat platforms. It features various functionalities such as image sending, keyword-triggered replies, multi-account support, content moderation, personality settings, and support for platforms like QQ, Telegram, Discord, and WeChat. It also offers HTTP server capabilities, plugin support, conditional triggers, admin commands, drawing models, voice replies, multi-turn conversations, cross-platform message sending, and custom workflows. The tool can be accessed via HTTP API for integration with other platforms.

github

: 14.4k

ChuanhuChatGPT

Chuanhu Chat is a user-friendly web graphical interface that provides various additional features for ChatGPT and other language models. It supports GPT-4, file-based question answering, local deployment of language models, online search, agent assistant, and fine-tuning. The tool offers a range of functionalities including auto-solving questions, online searching with network support, knowledge base for quick reading, local deployment of language models, GPT 3.5 fine-tuning, and custom model integration. It also features system prompts for effective role-playing, basic conversation capabilities with options to regenerate or delete dialogues, conversation history management with auto-saving and search functionalities, and a visually appealing user experience with themes, dark mode, LaTeX rendering, and PWA application support.

github

: 15.2k

Muice-Chatbot

Muice-Chatbot is an AI chatbot designed to proactively engage in conversations with users. It is based on the ChatGLM2-6B and Qwen-7B models, with a training dataset of 1.8K+ dialogues. The chatbot has a speaking style similar to a 2D girl, being somewhat tsundere but willing to share daily life details and greet users differently every day. It provides various functionalities, including initiating chats and offering 5 available commands. The project supports model loading through different methods and provides onebot service support for QQ users. Users can interact with the chatbot by running the main.py file in the project directory.

github

: 314

Video-ChatGPT

github

: 1.3k

tiny-llm-zh

Tiny LLM zh is a project aimed at building a small-parameter Chinese language large model for quick entry into learning large model-related knowledge. The project implements a two-stage training process for large models and subsequent human alignment, including tokenization, pre-training, instruction fine-tuning, human alignment, evaluation, and deployment. It is deployed on ModeScope Tiny LLM website and features open access to all data and code, including pre-training data and tokenizer. The project trains a tokenizer using 10GB of Chinese encyclopedia text to build a Tiny LLM vocabulary. It supports training with Transformers deepspeed, multiple machine and card support, and Zero optimization techniques. The project has three main branches: llama2_torch, main tiny_llm, and tiny_llm_moe, each with specific modifications and features.

github

: 147

Awesome-ChatTTS

Awesome-ChatTTS is an official recommended guide for ChatTTS beginners, compiling common questions and related resources. It provides a comprehensive overview of the project, including official introduction, quick experience options, popular branches, parameter explanations, voice seed details, installation guides, FAQs, and error troubleshooting. The repository also includes video tutorials, discussion community links, and project trends analysis. Users can explore various branches for different functionalities and enhancements related to ChatTTS.

github

: 594

agentica

Agentica is a human-centric framework for building large language model agents. It provides functionalities for planning, memory management, tool usage, and supports features like reflection, planning and execution, RAG, multi-agent, multi-role, and workflow. The tool allows users to quickly code and orchestrate agents, customize prompts, and make API calls to various services. It supports API calls to OpenAI, Azure, Deepseek, Moonshot, Claude, Ollama, and Together. Agentica aims to simplify the process of building AI agents by providing a user-friendly interface and a range of functionalities for agent development.

github

: 108

Chinese-Mixtral-8x7B

Chinese-Mixtral-8x7B is an open-source project based on Mistral's Mixtral-8x7B model for incremental pre-training of Chinese vocabulary, aiming to advance research on MoE models in the Chinese natural language processing community. The expanded vocabulary significantly improves the model's encoding and decoding efficiency for Chinese, and the model is pre-trained incrementally on a large-scale open-source corpus, enabling it with powerful Chinese generation and comprehension capabilities. The project includes a large model with expanded Chinese vocabulary and incremental pre-training code.

github

: 635

For similar tasks

SakuraLLM

github

: 2.7k

OpenVoiceChat

OpenVoiceChat is an open-source tool designed for having natural voice conversations with an LLM model. It supports various speech-to-text (STT), text-to-speech (TTS), and large language model (LLM) models. The tool aims to provide an alternative to closed commercial implementations, with well-abstracted APIs that are easy to use and extend. Users can install base and functionality-specific packages using pip, and the tool supports interruptions during conversations. The project encourages contributions through bounties and has a detailed roadmap available for reference.

github

: 196

glossAPI

The glossAPI project aims to develop a Greek language model as open-source software, with code licensed under EUPL and data under Creative Commons BY-SA. The project focuses on collecting and evaluating open text sources in Greek, with efforts to prioritize and gather textual data sets. The project encourages contributions through the CONTRIBUTING.md file and provides resources in the wiki for viewing and modifying recorded sources. It also welcomes ideas and corrections through issue submissions. The project emphasizes the importance of open standards, ethically secured data, privacy protection, and addressing digital divides in the context of artificial intelligence and advanced language technologies.

github

: 101

llama.cpp

llama.cpp is a C++ implementation of LLaMA, a large language model from Meta. It provides a command-line interface for inference and can be used for a variety of tasks, including text generation, translation, and question answering. llama.cpp is highly optimized for performance and can be run on a variety of hardware, including CPUs, GPUs, and TPUs.

github

: 72.0k

llm2vec

LLM2Vec is a simple recipe to convert decoder-only LLMs into text encoders. It consists of 3 simple steps: 1) enabling bidirectional attention, 2) training with masked next token prediction, and 3) unsupervised contrastive learning. The model can be further fine-tuned to achieve state-of-the-art performance.

github

: 1.2k

manga-image-translator

Translate texts in manga/images. Some manga/images will never be translated, therefore this project is born. * Image/Manga Translator * Samples * Online Demo * Disclaimer * Installation * Pip/venv * Poetry * Additional instructions for **Windows** * Docker * Hosting the web server * Using as CLI * Setting Translation Secrets * Using with Nvidia GPU * Building locally * Usage * Batch mode (default) * Demo mode * Web Mode * Api Mode * Related Projects * Docs * Recommended Modules * Tips to improve translation quality * Options * Language Code Reference * Translators Reference * GPT Config Reference * Using Gimp for rendering * Api Documentation * Synchronous mode * Asynchronous mode * Manual translation * Next steps * Support Us * Thanks To All Our Contributors :

github

: 5.6k

curated-transformers

Curated Transformers is a transformer library for PyTorch that provides state-of-the-art models composed of reusable components. It supports various transformer architectures, including encoders like ALBERT, BERT, and RoBERTa, and decoders like Falcon, Llama, and MPT. The library emphasizes consistent type annotations, minimal dependencies, and ease of use for education and research. It has been production-tested by Explosion and will be the default transformer implementation in spaCy 3.7.

github

: 833

MateCat

Matecat is an enterprise-level, web-based CAT tool designed to make post-editing and outsourcing easy and to provide a complete set of features to manage and monitor translation projects.

github

: 427

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

github

: 855

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

github

: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

github

: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

github

: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

github

: 2.3k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

github

: 30.6k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

github

: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

github

: 675

SakuraLLM

README:

SakuraLLM

目前Sakura发布的所有模型均采用CC BY-NC-SA 4.0协议，Sakura所有模型与其衍生模型均禁止任何形式的商用！Sakura系列所有模型皆仅供学习交流使用，开发者对使用Sakura模型造成的问题不负任何责任。

介绍

TODO：见https://github.com/SakuraLLM/Sakura-13B-Galgame/issues/42

快速开始

教程：

模型下载：

News

已经接入模型的工具

显卡需求与显存需求

模型详情

描述

效果

推理

微调

Legacy Models

相关项目

致谢

Copyright Notice

For Tasks:

For Jobs:

Alternative AI tools for SakuraLLM

Similar Open Source Tools

SakuraLLM

kirara-ai

app-builder

build_MiniLLM_from_scratch

grps_trtllm

speechless

DeepAI

Step-DPO

chatgpt-mirai-qq-bot

ChuanhuChatGPT

Muice-Chatbot

Video-ChatGPT

tiny-llm-zh

Awesome-ChatTTS

agentica

Chinese-Mixtral-8x7B

For similar tasks

SakuraLLM

OpenVoiceChat

glossAPI

llama.cpp

llm2vec

manga-image-translator

curated-transformers

MateCat

For similar jobs

weave

LLMStack

VisionCraft

kaito

PyRIT

tabby

spear

Magick