awesome-ai-painting
AI绘画资料合集(包含国内外可使用平台、使用教程、参数教程、部署教程、业界新闻等等) Stable diffusion、AnimateDiff、Stable Cascade 、Stable SDXL Turbo
Stars: 11046
This repository, named 'awesome-ai-painting', is a comprehensive collection of resources related to AI painting. It is curated by a user named 秋风, who is an AI painting enthusiast with a background in the AIGC industry. The repository aims to help more people learn AI painting and also documents the user's goal of creating 100 AI products, with current progress at 4/100. The repository includes information on various AI painting products, tutorials, tools, and models, providing a valuable resource for individuals interested in AI painting and related technologies.
README:
我是秋风,是一名AI绘画爱好者,从22年中旬就开始接触AIGC行业,早期主要是AI绘画布道为主,目前主要分享AI知识和做AI产品。这个仓库是见证了我使用 AI 绘画的过程,它旨在帮助更多的人学会 AI 画画。并且也记录我励志打造100个AI产品的目标, 目前进度 4/100, 你可以在 twitter 关注我, 了解我的最新动态。
我的产品列表: MewXAI绘画 | 星月熊 | 艺映AI | 图片放大增强GoEnhance AI | 视频转视频Video2Video | GPTs-SEO优化 | stablediffusion3
ChatTTS是专门为对话场景设计的文本转语音模型,例如LLM助手对话任务。它支持英文和中文两种语言。最大的模型使用了10万小时以上的中英文数据进行训练。
在人工智能驱动的创意领域中,一颗新星冉冉升起:Flux.1 AI图像生成器。由Black Forest Labs开发的Flux.1正在彻底改变我们思考和创造视觉内容的方式。这款尖端的文本到图像合成模型正在图像生成领域树立新的标杆,提供无与伦比的质量、速度和多样性。
Flux.1不仅仅是另一个AI图像生成器;它是一个游戏规则改变者,正在挑战Midjourney和DALL·E等老牌玩家。凭借其从文本描述创建令人惊叹的高分辨率图像的能力,Flux.1正为全球艺术家、设计师和内容创作者开启新的可能性。
1.安装最新版本的 Comyfui ...
地址: https://www.youtube.com/watch?v=RDH5lyurock
地址: https://www.youtube.com/watch?v=Jh0kJl7duXM
Comfyui 工作流:
1.SVD + 插帧 (from https://twitter.com/PurzBeats)
最近研究了一下 AnimateDiff, 对此用户进行了总结,从我整理的资料上来看,大体上使用的高阶应用分为三个种类:
- cli (https://github.com/s9roll7/animatediff-cli-prompt-travel)
- comfyui (https://github.com/Kosinkadink/ComfyUI-AnimateDiff-Evolved)
- webui (https://github.com/continue-revolution/sd-webui-animatediff)
以上工具的容易上手程度 webui > comfyui > cli , 他们之前不存在谁能代替谁,我的理解只是使用的人机交互界面不同,所有方式都能实现一致的效果。不过目前看起来 webui 插件目前还带有部分模型灰图的情况,但是生态来说 webui 更加强大。
具体的对比查看以及工作流 AnimateDiff教程
23.11.10-产品-AI视频-艺映AI
产品地址: 艺映AI
23.7.22-产品-AI二维码-星月熊
产品地址: MewXAI星月熊
22.11.12-产品-AI绘画-MewXAI
21.7.13-新产品-木及简历
让 Stable Diffusion 提高图片质量的新方案 —— FreeU
概览
立即体验:https://qr.mewx.art(稍后星月熊小程序也会同步上线)
最近在某书、某音里,你肯定刷到这种超火的光影文字艺术作品,一发出去立刻能破万点赞。这种效果极具创意,将光影文字完美巧妙的融合进 AI 绘画里,收获了无数的喜欢,许多人甚至高价定制。
概览:
都说AI绘画来势汹汹,但论创意,还是人类玩得花。
不信来看看这张乍一看平平无奇,却在网上疯传的AI生成美女图片:
AI这样把NB写在脸上,它在玩一种很新的艺术
概览:
概览:
1.泛类AI绘画产品
Name | 价格 | URL |
---|---|---|
文心-一格 | 暂时免费 | https://yige.baidu.com/#/ |
6pen | 部分免费 | https://6pen.art/ |
MewxAI人工智能 | 免费 | 微信小程序 / https://mewx.art |
大画家Domo | - | https://www.domo.cool/ |
盗梦师 | 有免费次数 + 付费 | 微信小程序搜盗梦师 |
画几个画 | - | 微信小程序搜画几个画 |
Niko绘图 | 免费 + 看广告 | 微信小程序搜Niko绘图 |
飞链云AI绘画版图 | 免费 | https://ai.feilianyun.cn/ |
Freehand意绘 | 免费 | https://freehand.yunwooo.com/ |
即时AI | 免费 | https://js.design/pluginDetail?id=6322a4ab0eededcff6ba451a |
意见AI绘画 | 有免费次数 + 付费 | 微信小程序搜意见AI绘画 |
PAI | 免费 | https://artpai.xyz/ |
爱作画 | 有免费次数 + 付费 | https://aizuohua.com/ |
皮卡智能AI | 免费 | https://www.picup.shop/text2image.html#/ |
云景AI绘图 | 免费 | https://yunjing.gallery |
100prompt | 免费 | http://100prompt.com |
C站模型直接使用:TryYourAI | 部分免费 | https://tryyourai.com |
创作+赚钱:WaterWheel | 有免费次数 + 付费 | https://waterwheel.network |
2.垂类绘画产品
Name | 价格 | URL | 使用场景 |
---|---|---|---|
妙鸭相机 | 有免费/有付费 | 小程序搜妙鸭相机 | AI写真 |
星月熊 | 有免费/有付费 | https://qr.mewx.art | AI二维码 |
WeShop | 有免费/有付费 | https://weshop.com/ | AI模特 |
彩鱼相机 | 有免费/有付费 | https://pixpi.art/ | AI形象 |
SDXL
https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0
SD 2.1
https://huggingface.co/stabilityai/stable-diffusion-2-1-base
SD 1.4
https://huggingface.co/CompVis/stable-diffusion-v-1-4-original/resolve/main/sd-v1-4.ckpt
SD 1.5
https://huggingface.co/runwayml/stable-diffusion-v1-5/resolve/main/v1-5-pruned-emaonly.ckpt
novelAI
https://huggingface.co/acheong08/secretAI/resolve/main/stableckpt/animefull-final-pruned/model.ckpt
用Colab免费部署自己的AI绘画云平台—— Stable Diffusion
最简单全面本地运行Colab及Disco Diffusion教程
堪比艺术家!被疯狂安利的 AI 插画神器 Disco Diffusion 有多强?
厂商 | 地址 | 价格 |
---|---|---|
autodl | https://www.autodl.com | 大约1元左右/h,根据不同显卡定价 |
智星云 | http://gpu.ai-galaxy.cn/ | 大约1元左右/h,根据不同显卡定价 |
恒源云 | https://gpushare.com/ | 大约1元左右/h,根据不同显卡定价 |
腾讯云 | https://cloud.tencent.com/act/pro/gpu-study | 最低 60元/0.5个月 |
仙宫云 | https://www.xiangongyun.com/ | 大约1元左右/h,根据不同显卡定价 |
阿里云 | https://www.aliyun.com/activity/bigdata/pai/studio | 免费 A10/T4/G6 1个月 |
显卡速度
来源: https://www.tomshardware.com/pc-components/gpus/stable-diffusion-benchmarks
显卡性价比跑分图
时代变了,大人:RTX 3090时代,哪款显卡配得上我的炼丹炉?
人人都能用的「AI 作画」,如何把 Stable Diffusion 装进电脑?
https://github.com/fboulnois/stable-diffusion-docker
https://github.com/AbdBarho/stable-diffusion-webui-docker
https://github.com/divamgupta/diffusionbee-stable-diffusion-ui
Novel AI 元素魔法全收录 https://docs.qq.com/doc/DWHl3am5Zb05QbGVs
元素法典——Novel AI 元素魔法全收录: https://docs.qq.com/doc/DWHl3am5Zb05QbGVs
NovelAI 法术解析: https://spell.novelai.dev/
https://www.krea.ai/?continueFlag=6591d07b3186f4c7e58de1a4bcfaefb0
https://promptomania.com/stable-diffusion-prompt-builder/
https://prompt.noonshot.com/midjourney
https://huggingface.co/spaces/doevent/prompt-generator
https://midjourney-prompt-helper.netlify.app/
https://promptsalsa.com/midjourney-prompt-generator/
Deep Danbooru: http://dev.kanotype.net:8003/deepdanbooru/
微信群:已满群可加 qiufengblue
QQ群: 713773093
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for awesome-ai-painting
Similar Open Source Tools
awesome-ai-painting
This repository, named 'awesome-ai-painting', is a comprehensive collection of resources related to AI painting. It is curated by a user named 秋风, who is an AI painting enthusiast with a background in the AIGC industry. The repository aims to help more people learn AI painting and also documents the user's goal of creating 100 AI products, with current progress at 4/100. The repository includes information on various AI painting products, tutorials, tools, and models, providing a valuable resource for individuals interested in AI painting and related technologies.
Firefly
Firefly is an open-source large model training project that supports pre-training, fine-tuning, and DPO of mainstream large models. It includes models like Llama3, Gemma, Qwen1.5, MiniCPM, Llama, InternLM, Baichuan, ChatGLM, Yi, Deepseek, Qwen, Orion, Ziya, Xverse, Mistral, Mixtral-8x7B, Zephyr, Vicuna, Bloom, etc. The project supports full-parameter training, LoRA, QLoRA efficient training, and various tasks such as pre-training, SFT, and DPO. Suitable for users with limited training resources, QLoRA is recommended for fine-tuning instructions. The project has achieved good results on the Open LLM Leaderboard with QLoRA training process validation. The latest version has significant updates and adaptations for different chat model templates.
Llama-Chinese
Llama中文社区是一个专注于Llama模型在中文方面的优化和上层建设的高级技术社区。 **已经基于大规模中文数据,从预训练开始对Llama2模型进行中文能力的持续迭代升级【Done】**。**正在对Llama3模型进行中文能力的持续迭代升级【Doing】** 我们热忱欢迎对大模型LLM充满热情的开发者和研究者加入我们的行列。
TigerBot
TigerBot is a cutting-edge foundation for your very own LLM, providing a world-class large model for innovative Chinese-style contributions. It offers various upgrades and features, such as search mode enhancements, support for large context lengths, and the ability to play text-based games. TigerBot is suitable for prompt-based game engine development, interactive game design, and real-time feedback for playable games.
EmoLLM
EmoLLM is a series of large-scale psychological health counseling models that can support **understanding-supporting-helping users** in the psychological health counseling chain, which is fine-tuned from `LLM` instructions. Welcome everyone to star~⭐⭐. The currently open source `LLM` fine-tuning configurations are as follows:
gpt_server
The GPT Server project leverages the basic capabilities of FastChat to provide the capabilities of an openai server. It perfectly adapts more models, optimizes models with poor compatibility in FastChat, and supports loading vllm, LMDeploy, and hf in various ways. It also supports all sentence_transformers compatible semantic vector models, including Chat templates with function roles, Function Calling (Tools) capability, and multi-modal large models. The project aims to reduce the difficulty of model adaptation and project usage, making it easier to deploy the latest models with minimal code changes.
LLMs
LLMs is a Chinese large language model technology stack for practical use. It includes high-availability pre-training, SFT, and DPO preference alignment code framework. The repository covers pre-training data cleaning, high-concurrency framework, SFT dataset cleaning, data quality improvement, and security alignment work for Chinese large language models. It also provides open-source SFT dataset construction, pre-training from scratch, and various tools and frameworks for data cleaning, quality optimization, and task alignment.
video-subtitle-remover
Video-subtitle-remover (VSR) is a software based on AI technology that removes hard subtitles from videos. It achieves the following functions: - Lossless resolution: Remove hard subtitles from videos, generate files with subtitles removed - Fill the region of removed subtitles using a powerful AI algorithm model (non-adjacent pixel filling and mosaic removal) - Support custom subtitle positions, only remove subtitles in defined positions (input position) - Support automatic removal of all text in the entire video (no input position required) - Support batch removal of watermark text from multiple images.
fastapi
智元 Fast API is a one-stop API management system that unifies various LLM APIs in terms of format, standards, and management, achieving the ultimate in functionality, performance, and user experience. It supports various models from companies like OpenAI, Azure, Baidu, Keda Xunfei, Alibaba Cloud, Zhifu AI, Google, DeepSeek, 360 Brain, and Midjourney. The project provides user and admin portals for preview, supports cluster deployment, multi-site deployment, and cross-zone deployment. It also offers Docker deployment, a public API site for registration, and screenshots of the admin and user portals. The API interface is similar to OpenAI's interface, and the project is open source with repositories for API, web, admin, and SDK on GitHub and Gitee.
widgets
Widgets is a desktop component front-end open source component. The project is still being continuously improved. The desktop component client can be downloaded and run in two ways: 1. https://www.microsoft.com/store/productId/9NPR50GQ7T53 2. https://widgetjs.cn After cloning the code, you need to download the dependency in the project directory: `shell pnpm install` and run: `shell pnpm serve`
PaddleNLP
PaddleNLP is an easy-to-use and high-performance NLP library. It aggregates high-quality pre-trained models in the industry and provides out-of-the-box development experience, covering a model library for multiple NLP scenarios with industry practice examples to meet developers' flexible customization needs.
MedicalGPT
MedicalGPT is a training medical GPT model with ChatGPT training pipeline, implement of Pretraining, Supervised Finetuning, RLHF(Reward Modeling and Reinforcement Learning) and DPO(Direct Preference Optimization).
Awesome-Interpretability-in-Large-Language-Models
This repository is a collection of resources focused on interpretability in large language models (LLMs). It aims to help beginners get started in the area and keep researchers updated on the latest progress. It includes libraries, blogs, tutorials, forums, tools, programs, papers, and more related to interpretability in LLMs.
LLM4SE
The collection is actively updated with the help of an internal literature search engine.
Chinese-Mixtral-8x7B
Chinese-Mixtral-8x7B is an open-source project based on Mistral's Mixtral-8x7B model for incremental pre-training of Chinese vocabulary, aiming to advance research on MoE models in the Chinese natural language processing community. The expanded vocabulary significantly improves the model's encoding and decoding efficiency for Chinese, and the model is pre-trained incrementally on a large-scale open-source corpus, enabling it with powerful Chinese generation and comprehension capabilities. The project includes a large model with expanded Chinese vocabulary and incremental pre-training code.
pmhub
PmHub is a smart project management system based on SpringCloud, SpringCloud Alibaba, and LLM. It aims to help students quickly grasp the architecture design and development process of microservices/distributed projects. PmHub provides a platform for students to experience the transformation from monolithic to microservices architecture, understand the pros and cons of both architectures, and prepare for job interviews. It offers popular technologies like SpringCloud-Gateway, Nacos, Sentinel, and provides high-quality code, continuous integration, product design documents, and an enterprise workflow system. PmHub is suitable for beginners and advanced learners who want to master core knowledge of microservices/distributed projects.
For similar tasks
daily-poetry-image
Daily Chinese ancient poetry and AI-generated images powered by Bing DALL-E-3. GitHub Action triggers the process automatically. Poetry is provided by Today's Poem API. The website is built with Astro.
civitai
Civitai is a platform where people can share their stable diffusion models (textual inversions, hypernetworks, aesthetic gradients, VAEs, and any other crazy stuff people do to customize their AI generations), collaborate with others to improve them, and learn from each other's work. The platform allows users to create an account, upload their models, and browse models that have been shared by others. Users can also leave comments and feedback on each other's models to facilitate collaboration and knowledge sharing.
geekai
GeekAI is an open-source AI assistant solution based on AI large language model API, featuring a complete system with ready-to-use front-end and back-end management, providing a seamless typing experience via Websocket. It integrates various pre-trained character applications like Xiaohongshu writing assistant, English translation master, Socrates, Confucius, Steve Jobs, and weekly report assistant. The tool supports multiple large language models from platforms like OpenAI, Azure, Wenxin Yanyan, Xunfei Xinghuo, and Tsinghua ChatGLM. Additionally, it includes MidJourney and Stable Diffusion AI drawing functionalities for creating various artworks such as text-based images, face swapping, and blending images. Users can utilize personal WeChat QR codes for payment without the need for enterprise payment channels, and the tool offers integrated payment options like Alipay and WeChat Pay with support for multiple membership packages and point card purchases. It also features a plugin API for developing powerful plugins using large language model functions, including built-in plugins for Weibo hot search, today's headlines, morning news, and AI drawing functions.
awesome-ai-painting
This repository, named 'awesome-ai-painting', is a comprehensive collection of resources related to AI painting. It is curated by a user named 秋风, who is an AI painting enthusiast with a background in the AIGC industry. The repository aims to help more people learn AI painting and also documents the user's goal of creating 100 AI products, with current progress at 4/100. The repository includes information on various AI painting products, tutorials, tools, and models, providing a valuable resource for individuals interested in AI painting and related technologies.
StableSwarmUI
StableSwarmUI is a modular Stable Diffusion web user interface that emphasizes making power tools easily accessible, high performance, and extensible. It is designed to be a one-stop-shop for all things Stable Diffusion, providing a wide range of features and capabilities to enhance the user experience.
upscayl
Upscayl is a free and open-source AI image upscaler that uses advanced AI algorithms to enlarge and enhance low-resolution images without losing quality. It is a cross-platform application built with the Linux-first philosophy, available on all major desktop operating systems. Upscayl utilizes Real-ESRGAN and Vulkan architecture for image enhancement, and its backend is fully open-source under the AGPLv3 license. It is important to note that a Vulkan compatible GPU is required for Upscayl to function effectively.
ailia-models
The collection of pre-trained, state-of-the-art AI models. ailia SDK is a self-contained, cross-platform, high-speed inference SDK for AI. The ailia SDK provides a consistent C++ API across Windows, Mac, Linux, iOS, Android, Jetson, and Raspberry Pi platforms. It also supports Unity (C#), Python, Rust, Flutter(Dart) and JNI for efficient AI implementation. The ailia SDK makes extensive use of the GPU through Vulkan and Metal to enable accelerated computing. # Supported models 323 models as of April 8th, 2024
models
This repository contains self-trained single image super resolution (SISR) models. The models are trained on various datasets and use different network architectures. They can be used to upscale images by 2x, 4x, or 8x, and can handle various types of degradation, such as JPEG compression, noise, and blur. The models are provided as safetensors files, which can be loaded into a variety of deep learning frameworks, such as PyTorch and TensorFlow. The repository also includes a number of resources, such as examples, results, and a website where you can compare the outputs of different models.
For similar jobs
sweep
Sweep is an AI junior developer that turns bugs and feature requests into code changes. It automatically handles developer experience improvements like adding type hints and improving test coverage.
teams-ai
The Teams AI Library is a software development kit (SDK) that helps developers create bots that can interact with Teams and Microsoft 365 applications. It is built on top of the Bot Framework SDK and simplifies the process of developing bots that interact with Teams' artificial intelligence capabilities. The SDK is available for JavaScript/TypeScript, .NET, and Python.
ai-guide
This guide is dedicated to Large Language Models (LLMs) that you can run on your home computer. It assumes your PC is a lower-end, non-gaming setup.
classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence. Tap into leading cloud-based services like OpenAI, Microsoft Azure AI, Google Gemini and IBM Watson to augment your WordPress-powered websites. Publish content faster while improving SEO performance and increasing audience engagement. ClassifAI integrates Artificial Intelligence and Machine Learning technologies to lighten your workload and eliminate tedious tasks, giving you more time to create original content that matters.
chatbot-ui
Chatbot UI is an open-source AI chat app that allows users to create and deploy their own AI chatbots. It is easy to use and can be customized to fit any need. Chatbot UI is perfect for businesses, developers, and anyone who wants to create a chatbot.
BricksLLM
BricksLLM is a cloud native AI gateway written in Go. Currently, it provides native support for OpenAI, Anthropic, Azure OpenAI and vLLM. BricksLLM aims to provide enterprise level infrastructure that can power any LLM production use cases. Here are some use cases for BricksLLM: * Set LLM usage limits for users on different pricing tiers * Track LLM usage on a per user and per organization basis * Block or redact requests containing PIIs * Improve LLM reliability with failovers, retries and caching * Distribute API keys with rate limits and cost limits for internal development/production use cases * Distribute API keys with rate limits and cost limits for students
uAgents
uAgents is a Python library developed by Fetch.ai that allows for the creation of autonomous AI agents. These agents can perform various tasks on a schedule or take action on various events. uAgents are easy to create and manage, and they are connected to a fast-growing network of other uAgents. They are also secure, with cryptographically secured messages and wallets.
griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.