sora-prompt-zh

Sora 中文的提示词 | 短视频提示词（prompt）技巧 | 调教指南。各种场景使用指南。学习怎么让它听你的话。兼顾了 Sora 的多场景应用。

Stars: 121

Visit

Sora-prompt-zh is a repository providing guidance on using Sora in various scenarios, learning how to make it understand your commands, and exploring Sora's multiple applications. It offers AI models that can create realistic and imaginative scenes from OpenAI's text instructions. The repository includes prompts for generating videos, animations, video editing, image generation, and more. Users can find examples and generated videos based on different video styles and modify them as needed. Although Sora is not officially released yet, the repository aims to collect prompts to help users quickly start using Sora to generate desired videos.

README:

sora-prompt-zh

English • 中文

Sora 中文的提示词 | 调教指南。各种场景使用指南。学习怎么让它听你的话。兼顾了 Sora 的多场景应用。

Sora | 索拉是一个AI模型，可以从OpenAI的文本指令中创建逼真和富有想象力的场景。OpenAI正在教AI理解和模拟运动中的物理世界，目标是训练模型，帮助人们解决需要现实世界交互的问题。

如果你是 sora 的学习者，希望获取到 sora 的最新的咨询和相关的开发项目，以及 sora 相关的开源项目，这里 awesome-sora 提供了 sora 相关的Sora 中文指南，指令指南，应用开发指南，精选资源清单，Sora 开发者精选工具框架。

索拉可提供以下功能：

文本到视频
动画
扩展生成的视频
视频到视频编辑
连接视频
图像生成（文本到图像）

在这个存储库中，你会发现各种可以和索拉一起使用的提示。我们根据视频的风格分配了不同的标签，让你可以根据标签快速找到提示示例（Prompt）和生成的视频，并根据需要进行修改。

虽然索拉尚未正式发布，但我们正在全面收集提示，以帮助你快速开始使用索拉生成您想要的视频。

提示词

官方提示词生成器

视频生成提示

官方视频生成提示

点击查看更多示例

一位时尚女性穿着一件黑色皮夹克，一条长长的红色裙子和黑色靴子，手拿一个黑色的手提包，在热闹的东京街道上行走。周围充满了温暖的霓虹灯和动态的城市标识。她戴着太阳镜和红色口红，自信而随意地行走。街道潮湿而反光，形成了五彩灯光的镜面效果。许多行人在周围走动。

生成视频链接

几只巨大的长毛猛犸象漫步在积雪覆盖的草地上，它们的长毛在微风中轻轻飘动，远处是积雪覆盖的树木和戏剧性的雪山，午后的光线和稀薄的云彩以及高高悬挂的太阳形成了温暖的光芒。低角度的摄像视角令人惊叹，捕捉到了这些大型毛茸茸的哺乳动物和美丽的摄影，景深感非常强烈。

生成视频链接

一个电影预告片，讲述了一位30岁的太空人的冒险故事，他戴着一顶红色的羊毛编织头盔，蓝天，盐沙漠，电影风格，35mm胶片拍摄，色彩生动。

生成视频链接

无人机俯视着波涛汹涌的大苏尔加雷角海滩的崎岖悬崖。蓝色的海水拍打着，形成了白色的波浪，而夕阳的金光照亮了岩石海岸。远处有一座灯塔的小岛，悬崖边覆盖着绿色的灌木。从道路到海滩的陡峭下滑是一个戏剧性的壮举，悬崖边突出在海面上。这是一个捕捉到海岸的原始美和太平洋海岸公路崎岖风景的景色。

生成视频链接

动画场景展示了一个近距离的短毛怪兽跪在一个正在融化的红色蜡烛旁边。艺术风格是3D和逼真的，重点放在光线和纹理上。画面的情绪是惊奇和好奇，怪兽睁着大眼睛，张着大嘴盯着火焰看。它的姿势和表情传达出一种天真和俏皮的感觉，好像它是第一次探索周围的世界一样。温暖色调和戏剧性的光线进一步增强了图像的舒适氛围。

生成视频链接

一个华丽的纸艺世界，一个丰富多彩的珊瑚礁，到处都是色彩缤纷的鱼类和海洋生物。

生成视频链接

这个特写镜头展示了维多利亚皇冠鸽子引人注目的蓝色羽毛和红色胸膛。它的羽冠由精致的蕾丝羽毛制成，而它的眼睛是醒目的红色。鸟的头微微倾斜，给人一种威严和威严的印象。背景模糊，突出了鸟的引人注目的外观。

生成视频链接

两艘海盗船激战的写实特写视频，它们在一杯咖啡中航行。

生成视频链接

一位20岁左右的年轻男子坐在天空中的一块云朵上，读着一本书。

生成视频链接

加利福尼亚淘金热的历史影像。

生成视频链接

一个玻璃球的特写视角，里面有一个有竹林的禅园，一个小矮人正在禅园里耙平沙子并在沙子上创造图案。

生成视频链接

在魔幻的黄昏中，一个24岁女子的眼睛在眨眼，站在马拉喀什，70毫米胶片拍摄的电影，景深，鲜艳的色彩，电影感觉的摄影。

生成视频链接

一只卡通袋鼠在迪斯科舞动。

生成视频链接

一个美丽的自制视频，展示了2056年尼日利亚拉各斯的人们。使用手机摄像头拍摄。

生成视频链接

一个培养着许多彩色鱼类和海洋生物的珊瑚礁的华丽渲染的纸艺世界。

生成视频链接

摄像机围绕着一堆大量显示不同节目的老式电视，1950年代的科幻电影，恐怖电影，新闻，静态画面，1970年代的情景喜剧等，设置在纽约一家大型博物馆画廊内。

生成视频链接

3D动画中，一个小，圆，毛茸茸的生物，有着大大的，有表情的眼睛，探索着一个充满生机的，神奇的魔法森林。这个生物是兔子和松鼠的奇妙融合，有着柔软的蓝色皮毛和一条松软的，带条纹的尾巴。它在闪闪发光的小溪旁跳跃，眼睛里充满了惊奇。森林中充满了魔法元素：发光并变色的花朵，紫色和银色树叶的树木，以及看起来像萤火虫的小飘浮灯光。生物停下来和一群像仙子一样的小精灵玩耍，围绕着一个蘑菇环舞动。生物惊叹地抬头望着一棵巨大的，发着光的树，它似乎是森林的心脏。

生成视频链接

摄像机跟随着一辆白色老式SUV，车顶有一个黑色行李架，它快速地驶过陡峭的山路，周围是松树，车轮的灰尘飞扬，阳光照在SUV上，照在山路上，给整个场景带来了温暖的光芒。土路缓缓弯曲，远处看不到其他汽车或车辆。路两旁的树是红杉树，零零散散地散布着绿色植被。车辆从后方视角看上去轻松地跟着弯道转弯，好像它在崎岖的地形中行驶一样。土路本身被陡峭的山丘和山脉所环绕，天空晴朗，白云飘荡。

生成视频链接

火车途经东京郊区的窗户反射。

生成视频链接

无人机摄影展示了一座建在阿马尔菲海岸岩石高地上的美丽历史教堂，视角展示了历史悠久且宏伟的建筑细节，以及分层的路径和露台，海平面下的海浪拍打在下方的岩石上，远眺海岸水域和意大利阿马尔菲海岸的丘陵风景，几个远处的人在走动，并在欣赏悬崖海景的露台上欣赏风景，午后的阳光营造出一种神奇和浪漫的氛围，摄影以美丽的摄影捕捉了这一场景。

生成视频链接

一个大号橙色章鱼躺在海底，与沙质和岩石的地形融为一体。它的触手围绕着身体，眼睛闭着。章鱼不知道一只螃蟹从岩石后爬向它，它的爪子举起准备进攻。螃蟹是棕色的，长满刺，有着长腿和触角。镜头采用广角拍摄，展示了海洋的广阔和深度。水是清澈的蓝色，阳光透过水面，形成光束。镜头清晰锐利，动态范围高。章鱼和螃蟹都处于焦点状态，而背景略微模糊，产生了眨眼，站在马拉喀什，70毫米胶片拍摄的电影，景深，鲜艳的色彩，电影感觉的摄影。

生成视频链接

一只卡通袋鼠在迪斯科舞动。

生成视频链接

一个美丽的自制视频，展示了2056年尼日利亚拉各斯的人们。使用手机摄像头拍摄。

生成视频链接

一个培养着许多彩色鱼类和海洋生物的珊瑚礁的华丽渲染的纸艺世界。

生成视频链接

摄像机围绕着一堆大量显示不同节目的老式电视，1950年代的科幻电影，恐怖电影，新闻，静态画面，1970年代的情景喜剧等，设置在纽约一家大型博物馆画廊内。

生成视频链接

3D动画中，一个小，圆，毛茸茸的生物，有着大大的，有表情的眼睛，探索着一个充满生机的，神奇的魔法森林。这个生物是兔子和松鼠的奇妙融合，有着柔软的蓝色皮毛和一条松软的，带条纹的尾巴。它在闪闪发光的小溪旁跳跃，眼睛里充满了惊奇。森林中充满了魔法元素：发光并变色的花朵，紫色和银色树叶的树木，以及看起来像萤火虫的小飘浮灯光。生物停下来和一群像仙子一样的小精灵玩耍，围绕着一个蘑菇环舞动。生物惊叹地抬头望着一棵巨大的，发着光的树，它似乎是森林的心脏。

生成视频链接

摄像机跟随着一辆白色老式SUV，车顶有一个黑色行李架，它快速地驶过陡峭的山路，周围是松树，车轮的灰尘飞扬，阳光照在SUV上，照在山路上，给整个场景带来了温暖的光芒。土路缓缓弯曲，远处看不到其他汽车或车辆。路两旁的树是红杉树，零零散散地散布着绿色植被。车辆从后方视角看上去轻松地跟着弯道转弯，好像它在崎岖的地形中行驶一样。土路本身被陡峭的山丘和山脉所环绕，天空晴朗，白云飘荡。

生成视频链接

火车途经东京郊区的窗户反射。

生成视频链接

无人机摄影展示了一座建在阿马尔菲海岸岩石高地上的美丽历史教堂，视角展示了历史悠久且宏伟的建筑细节，以及分层的路径和露台，海平面下的海浪拍打在下方的岩石上，远眺海岸水域和意大利阿马尔菲海岸的丘陵风景，几个远处的人在走动，并在欣赏悬崖海景的露台上欣赏风景，午后的阳光营造出一种神奇和浪漫的氛围，摄影以美丽的摄影捕捉了这一场景。

生成视频链接

一只巨大的橙色章鱼栖息在海底，与沙质和岩石地形融为一体。它的触手散布在身体周围，眼睛紧闭。章鱼没有意识到一只帝王蟹正从岩石后面爬向它，它的爪子举起并准备攻击。螃蟹呈棕色，多刺，有长腿和触角。该场景是从广角拍摄的，展现了海洋的浩瀚和深度。海水清澈碧蓝，阳光透过来。镜头锐利、清晰，具有高动态范围。章鱼和螃蟹清晰对焦，背景略微模糊，营造出景深效果。

官方 Twitter 上的提示词以及视频展现

点击查看更多示例

一只小熊猫和一只巨嘴鸟是最好的朋友，在圣托里尼的蓝色时刻散步。生成视频链接
一名水肺潜水员发现了一个隐藏的未来主义沉船，里面有赛博海洋生物和先进的外星科技。生成视频链接
特写镜头展示了一只雄伟的白色龙，拥有珍珠般的银边鳞片、冰蓝色的眼睛、优雅的象牙色角和雾气般的呼吸。着重展示了详细的面部特征和有纹理的鳞片，背景柔和模糊。生成视频链接
在一个精美渲染的纸艺世界中，一艘蒸汽船穿越辽阔的海洋，天空中有薄云。遥远的草坡在背景中若隐若现，纸艺海洋表面附近可见一些海洋生物。生成视频链接
一个人在夏威夷的热带水域进行BASE跳伞。他的宠物金刚鹦鹉在他身边飞行。生成视频链接
一个被黑暗霓虹灯光照亮的热带雨林，充满了奇幻的动植物和动物。生成视频链接
一只玻璃制成的乌龟，裂缝用金缮修复过，正走在日落时分的黑沙滩上。生成视频链接
一群萨摩耶小狗学习成为厨师的宣传片。生成视频链接
一群冒险的小狗在天空废墟中探险的宣传片。生成视频链接
夜间镜头，一个寄居蟹把白炽灯泡当做壳。生成视频链接
Minecraft使用最华丽的高清8K材质包。生成视频链接
这个近景镜头展示了一只未来主义赛博德国牧羊犬，展示了它引人注目的棕黑色毛发。生成视频链接
一个蚂蚁在蚁巢内部导航的POV镜头。生成视频链接
一片叶子的微距镜头，展示了微小的火车穿过它的叶脉。生成视频链接
一只白色和橙色虎斑巷猫在大雨中穿过后巷，寻找庇护所。生成视频链接
一只可以游泳的蝴蝶在美丽的珊瑚礁下水中航行的逼真视频。生成视频链接
一只巨大的鸭子走过波士顿的街道。生成视频链接
相机下降并放大，俯瞰着美丽的海洋和历史建筑，沿着悬崖上的壮丽海岸风景小镇。生成视频链接
一个由水构成的行走人形体参观一个艺术画廊，里面有许多不同风格的美丽作品。生成视频链接
一个绿色的斑点和一个橙色的斑点相爱并一起跳舞。生成视频链接
一个阴森的闹鬼大宅，友好的南瓜灯和鬼魂角色欢迎着来敲门的孩子们，倾斜移轴摄影。生成视频链接
一个巨大的教堂被猫填满。你看到的地方到处都是猫。一个男人走进教堂，在一只巨大的猫王宝座前鞠躬。生成视频链接
人们在海滩放松的逼真视频，然后一条鲨鱼从水中跳出，惊吓了所有人。生成视频链接

官方 TikTok 上面的提示词以及视频展示

点击查看更多示例

穿着雄伟王冠的小土豆国王，坐在王座上，监视着他们的土豆王国，充满了土豆臣民和土豆城堡。生成视频链接
一家装饰有室内植物的咖啡馆的微缩地图。木梁在上方交叉，一台冷冻咖啡站用小瓶子和玻璃杯装点着。生成视频链接
一个拼成“SORA”字样的逼真云图。生成视频链接
公园里的猴子下棋。生成视频链接
叶子的微距镜头，展示了微小的火车穿过它的叶脉。生成视频链接
一只黑色连帽卫衣的计算机黑客拉布拉多在计算机前面坐着，屏幕的反光照在狗脸上，它正在快速打字。生成视频链接
低角度摄影紧随丛林中的蚂蚁，进入地面，进入它们的世界。生成视频链接
比萨斜塔。生成视频链接
一个低质量、视觉上令人失望的超级碗广告。生成视频链接

如何做提示词

摄影技术/设备

35毫米电影胶片拍摄
70毫米电影胶片拍摄
使用手机相机拍摄

视觉风格

电影感
3D数字渲染艺术风格
宽幅镜头
黑白影调
老电影风格的颗粒感
日落金时光
星轨长曝光
街头纪实风格
HDR高动态范围
慢动作拍摄
时光流逝摄影
创意光绘
虚拟现实全景
微距摄影

拍摄技巧

景深
特写镜头
画面清晰锐利，具有浅景深
鲜艳的色彩

技术效果

稳定镜头：去除抖动，保持画面稳定。
色彩校正：调整视频的色温、饱和度、对比度等。
光线效果：模拟自然光、背光或特殊光源效果。
绿幕抠图：将特定颜色（通常是绿色或蓝色）的背景替换为其他画面。
视频转场：平滑或创意地过渡两个镜头之间的切换。
文字动画：文字的出现、消失或移动效果。
时间线编辑：对视频片段进行裁剪、拼接、速度调整等。

视觉风格

复古风格：模仿旧电影或某个时代的视觉效果。
动漫风格：将视频处理成类似动漫或手绘的艺术风格。
科幻风格：赋予视频未来主义或科幻电影的视觉特征。
梦幻效果：使用模糊、光晕等效果创造出梦幻般的视觉体验。
纪录片风格：模仿纪录片的摄影和剪辑手法。

情绪表达

欢快：通过明亮的色彩、快节奏的剪辑传达快乐情绪。
怀旧：使用温暖的色调和复古转场回忆过去。
紧张：通过快速剪辑、突兀的音效制造紧张气氛。
浪漫：利用柔和的光线、慢动作和温馨的背景音乐营造浪漫氛围。

特殊效果

VR/360度视频：支持创建或编辑虚拟现实视频。
AR效果：添加增强现实元素和图层。
音频效果：背景音乐、声音混音、音频过滤和效果处理。
互动视频：允许创作者加入互动元素，如点击跳转、问卷调查等。

关于结构化提示词的猜想

社区资源

SoraEase 提供了开发工具和资源，为每个人简化 Sora 的 AI 视频技术，开发者更好的利用我们的各个工具做 Sora 开发，用户可以更方便的与我们一起使用我们的工具完成人工智能视频创作。

GitHub地址：SoraEase GitHub
加入我们的社群：添加 Wechat nsddd_top 并回复 sora 进群。在我们的微信社群中，你可以获取 Sora 的最新咨询，技术分享，同时也是Sora爱好者和开发者的交流平台。

我们期待你的加入，一起探索Sora技术的无限可能！

For Tasks:

Click tags to check more tools for each tasks

create videos edit videos generate images explore ai models develop ai applications

For Jobs:

video editor content creator ai developer digital artist creative director

Alternative AI tools for sora-prompt-zh

Similar Open Source Tools

sora-prompt-zh

github

: 121

get_jobs

Get Jobs is a tool designed to help users find and apply for job positions on various recruitment platforms in China. It features AI job matching, automatic cover letter generation, multi-platform job application, automated filtering of inactive HR and headhunter positions, real-time WeChat message notifications, blacklisted company updates, driver adaptation for Win11, centralized configuration, long-lasting cookie login, XPathHelper plugin, global logging, and more. The tool supports platforms like Boss直聘, 猎聘, 拉勾, 51job, and 智联招聘. Users can configure the tool for customized job searches and applications.

github

: 3.9k

GoMaxAI-ChatGPT-Midjourney-Pro

GoMaxAI Pro is an AI-powered application for personal, team, and enterprise private operations. It supports various models like ChatGPT, Claude, Gemini, Kimi, Wenxin Yiyuan, Xunfei Xinghuo, Tsinghua Zhipu, Suno-v3.5, and Luma-video. The Pro version offers a new UI interface, member points system, management backend, homepage features, support for various content formats, AI video capabilities, SAAS multi-opening function, bug fixes, and more. It is built using web frontend with Vue3, mobile frontend with Uniapp, management frontend with Vue3, backend with Nodejs, and uses MySQL5.7(+) + Redis for data support. It can be deployed on Linux, Windows, or MacOS, with data storage options including local storage, Aliyun OSS, Tencent Cloud COS, and Chevereto image bed.

github

: 233

Awesome-Chinese-LLM

Analyze the following text from a github repository (name and readme text at end) . Then, generate a JSON object with the following keys and provide the corresponding information for each key, ,'for_jobs' (List 5 jobs suitable for this tool,in lowercase letters), 'ai_keywords' (keywords of the tool,in lowercase letters), 'for_tasks' (list of 5 specific tasks user can use this tool to do,in less than 3 words,Verb + noun form,in daily spoken language,in lowercase letters).Answer in english languagesname:Awesome-Chinese-LLM readme:# Awesome Chinese LLM ![](https://awesome.re/badge.svg) ![Awesome-Chinese-LLM](src/icon.png) An Awesome Collection for LLM in Chinese 收集和梳理中文LLM相关 ![GitHub stars](https://img.shields.io/github/stars/HqWu-HITCS/Awesome-Chinese-LLM.svg?style=popout-square) ![GitHub issues](https://img.shields.io/github/issues/HqWu-HITCS/Awesome-Chinese- LLM.svg?style=popout-square) ![GitHub forks](https://img.shields.io/github/forks/HqWu-HITCS/Awesome-Chinese- LLM.svg?style=popout-square) 自ChatGPT为代表的大语言模型（Large Language Model, LLM）出现以后，由于其惊人的类通用人工智能（AGI）的能力，掀起了新一轮自然语言处理领域的研究和应用的浪潮。尤其是以ChatGLM、LLaMA等平民玩家都能跑起来的较小规模的LLM开源之后，业界涌现了非常多基于LLM的二次微调或应用的案例。本项目旨在收集和梳理中文LLM相关的开源模型、应用、数据集及教程等资料，目前收录的资源已达100+个！如果本项目能给您带来一点点帮助，麻烦点个⭐️吧～同时也欢迎大家贡献本项目未收录的开源模型、应用、数据集等。提供新的仓库信息请发起PR，并按照本项目的格式提供仓库链接、star数，简介等相关信息，感谢~

github

: 15.0k

KubeDoor

KubeDoor is a microservice resource management platform developed using Python and Vue, based on K8S admission control mechanism. It supports unified remote storage, monitoring, alerting, notification, and display for multiple K8S clusters. The platform focuses on resource analysis and control during daily peak hours of microservices, ensuring consistency between resource request rate and actual usage rate.

github

: 272

douyin-chatgpt-bot

Douyin ChatGPT Bot is an AI-driven system for automatic replies on Douyin, including comment and private message replies. It offers features such as comment filtering, customizable robot responses, and automated account management. The system aims to enhance user engagement and brand image on the Douyin platform, providing a seamless experience for managing interactions with followers and potential customers.

github

: 166

PromptHub

PromptHub is a versatile tool for generating prompts and ideas to spark creativity and overcome writer's block. It provides a wide range of customizable prompts and exercises to inspire writers, artists, educators, and anyone looking to enhance their creative thinking. With PromptHub, users can access a diverse collection of prompts across various categories such as writing, drawing, brainstorming, and more. The tool offers a user-friendly interface and allows users to save and share their favorite prompts for future reference. Whether you're a professional writer seeking inspiration or a student looking to boost your creativity, PromptHub is the perfect companion to ignite your imagination and enhance your creative process.

github

: 545

LLMOne

LLMOne is an open-source, lightweight enterprise-level platform for deploying and serving large language models. It aims to address pain points in traditional large model private deployment such as long cycles, complex configurations, performance challenges, and high operational costs. LLMOne simplifies the deployment process with highly automated workflows and optimized runtime environments, ensuring enterprise-level performance and stability. It caters to developers, manufacturers, and users of large language models, providing features like rapid deployment, professional inference performance, broad compatibility with AI hardware, flexible model and application management, visual operational monitoring, and an open application ecosystem.

github

: 82

99AI

99AI is a commercializable AI web application based on NineAI 2.4.2 (no authorization, no backdoors, no piracy, integrated front-end and back-end integration packages, supports Docker rapid deployment). The uncompiled source code is temporarily closed. Compared with the stable version, the development version is faster.

github

: 736

easyaiot

EasyAIoT is an AI cloud platform designed to support camera integration, annotation, training, inference, data collection, analysis, alerts, recording, storage, and deployment. It aims to provide a zero-threshold AI experience for everyone, with a focus on cameras below a hundred levels. The platform consists of five core projects: WEB module for frontend management, DEVICE module for device management, VIDEO module for video processing, AI module for AI analysis, and TASK module for high-performance task execution. EasyAIoT combines Java, Python, and C++ to create a versatile and user-friendly AIoT platform.

github

: 51

manga-translator-ui

This repository is a manga image translator tool that allows users to translate text in manga images automatically. It supports various types of manga, including Japanese, Korean, and American, in both black and white and color formats. The tool can detect, translate, and embed text, supporting multiple languages such as Japanese, Chinese, and English. It also includes a visual editor for adjusting text boxes. Users can interact with the tool through a Qt interface or command-line mode for batch processing. The tool offers features like intelligent text detection, multi-language OCR, multiple translation engines, high-quality translation using AI models, automatic term extraction, AI sentence segmentation, intelligent typesetting, PSD export, and batch processing. Additionally, it provides a visual editor for region editing, text editing, mask editing, undo/redo functionality, shortcut key support, and mouse wheel shortcuts.

github

: 879

All-Model-Chat

All Model Chat is a feature-rich, highly customizable web chat application designed specifically for the Google Gemini API family. It integrates dynamic model selection, multimodal file input, streaming responses, comprehensive chat history management, and extensive customization options to provide an unparalleled AI interactive experience.

github

: 744

AI-Compass

github

: 288

Saber-Translator

Saber-Translator is your exclusive AI comic translation tool, designed to effortlessly eliminate language barriers and enjoy the original comic fun. It offers features like translating comic images/PDFs, intelligent bubble detection and text recognition, powerful AI translation engine with multiple service providers, highly customizable translation effects, real-time preview and convenient operations, efficient image management and download, model recording and recommendation, and support for language learning with dual prompt word outputs.

github

: 2.7k

NovelForge

NovelForge is an AI-assisted writing tool with the potential for creating long-form content of millions of words. It offers a solution that combines world-building, structured content generation, and consistency maintenance. The tool is built around four core concepts: modular 'cards', customizable 'dynamic output models', flexible 'context injection', and consistency assurance through a 'knowledge graph'. It provides a highly structured and configurable writing environment, inspired by the Snowflake Method, allowing users to create and organize their content in a tree-like structure. NovelForge is highly customizable and extensible, allowing users to tailor their writing workflow to their specific needs.

github

: 134

SwanLab

SwanLab is an open-source, lightweight AI experiment tracking tool that provides a platform for tracking, comparing, and collaborating on experiments, aiming to accelerate the research and development efficiency of AI teams by 100 times. It offers a friendly API and a beautiful interface, combining hyperparameter tracking, metric recording, online collaboration, experiment link sharing, real-time message notifications, and more. With SwanLab, researchers can document their training experiences, seamlessly communicate and collaborate with collaborators, and machine learning engineers can develop models for production faster.

github

: 3.6k

For similar tasks

InvokeAI

InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies. InvokeAI offers an industry leading Web Interface, interactive Command Line Interface, and also serves as the foundation for multiple commercial products.

github

: 26.8k

Open-Sora-Plan

Open-Sora-Plan is a project that aims to create a simple and scalable repo to reproduce Sora (OpenAI, but we prefer to call it "ClosedAI"). The project is still in its early stages, but the team is working hard to improve it and make it more accessible to the open-source community. The project is currently focused on training an unconditional model on a landscape dataset, but the team plans to expand the scope of the project in the future to include text2video experiments, training on video2text datasets, and controlling the model with more conditions.

github

: 11.8k

comflowyspace

Comflowyspace is an open-source AI image and video generation tool that aims to provide a more user-friendly and accessible experience than existing tools like SDWebUI and ComfyUI. It simplifies the installation, usage, and workflow management of AI image and video generation, making it easier for users to create and explore AI-generated content. Comflowyspace offers features such as one-click installation, workflow management, multi-tab functionality, workflow templates, and an improved user interface. It also provides tutorials and documentation to lower the learning curve for users. The tool is designed to make AI image and video generation more accessible and enjoyable for a wider range of users.

github

: 1.8k

Rewind-AI-Main

Rewind AI is a free and open-source AI-powered video editing tool that allows users to easily create and edit videos. It features a user-friendly interface, a wide range of editing tools, and support for a variety of video formats. Rewind AI is perfect for beginners and experienced video editors alike.

github

: 248

MoneyPrinterTurbo

MoneyPrinterTurbo is a tool that can automatically generate video content based on a provided theme or keyword. It can create video scripts, materials, subtitles, and background music, and then compile them into a high-definition short video. The tool features a web interface and an API interface, supporting AI-generated video scripts, customizable scripts, multiple HD video sizes, batch video generation, customizable video segment duration, multilingual video scripts, multiple voice synthesis options, subtitle generation with font customization, background music selection, access to high-definition and copyright-free video materials, and integration with various AI models like OpenAI, moonshot, Azure, and more. The tool aims to simplify the video creation process and offers future plans to enhance voice synthesis, add video transition effects, provide more video material sources, offer video length options, include free network proxies, enable real-time voice and music previews, support additional voice synthesis services, and facilitate automatic uploads to YouTube platform.

github

: 25.7k

Dough

Dough is a tool for crafting videos with AI, allowing users to guide video generations with precision using images and example videos. Users can create guidance frames, assemble shots, and animate them by defining parameters and selecting guidance videos. The tool aims to help users make beautiful and unique video creations, providing control over the generation process. Setup instructions are available for Linux and Windows platforms, with detailed steps for installation and running the app.

github

: 395

ragdoll-studio

Ragdoll Studio is a platform offering web apps and libraries for interacting with Ragdoll, enabling users to go beyond fine-tuning and create flawless creative deliverables, rich multimedia, and engaging experiences. It provides various modes such as Story Mode for creating and chatting with characters, Vector Mode for producing vector art, Raster Mode for producing raster art, Video Mode for producing videos, Audio Mode for producing audio, and 3D Mode for producing 3D objects. Users can export their content in various formats and share their creations on the community site. The platform consists of a Ragdoll API and a front-end React application for seamless usage.

github

: 156

Whisper-TikTok

Discover Whisper-TikTok, an innovative AI-powered tool that leverages the prowess of Edge TTS, OpenAI-Whisper, and FFMPEG to craft captivating TikTok videos. Whisper-TikTok effortlessly generates accurate transcriptions from audio files and integrates Microsoft Edge Cloud Text-to-Speech API for vibrant voiceovers. The program orchestrates the synthesis of videos using a structured JSON dataset, generating mesmerizing TikTok content in minutes.

github

: 148

For similar jobs

promptflow

**Prompt flow** is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.

github

: 9.2k

deepeval

DeepEval is a simple-to-use, open-source LLM evaluation framework specialized for unit testing LLM outputs. It incorporates various metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., and runs locally on your machine for evaluation. It provides a wide range of ready-to-use evaluation metrics, allows for creating custom metrics, integrates with any CI/CD environment, and enables benchmarking LLMs on popular benchmarks. DeepEval is designed for evaluating RAG and fine-tuning applications, helping users optimize hyperparameters, prevent prompt drifting, and transition from OpenAI to hosting their own Llama2 with confidence.

github

: 13.7k

MegaDetector

MegaDetector is an AI model that identifies animals, people, and vehicles in camera trap images (which also makes it useful for eliminating blank images). This model is trained on several million images from a variety of ecosystems. MegaDetector is just one of many tools that aims to make conservation biologists more efficient with AI. If you want to learn about other ways to use AI to accelerate camera trap workflows, check out our of the field, affectionately titled "Everything I know about machine learning and camera traps".

github

: 245

leapfrogai

LeapfrogAI is a self-hosted AI platform designed to be deployed in air-gapped resource-constrained environments. It brings sophisticated AI solutions to these environments by hosting all the necessary components of an AI stack, including vector databases, model backends, API, and UI. LeapfrogAI's API closely matches that of OpenAI, allowing tools built for OpenAI/ChatGPT to function seamlessly with a LeapfrogAI backend. It provides several backends for various use cases, including llama-cpp-python, whisper, text-embeddings, and vllm. LeapfrogAI leverages Chainguard's apko to harden base python images, ensuring the latest supported Python versions are used by the other components of the stack. The LeapfrogAI SDK provides a standard set of protobuffs and python utilities for implementing backends and gRPC. LeapfrogAI offers UI options for common use-cases like chat, summarization, and transcription. It can be deployed and run locally via UDS and Kubernetes, built out using Zarf packages. LeapfrogAI is supported by a community of users and contributors, including Defense Unicorns, Beast Code, Chainguard, Exovera, Hypergiant, Pulze, SOSi, United States Navy, United States Air Force, and United States Space Force.

github

: 255

llava-docker

This Docker image for LLaVA (Large Language and Vision Assistant) provides a convenient way to run LLaVA locally or on RunPod. LLaVA is a powerful AI tool that combines natural language processing and computer vision capabilities. With this Docker image, you can easily access LLaVA's functionalities for various tasks, including image captioning, visual question answering, text summarization, and more. The image comes pre-installed with LLaVA v1.2.0, Torch 2.1.2, xformers 0.0.23.post1, and other necessary dependencies. You can customize the model used by setting the MODEL environment variable. The image also includes a Jupyter Lab environment for interactive development and exploration. Overall, this Docker image offers a comprehensive and user-friendly platform for leveraging LLaVA's capabilities.

github

: 59

carrot

The 'carrot' repository on GitHub provides a list of free and user-friendly ChatGPT mirror sites for easy access. The repository includes sponsored sites offering various GPT models and services. Users can find and share sites, report errors, and access stable and recommended sites for ChatGPT usage. The repository also includes a detailed list of ChatGPT sites, their features, and accessibility options, making it a valuable resource for ChatGPT users seeking free and unlimited GPT services.

github

: 17.1k

TrustLLM

TrustLLM is a comprehensive study of trustworthiness in LLMs, including principles for different dimensions of trustworthiness, established benchmark, evaluation, and analysis of trustworthiness for mainstream LLMs, and discussion of open challenges and future directions. Specifically, we first propose a set of principles for trustworthy LLMs that span eight different dimensions. Based on these principles, we further establish a benchmark across six dimensions including truthfulness, safety, fairness, robustness, privacy, and machine ethics. We then present a study evaluating 16 mainstream LLMs in TrustLLM, consisting of over 30 datasets. The document explains how to use the trustllm python package to help you assess the performance of your LLM in trustworthiness more quickly. For more details about TrustLLM, please refer to project website.

github

: 535

AI-YinMei

AI-YinMei is an AI virtual anchor Vtuber development tool (N card version). It supports fastgpt knowledge base chat dialogue, a complete set of solutions for LLM large language models: [fastgpt] + [one-api] + [Xinference], supports docking bilibili live broadcast barrage reply and entering live broadcast welcome speech, supports Microsoft edge-tts speech synthesis, supports Bert-VITS2 speech synthesis, supports GPT-SoVITS speech synthesis, supports expression control Vtuber Studio, supports painting stable-diffusion-webui output OBS live broadcast room, supports painting picture pornography public-NSFW-y-distinguish, supports search and image search service duckduckgo (requires magic Internet access), supports image search service Baidu image search (no magic Internet access), supports AI reply chat box [html plug-in], supports AI singing Auto-Convert-Music, supports playlist [html plug-in], supports dancing function, supports expression video playback, supports head touching action, supports gift smashing action, supports singing automatic start dancing function, chat and singing automatic cycle swing action, supports multi scene switching, background music switching, day and night automatic switching scene, supports open singing and painting, let AI automatically judge the content.

github

: 529

sora-prompt-zh

README:

sora-prompt-zh

提示词

官方提示词生成器

视频生成提示

官方视频生成提示

官方 Twitter 上的提示词以及视频展现

官方 TikTok 上面的提示词以及视频展示

如何做提示词

摄影技术/设备

视觉风格

技术效果

视觉风格

情绪表达

特殊效果

相关文章

社区资源

For Tasks:

For Jobs:

Alternative AI tools for sora-prompt-zh

Similar Open Source Tools

sora-prompt-zh

get_jobs

GoMaxAI-ChatGPT-Midjourney-Pro

Awesome-Chinese-LLM

KubeDoor

douyin-chatgpt-bot

PromptHub

LLMOne

99AI

easyaiot

manga-translator-ui

All-Model-Chat

AI-Compass

Saber-Translator

NovelForge

SwanLab

For similar tasks

InvokeAI

Open-Sora-Plan

comflowyspace

Rewind-AI-Main

MoneyPrinterTurbo

Dough

ragdoll-studio

Whisper-TikTok

For similar jobs

promptflow

deepeval

MegaDetector

leapfrogai

llava-docker

carrot

TrustLLM

AI-YinMei