
sora-prompt-zh
Sora 中文的提示词 | 短视频提示词(prompt)技巧 | 调教指南。各种场景使用指南。学习怎么让它听你的话。兼顾了 Sora 的多场景应用。
Stars: 52

Sora-prompt-zh is a repository providing guidance on using Sora in various scenarios, learning how to make it understand your commands, and exploring Sora's multiple applications. It offers AI models that can create realistic and imaginative scenes from OpenAI's text instructions. The repository includes prompts for generating videos, animations, video editing, image generation, and more. Users can find examples and generated videos based on different video styles and modify them as needed. Although Sora is not officially released yet, the repository aims to collect prompts to help users quickly start using Sora to generate desired videos.
README:
Sora 中文的提示词 | 调教指南。各种场景使用指南。学习怎么让它听你的话。兼顾了 Sora 的多场景应用。
Sora | 索拉 是一个AI模型,可以从OpenAI的文本指令中创建逼真和富有想象力的场景。OpenAI正在教AI理解和模拟运动中的物理世界,目标是训练模型,帮助人们解决需要现实世界交互的问题。
如果你是 sora 的学习者,希望获取到 sora 的最新的咨询和相关的开发项目,以及 sora 相关的开源项目,这里 awesome-sora 提供了 sora 相关的Sora 中文指南,指令指南,应用开发指南,精选资源清单,Sora 开发者精选工具框架。
索拉可提供以下功能:
- 文本到视频
- 动画
- 扩展生成的视频
- 视频到视频编辑
- 连接视频
- 图像生成(文本到图像)
在这个存储库中,你会发现各种可以和索拉一起使用的提示。我们根据视频的风格分配了不同的标签,让你可以根据标签快速找到提示示例(Prompt)和生成的视频,并根据需要进行修改。
虽然索拉尚未正式发布,但我们正在全面收集提示,以帮助你快速开始使用索拉生成您想要的视频。
点击查看更多示例
一位时尚女性穿着一件黑色皮夹克,一条长长的红色裙子和黑色靴子,手拿一个黑色的手提包,在热闹的东京街道上行走。周围充满了温暖的霓虹灯和动态的城市标识。她戴着太阳镜和红色口红,自信而随意地行走。街道潮湿而反光,形成了五彩灯光的镜面效果。许多行人在周围走动。
几只巨大的长毛猛犸象漫步在积雪覆盖的草地上,它们的长毛在微风中轻轻飘动,远处是积雪覆盖的树木和戏剧性的雪山,午后的光线和稀薄的云彩以及高高悬挂的太阳形成了温暖的光芒。低角度的摄像视角令人惊叹,捕捉到了这些大型毛茸茸的哺乳动物和美丽的摄影,景深感非常强烈。
一个电影预告片,讲述了一位30岁的太空人的冒险故事,他戴着一顶红色的羊毛编织头盔,蓝天,盐沙漠,电影风格,35mm胶片拍摄,色彩生动。
无人机俯视着波涛汹涌的大苏尔加雷角海滩的崎岖悬崖。蓝色的海水拍打着,形成了白色的波浪,而夕阳的金光照亮了岩石海岸。远处有一座灯塔的小岛,悬崖边覆盖着绿色的灌木。从道路到海滩的陡峭下滑是一个戏剧性的壮举,悬崖边突出在海面上。这是一个捕捉到海岸的原始美和太平洋海岸公路崎岖风景的景色。
动画场景展示了一个近距离的短毛怪兽跪在一个正在融化的红色蜡烛旁边。艺术风格是3D和逼真的,重点放在光线和纹理上。画面的情绪是惊奇和好奇,怪兽睁着大眼睛,张着大嘴盯着火焰看。它的姿势和表情传达出一种天真和俏皮的感觉,好像它是第一次探索周围的世界一样。温暖色调和戏剧性的光线进一步增强了图像的舒适氛围。
一个华丽的纸艺世界,一个丰富多彩的珊瑚礁,到处都是色彩缤纷的鱼类和海洋生物。
这个特写镜头展示了维多利亚皇冠鸽子引人注目的蓝色羽毛和红色胸膛。它的羽冠由精致的蕾丝羽毛制成,而它的眼睛是醒目的红色。鸟的头微微倾斜,给人一种威严和威严的印象。背景模糊,突出了鸟的引人注目的外观。
两艘海盗船激战的写实特写视频,它们在一杯咖啡中航行。
一位20岁左右的年轻男子坐在天空中的一块云朵上,读着一本书。
加利福尼亚淘金热的历史影像。
一个玻璃球的特写视角,里面有一个有竹林的禅园,一个小矮人正在禅园里耙平沙子并在沙子上创造图案。
在魔幻的黄昏中,一个24岁女子的眼睛在眨眼,站在马拉喀什,70毫米胶片拍摄的电影,景深,鲜艳的色彩,电影感觉的摄影。
一只卡通袋鼠在迪斯科舞动。
一个美丽的自制视频,展示了2056年尼日利亚拉各斯的人们。使用手机摄像头拍摄。
一个培养着许多彩色鱼类和海洋生物的珊瑚礁的华丽渲染的纸艺世界。
摄像机围绕着一堆大量显示不同节目的老式电视,1950年代的科幻电影,恐怖电影,新闻,静态画面,1970年代的情景喜剧等,设置在纽约一家大型博物馆画廊内。
3D动画中,一个小,圆,毛茸茸的生物,有着大大的,有表情的眼睛,探索着一个充满生机的,神奇的魔法森林。这个生物是兔子和松鼠的奇妙融合,有着柔软的蓝色皮毛和一条松软的,带条纹的尾巴。它在闪闪发光的小溪旁跳跃,眼睛里充满了惊奇。森林中充满了魔法元素:发光并变色的花朵,紫色和银色树叶的树木,以及看起来像萤火虫的小飘浮灯光。生物停下来和一群像仙子一样的小精灵玩耍,围绕着一个蘑菇环舞动。生物惊叹地抬头望着一棵巨大的,发着光的树,它似乎是森林的心脏。
摄像机跟随着一辆白色老式SUV,车顶有一个黑色行李架,它快速地驶过陡峭的山路,周围是松树,车轮的灰尘飞扬,阳光照在SUV上,照在山路上,给整个场景带来了温暖的光芒。土路缓缓弯曲,远处看不到其他汽车或车辆。路两旁的树是红杉树,零零散散地散布着绿色植被。车辆从后方视角看上去轻松地跟着弯道转弯,好像它在崎岖的地形中行驶一样。土路本身被陡峭的山丘和山脉所环绕,天空晴朗,白云飘荡。
火车途经东京郊区的窗户反射。
无人机摄影展示了一座建在阿马尔菲海岸岩石高地上的美丽历史教堂,视角展示了历史悠久且宏伟的建筑细节,以及分层的路径和露台,海平面下的海浪拍打在下方的岩石上,远眺海岸水域和意大利阿马尔菲海岸的丘陵风景,几个远处的人在走动,并在欣赏悬崖海景的露台上欣赏风景,午后的阳光营造出一种神奇和浪漫的氛围,摄影以美丽的摄影捕捉了这一场景。
一个大号橙色章鱼躺在海底,与沙质和岩石的地形融为一体。它的触手围绕着身体,眼睛闭着。章鱼不知道一只螃蟹从岩石后爬向它,它的爪子举起准备进攻。螃蟹是棕色的,长满刺,有着长腿和触角。镜头采用广角拍摄,展示了海洋的广阔和深度。水是清澈的蓝色,阳光透过水面,形成光束。镜头清晰锐利,动态范围高。章鱼和螃蟹都处于焦点状态,而背景略微模糊,产生了眨眼,站在马拉喀什,70毫米胶片拍摄的电影,景深,鲜艳的色彩,电影感觉的摄影。
一只卡通袋鼠在迪斯科舞动。
一个美丽的自制视频,展示了2056年尼日利亚拉各斯的人们。使用手机摄像头拍摄。
一个培养着许多彩色鱼类和海洋生物的珊瑚礁的华丽渲染的纸艺世界。
摄像机围绕着一堆大量显示不同节目的老式电视,1950年代的科幻电影,恐怖电影,新闻,静态画面,1970年代的情景喜剧等,设置在纽约一家大型博物馆画廊内。
3D动画中,一个小,圆,毛茸茸的生物,有着大大的,有表情的眼睛,探索着一个充满生机的,神奇的魔法森林。这个生物是兔子和松鼠的奇妙融合,有着柔软的蓝色皮毛和一条松软的,带条纹的尾巴。它在闪闪发光的小溪旁跳跃,眼睛里充满了惊奇。森林中充满了魔法元素:发光并变色的花朵,紫色和银色树叶的树木,以及看起来像萤火虫的小飘浮灯光。生物停下来和一群像仙子一样的小精灵玩耍,围绕着一个蘑菇环舞动。生物惊叹地抬头望着一棵巨大的,发着光的树,它似乎是森林的心脏。
摄像机跟随着一辆白色老式SUV,车顶有一个黑色行李架,它快速地驶过陡峭的山路,周围是松树,车轮的灰尘飞扬,阳光照在SUV上,照在山路上,给整个场景带来了温暖的光芒。土路缓缓弯曲,远处看不到其他汽车或车辆。路两旁的树是红杉树,零零散散地散布着绿色植被。车辆从后方视角看上去轻松地跟着弯道转弯,好像它在崎岖的地形中行驶一样。土路本身被陡峭的山丘和山脉所环绕,天空晴朗,白云飘荡。
火车途经东京郊区的窗户反射。
无人机摄影展示了一座建在阿马尔菲海岸岩石高地上的美丽历史教堂,视角展示了历史悠久且宏伟的建筑细节,以及分层的路径和露台,海平面下的海浪拍打在下方的岩石上,远眺海岸水域和意大利阿马尔菲海岸的丘陵风景,几个远处的人在走动,并在欣赏悬崖海景的露台上欣赏风景,午后的阳光营造出一种神奇和浪漫的氛围,摄影以美丽的摄影捕捉了这一场景。
一只巨大的橙色章鱼栖息在海底,与沙质和岩石地形融为一体。 它的触手散布在身体周围,眼睛紧闭。 章鱼没有意识到一只帝王蟹正从岩石后面爬向它,它的爪子举起并准备攻击。 螃蟹呈棕色,多刺,有长腿和触角。 该场景是从广角拍摄的,展现了海洋的浩瀚和深度。 海水清澈碧蓝,阳光透过来。 镜头锐利、清晰,具有高动态范围。 章鱼和螃蟹清晰对焦,背景略微模糊,营造出景深效果。
点击查看更多示例
- 一只小熊猫和一只巨嘴鸟是最好的朋友,在圣托里尼的蓝色时刻散步。 生成视频链接
- 一名水肺潜水员发现了一个隐藏的未来主义沉船,里面有赛博海洋生物和先进的外星科技。 生成视频链接
- 特写镜头展示了一只雄伟的白色龙,拥有珍珠般的银边鳞片、冰蓝色的眼睛、优雅的象牙色角和雾气般的呼吸。着重展示了详细的面部特征和有纹理的鳞片,背景柔和模糊。 生成视频链接
- 在一个精美渲染的纸艺世界中,一艘蒸汽船穿越辽阔的海洋,天空中有薄云。遥远的草坡在背景中若隐若现,纸艺海洋表面附近可见一些海洋生物。 生成视频链接
- 一个人在夏威夷的热带水域进行BASE跳伞。他的宠物金刚鹦鹉在他身边飞行。 生成视频链接
- 一个被黑暗霓虹灯光照亮的热带雨林,充满了奇幻的动植物和动物。 生成视频链接
- 一只玻璃制成的乌龟,裂缝用金缮修复过,正走在日落时分的黑沙滩上。 生成视频链接
- 一群萨摩耶小狗学习成为厨师的宣传片。 生成视频链接
- 一群冒险的小狗在天空废墟中探险的宣传片。 生成视频链接
- 夜间镜头,一个寄居蟹把白炽灯泡当做壳。 生成视频链接
- Minecraft使用最华丽的高清8K材质包。 生成视频链接
- 这个近景镜头展示了一只未来主义赛博德国牧羊犬,展示了它引人注目的棕黑色毛发。 生成视频链接
- 一个蚂蚁在蚁巢内部导航的POV镜头。 生成视频链接
- 一片叶子的微距镜头,展示了微小的火车穿过它的叶脉。 生成视频链接
- 一只白色和橙色虎斑巷猫在大雨中穿过后巷,寻找庇护所。 生成视频链接
- 一只可以游泳的蝴蝶在美丽的珊瑚礁下水中航行的逼真视频。 生成视频链接
- 一只巨大的鸭子走过波士顿的街道。 生成视频链接
- 相机下降并放大,俯瞰着美丽的海洋和历史建筑,沿着悬崖上的壮丽海岸风景小镇。 生成视频链接
- 一个由水构成的行走人形体参观一个艺术画廊,里面有许多不同风格的美丽作品。 生成视频链接
- 一个绿色的斑点和一个橙色的斑点相爱并一起跳舞。 生成视频链接
- 一个阴森的闹鬼大宅,友好的南瓜灯和鬼魂角色欢迎着来敲门的孩子们,倾斜移轴摄影。 生成视频链接
- 一个巨大的教堂被猫填满。你看到的地方到处都是猫。一个男人走进教堂,在一只巨大的猫王宝座前鞠躬。 生成视频链接
- 人们在海滩放松的逼真视频,然后一条鲨鱼从水中跳出,惊吓了所有人。 生成视频链接
点击查看更多示例
- 穿着雄伟王冠的小土豆国王,坐在王座上,监视着他们的土豆王国,充满了土豆臣民和土豆城堡。 生成视频链接
- 一家装饰有室内植物的咖啡馆的微缩地图。 木梁在上方交叉,一台冷冻咖啡站用小瓶子和玻璃杯装点着。 生成视频链接
- 一个拼成“SORA”字样的逼真云图。 生成视频链接
- 公园里的猴子下棋。 生成视频链接
- 叶子的微距镜头,展示了微小的火车穿过它的叶脉。 生成视频链接
- 一只黑色连帽卫衣的计算机黑客拉布拉多在计算机前面坐着,屏幕的反光照在狗脸上,它正在快速打字。 生成视频链接
- 低角度摄影紧随丛林中的蚂蚁,进入地面,进入它们的世界。 生成视频链接
- 比萨斜塔。 生成视频链接
- 一个低质量、视觉上令人失望的超级碗广告。 生成视频链接
- 35毫米电影胶片拍摄
- 70毫米电影胶片拍摄
- 使用手机相机拍摄
- 电影感
- 3D数字渲染艺术风格
- 宽幅镜头
- 黑白影调
- 老电影风格的颗粒感
- 日落金时光
- 星轨长曝光
- 街头纪实风格
- HDR高动态范围
- 慢动作拍摄
- 时光流逝摄影
- 创意光绘
- 虚拟现实全景
- 微距摄影
拍摄技巧
- 景深
- 特写镜头
- 画面清晰锐利,具有浅景深
- 鲜艳的色彩
- 稳定镜头:去除抖动,保持画面稳定。
- 色彩校正:调整视频的色温、饱和度、对比度等。
- 光线效果:模拟自然光、背光或特殊光源效果。
- 绿幕抠图:将特定颜色(通常是绿色或蓝色)的背景替换为其他画面。
- 视频转场:平滑或创意地过渡两个镜头之间的切换。
- 文字动画:文字的出现、消失或移动效果。
- 时间线编辑:对视频片段进行裁剪、拼接、速度调整等。
- 复古风格:模仿旧电影或某个时代的视觉效果。
- 动漫风格:将视频处理成类似动漫或手绘的艺术风格。
- 科幻风格:赋予视频未来主义或科幻电影的视觉特征。
- 梦幻效果:使用模糊、光晕等效果创造出梦幻般的视觉体验。
- 纪录片风格:模仿纪录片的摄影和剪辑手法。
- 欢快:通过明亮的色彩、快节奏的剪辑传达快乐情绪。
- 怀旧:使用温暖的色调和复古转场回忆过去。
- 紧张:通过快速剪辑、突兀的音效制造紧张气氛。
- 浪漫:利用柔和的光线、慢动作和温馨的背景音乐营造浪漫氛围。
- VR/360度视频:支持创建或编辑虚拟现实视频。
- AR效果:添加增强现实元素和图层。
- 音频效果:背景音乐、声音混音、音频过滤和效果处理。
- 互动视频:允许创作者加入互动元素,如点击跳转、问卷调查等。
SoraEase 提供了开发工具和资源,为每个人简化 Sora 的 AI 视频技术,开发者更好的利用我们的各个工具做 Sora 开发,用户可以更方便的与我们一起使用我们的工具完成人工智能视频创作。
- GitHub地址:SoraEase GitHub
-
加入我们的社群:添加 Wechat nsddd_top 并回复
sora
进群。在我们的微信社群中,你可以获取 Sora 的最新咨询,技术分享,同时也是Sora爱好者和开发者的交流平台。
我们期待你的加入,一起探索Sora技术的无限可能!
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for sora-prompt-zh
Similar Open Source Tools

sora-prompt-zh
Sora-prompt-zh is a repository providing guidance on using Sora in various scenarios, learning how to make it understand your commands, and exploring Sora's multiple applications. It offers AI models that can create realistic and imaginative scenes from OpenAI's text instructions. The repository includes prompts for generating videos, animations, video editing, image generation, and more. Users can find examples and generated videos based on different video styles and modify them as needed. Although Sora is not officially released yet, the repository aims to collect prompts to help users quickly start using Sora to generate desired videos.

GoMaxAI-ChatGPT-Midjourney-Pro
GoMaxAI Pro is an AI-powered application for personal, team, and enterprise private operations. It supports various models like ChatGPT, Claude, Gemini, Kimi, Wenxin Yiyuan, Xunfei Xinghuo, Tsinghua Zhipu, Suno-v3.5, and Luma-video. The Pro version offers a new UI interface, member points system, management backend, homepage features, support for various content formats, AI video capabilities, SAAS multi-opening function, bug fixes, and more. It is built using web frontend with Vue3, mobile frontend with Uniapp, management frontend with Vue3, backend with Nodejs, and uses MySQL5.7(+) + Redis for data support. It can be deployed on Linux, Windows, or MacOS, with data storage options including local storage, Aliyun OSS, Tencent Cloud COS, and Chevereto image bed.

KubeDoor
KubeDoor is a microservice resource management platform developed using Python and Vue, based on K8S admission control mechanism. It supports unified remote storage, monitoring, alerting, notification, and display for multiple K8S clusters. The platform focuses on resource analysis and control during daily peak hours of microservices, ensuring consistency between resource request rate and actual usage rate.

Operit
Operit AI is a fully functional AI assistant application for mobile devices, running independently on Android devices with powerful tool invocation capabilities. It offers over 40 built-in tools for file system operations, HTTP requests, system operations, UI automation, and media processing. The app combines these tools with rich plugins to enable a wide range of tasks, from simple to complex, providing a comprehensive experience of a smartphone AI assistant.

aituber-kit
AITuber-Kit is a tool that enables users to interact with AI characters, conduct AITuber live streams, and engage in external integration modes. Users can easily converse with AI characters using various LLM APIs, stream on YouTube with AI character reactions, and send messages to server apps via WebSocket. The tool provides settings for API keys, character configurations, voice synthesis engines, and more. It supports multiple languages and allows customization of VRM models and background images. AITuber-Kit follows the MIT license and offers guidelines for adding new languages to the project.

omnia
Omnia is a deployment tool designed to turn servers with RPM-based Linux images into functioning Slurm/Kubernetes clusters. It provides an Ansible playbook-based deployment for Slurm and Kubernetes on servers running an RPM-based Linux OS. The tool simplifies the process of setting up and managing clusters, making it easier for users to deploy and maintain their infrastructure.

get_jobs
Get Jobs is a tool designed to help users find and apply for job positions on various recruitment platforms in China. It features AI job matching, automatic cover letter generation, multi-platform job application, automated filtering of inactive HR and headhunter positions, real-time WeChat message notifications, blacklisted company updates, driver adaptation for Win11, centralized configuration, long-lasting cookie login, XPathHelper plugin, global logging, and more. The tool supports platforms like Boss直聘, 猎聘, 拉勾, 51job, and 智联招聘. Users can configure the tool for customized job searches and applications.

easyaiot
EasyAIoT is an AI cloud platform designed to support camera integration, annotation, training, inference, data collection, analysis, alerts, recording, storage, and deployment. It aims to provide a zero-threshold AI experience for everyone, with a focus on cameras below a hundred levels. The platform consists of five core projects: WEB module for frontend management, DEVICE module for device management, VIDEO module for video processing, AI module for AI analysis, and TASK module for high-performance task execution. EasyAIoT combines Java, Python, and C++ to create a versatile and user-friendly AIoT platform.

LxgwZhenKai
LxgwZhenKai is a Chinese font derived from LXGW WenKai, manually adjusted for boldness and supplemented with AI assistance for character additions. The font aims to provide a comfortable reading experience on screens while also serving as a bold version of LXGW WenKai for temporary use. It contains over 13,000 characters, including common simplified and traditional Chinese characters, and is licensed under SIL Open Font License 1.1. Users are allowed to freely use, distribute, modify, and create derivative fonts based on LxgwZhenKai.

Awesome-Chinese-LLM
Analyze the following text from a github repository (name and readme text at end) . Then, generate a JSON object with the following keys and provide the corresponding information for each key, ,'for_jobs' (List 5 jobs suitable for this tool,in lowercase letters), 'ai_keywords' (keywords of the tool,in lowercase letters), 'for_tasks' (list of 5 specific tasks user can use this tool to do,in less than 3 words,Verb + noun form,in daily spoken language,in lowercase letters).Answer in english languagesname:Awesome-Chinese-LLM readme:# Awesome Chinese LLM   An Awesome Collection for LLM in Chinese 收集和梳理中文LLM相关    自ChatGPT为代表的大语言模型(Large Language Model, LLM)出现以后,由于其惊人的类通用人工智能(AGI)的能力,掀起了新一轮自然语言处理领域的研究和应用的浪潮。尤其是以ChatGLM、LLaMA等平民玩家都能跑起来的较小规模的LLM开源之后,业界涌现了非常多基于LLM的二次微调或应用的案例。本项目旨在收集和梳理中文LLM相关的开源模型、应用、数据集及教程等资料,目前收录的资源已达100+个! 如果本项目能给您带来一点点帮助,麻烦点个⭐️吧~ 同时也欢迎大家贡献本项目未收录的开源模型、应用、数据集等。提供新的仓库信息请发起PR,并按照本项目的格式提供仓库链接、star数,简介等相关信息,感谢~

douyin-chatgpt-bot
Douyin ChatGPT Bot is an AI-driven system for automatic replies on Douyin, including comment and private message replies. It offers features such as comment filtering, customizable robot responses, and automated account management. The system aims to enhance user engagement and brand image on the Douyin platform, providing a seamless experience for managing interactions with followers and potential customers.

ai_paper
The AI Paper tool is a powerful platform for generating various types of academic papers, including graduation papers, course papers, journal papers, title papers, opening reports, task books, AIGC reduction, and custom literature. It supports unlimited revisions, AI 4.0 technology, AIGC rate reduction, custom references, outlines, feeding of materials, various types of charts and tables, multiple languages, and anonymous mode. The tool offers a wide range of paper formats and supports tasks like writing papers, assignments, internship reports, job planning, and long-form writing.

WeChatMsg
WeChatMsg is a tool designed to help users manage and analyze their WeChat data. It aims to provide users with the ability to preserve their precious memories and create a personalized AI companion. The tool allows users to extract and export various types of data from WeChat, such as text, images, contacts, and more. Additionally, it offers features like analyzing chat data and generating visual annual reports. WeChatMsg is built on the idea of empowering users to take control of their data and foster emotional connections through technology.

Chenyme-AAVT
Chenyme-AAVT is a user-friendly tool that provides automatic video and audio recognition and translation. It leverages the capabilities of Whisper, a powerful speech recognition model, to accurately identify speech in videos and audios. The recognized speech is then translated using ChatGPT or KIMI, ensuring high-quality translations. With Chenyme-AAVT, you can quickly generate字幕 files and merge them with the original video, making video translation a breeze. The tool supports various languages, allowing you to translate videos and audios into your desired language. Additionally, Chenyme-AAVT offers features such as VAD (Voice Activity Detection) to enhance recognition accuracy, GPU acceleration for faster processing, and support for multiple字幕 formats. Whether you're a content creator, translator, or anyone looking to make video translation more efficient, Chenyme-AAVT is an invaluable tool.

agenta
Agenta is an open-source LLM developer platform for prompt engineering, evaluation, human feedback, and deployment of complex LLM applications. It provides tools for prompt engineering and management, evaluation, human annotation, and deployment, all without imposing any restrictions on your choice of framework, library, or model. Agenta allows developers and product teams to collaborate in building production-grade LLM-powered applications in less time.

activepieces
Activepieces is an open source replacement for Zapier, designed to be extensible through a type-safe pieces framework written in Typescript. It features a user-friendly Workflow Builder with support for Branches, Loops, and Drag and Drop. Activepieces integrates with Google Sheets, OpenAI, Discord, and RSS, along with 80+ other integrations. The list of supported integrations continues to grow rapidly, thanks to valuable contributions from the community. Activepieces is an open ecosystem; all piece source code is available in the repository, and they are versioned and published directly to npmjs.com upon contributions. If you cannot find a specific piece on the pieces roadmap, please submit a request by visiting the following link: Request Piece Alternatively, if you are a developer, you can quickly build your own piece using our TypeScript framework. For guidance, please refer to the following guide: Contributor's Guide
For similar tasks

InvokeAI
InvokeAI is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies. InvokeAI offers an industry leading Web Interface, interactive Command Line Interface, and also serves as the foundation for multiple commercial products.

Open-Sora-Plan
Open-Sora-Plan is a project that aims to create a simple and scalable repo to reproduce Sora (OpenAI, but we prefer to call it "ClosedAI"). The project is still in its early stages, but the team is working hard to improve it and make it more accessible to the open-source community. The project is currently focused on training an unconditional model on a landscape dataset, but the team plans to expand the scope of the project in the future to include text2video experiments, training on video2text datasets, and controlling the model with more conditions.

comflowyspace
Comflowyspace is an open-source AI image and video generation tool that aims to provide a more user-friendly and accessible experience than existing tools like SDWebUI and ComfyUI. It simplifies the installation, usage, and workflow management of AI image and video generation, making it easier for users to create and explore AI-generated content. Comflowyspace offers features such as one-click installation, workflow management, multi-tab functionality, workflow templates, and an improved user interface. It also provides tutorials and documentation to lower the learning curve for users. The tool is designed to make AI image and video generation more accessible and enjoyable for a wider range of users.

Rewind-AI-Main
Rewind AI is a free and open-source AI-powered video editing tool that allows users to easily create and edit videos. It features a user-friendly interface, a wide range of editing tools, and support for a variety of video formats. Rewind AI is perfect for beginners and experienced video editors alike.

MoneyPrinterTurbo
MoneyPrinterTurbo is a tool that can automatically generate video content based on a provided theme or keyword. It can create video scripts, materials, subtitles, and background music, and then compile them into a high-definition short video. The tool features a web interface and an API interface, supporting AI-generated video scripts, customizable scripts, multiple HD video sizes, batch video generation, customizable video segment duration, multilingual video scripts, multiple voice synthesis options, subtitle generation with font customization, background music selection, access to high-definition and copyright-free video materials, and integration with various AI models like OpenAI, moonshot, Azure, and more. The tool aims to simplify the video creation process and offers future plans to enhance voice synthesis, add video transition effects, provide more video material sources, offer video length options, include free network proxies, enable real-time voice and music previews, support additional voice synthesis services, and facilitate automatic uploads to YouTube platform.

Dough
Dough is a tool for crafting videos with AI, allowing users to guide video generations with precision using images and example videos. Users can create guidance frames, assemble shots, and animate them by defining parameters and selecting guidance videos. The tool aims to help users make beautiful and unique video creations, providing control over the generation process. Setup instructions are available for Linux and Windows platforms, with detailed steps for installation and running the app.

ragdoll-studio
Ragdoll Studio is a platform offering web apps and libraries for interacting with Ragdoll, enabling users to go beyond fine-tuning and create flawless creative deliverables, rich multimedia, and engaging experiences. It provides various modes such as Story Mode for creating and chatting with characters, Vector Mode for producing vector art, Raster Mode for producing raster art, Video Mode for producing videos, Audio Mode for producing audio, and 3D Mode for producing 3D objects. Users can export their content in various formats and share their creations on the community site. The platform consists of a Ragdoll API and a front-end React application for seamless usage.

Whisper-TikTok
Discover Whisper-TikTok, an innovative AI-powered tool that leverages the prowess of Edge TTS, OpenAI-Whisper, and FFMPEG to craft captivating TikTok videos. Whisper-TikTok effortlessly generates accurate transcriptions from audio files and integrates Microsoft Edge Cloud Text-to-Speech API for vibrant voiceovers. The program orchestrates the synthesis of videos using a structured JSON dataset, generating mesmerizing TikTok content in minutes.
For similar jobs

promptflow
**Prompt flow** is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.

deepeval
DeepEval is a simple-to-use, open-source LLM evaluation framework specialized for unit testing LLM outputs. It incorporates various metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., and runs locally on your machine for evaluation. It provides a wide range of ready-to-use evaluation metrics, allows for creating custom metrics, integrates with any CI/CD environment, and enables benchmarking LLMs on popular benchmarks. DeepEval is designed for evaluating RAG and fine-tuning applications, helping users optimize hyperparameters, prevent prompt drifting, and transition from OpenAI to hosting their own Llama2 with confidence.

MegaDetector
MegaDetector is an AI model that identifies animals, people, and vehicles in camera trap images (which also makes it useful for eliminating blank images). This model is trained on several million images from a variety of ecosystems. MegaDetector is just one of many tools that aims to make conservation biologists more efficient with AI. If you want to learn about other ways to use AI to accelerate camera trap workflows, check out our of the field, affectionately titled "Everything I know about machine learning and camera traps".

leapfrogai
LeapfrogAI is a self-hosted AI platform designed to be deployed in air-gapped resource-constrained environments. It brings sophisticated AI solutions to these environments by hosting all the necessary components of an AI stack, including vector databases, model backends, API, and UI. LeapfrogAI's API closely matches that of OpenAI, allowing tools built for OpenAI/ChatGPT to function seamlessly with a LeapfrogAI backend. It provides several backends for various use cases, including llama-cpp-python, whisper, text-embeddings, and vllm. LeapfrogAI leverages Chainguard's apko to harden base python images, ensuring the latest supported Python versions are used by the other components of the stack. The LeapfrogAI SDK provides a standard set of protobuffs and python utilities for implementing backends and gRPC. LeapfrogAI offers UI options for common use-cases like chat, summarization, and transcription. It can be deployed and run locally via UDS and Kubernetes, built out using Zarf packages. LeapfrogAI is supported by a community of users and contributors, including Defense Unicorns, Beast Code, Chainguard, Exovera, Hypergiant, Pulze, SOSi, United States Navy, United States Air Force, and United States Space Force.

llava-docker
This Docker image for LLaVA (Large Language and Vision Assistant) provides a convenient way to run LLaVA locally or on RunPod. LLaVA is a powerful AI tool that combines natural language processing and computer vision capabilities. With this Docker image, you can easily access LLaVA's functionalities for various tasks, including image captioning, visual question answering, text summarization, and more. The image comes pre-installed with LLaVA v1.2.0, Torch 2.1.2, xformers 0.0.23.post1, and other necessary dependencies. You can customize the model used by setting the MODEL environment variable. The image also includes a Jupyter Lab environment for interactive development and exploration. Overall, this Docker image offers a comprehensive and user-friendly platform for leveraging LLaVA's capabilities.

carrot
The 'carrot' repository on GitHub provides a list of free and user-friendly ChatGPT mirror sites for easy access. The repository includes sponsored sites offering various GPT models and services. Users can find and share sites, report errors, and access stable and recommended sites for ChatGPT usage. The repository also includes a detailed list of ChatGPT sites, their features, and accessibility options, making it a valuable resource for ChatGPT users seeking free and unlimited GPT services.

TrustLLM
TrustLLM is a comprehensive study of trustworthiness in LLMs, including principles for different dimensions of trustworthiness, established benchmark, evaluation, and analysis of trustworthiness for mainstream LLMs, and discussion of open challenges and future directions. Specifically, we first propose a set of principles for trustworthy LLMs that span eight different dimensions. Based on these principles, we further establish a benchmark across six dimensions including truthfulness, safety, fairness, robustness, privacy, and machine ethics. We then present a study evaluating 16 mainstream LLMs in TrustLLM, consisting of over 30 datasets. The document explains how to use the trustllm python package to help you assess the performance of your LLM in trustworthiness more quickly. For more details about TrustLLM, please refer to project website.

AI-YinMei
AI-YinMei is an AI virtual anchor Vtuber development tool (N card version). It supports fastgpt knowledge base chat dialogue, a complete set of solutions for LLM large language models: [fastgpt] + [one-api] + [Xinference], supports docking bilibili live broadcast barrage reply and entering live broadcast welcome speech, supports Microsoft edge-tts speech synthesis, supports Bert-VITS2 speech synthesis, supports GPT-SoVITS speech synthesis, supports expression control Vtuber Studio, supports painting stable-diffusion-webui output OBS live broadcast room, supports painting picture pornography public-NSFW-y-distinguish, supports search and image search service duckduckgo (requires magic Internet access), supports image search service Baidu image search (no magic Internet access), supports AI reply chat box [html plug-in], supports AI singing Auto-Convert-Music, supports playlist [html plug-in], supports dancing function, supports expression video playback, supports head touching action, supports gift smashing action, supports singing automatic start dancing function, chat and singing automatic cycle swing action, supports multi scene switching, background music switching, day and night automatic switching scene, supports open singing and painting, let AI automatically judge the content.