Best AI tools for< produce immersive videos >

20 - AI tool Sites

Avataar

**Avataar** is a generative AI platform for spatial storytelling that empowers users to effortlessly create and share immersive 3D experiences. With its suite of AI-powered tools, Avataar enables users to: * **Build 3D stories and videos:** Craft compelling narratives and experiences using a range of 3D assets and interactive elements. * **Reimagine visual discovery:** Showcase products and services in a captivating and engaging way, enhancing customer engagement and driving conversions. * **Enhance decision-making:** Leverage contextual reality to provide users with immersive and informative experiences, aiding in informed decision-making. Avataar's platform offers a range of features that streamline the creation and delivery of spatial content, including: * **3D from video capture:** Effortlessly generate 3D models from real-world objects using the Incarnate app. * **Import/Export 3D models:** Seamlessly integrate existing 3D models and edit them with preferred tools for complete creative control. * **Create 3D rooms:** Design and furnish stunning 3D spaces, bringing virtual creations to life. * **Generate 3D objects from single 2D images:** Unleash creativity by transforming simple images into compelling 3D models. * **AI-led cost and time efficiencies:** Leverage generative AI to optimize workflows, reducing production time and costs. Avataar's platform finds applications across various industries, including furniture, electronics, footwear, marketplaces, bedding, and luggage. It empowers users to: * **Amplify impact across all consumer touchpoints:** Create immersive product experiences that engage customers and drive conversions. * **Visualize lifestyle looks with virtual photos:** Showcase products in realistic settings, helping customers envision them in their own spaces. * **Configure products in 3D:** Allow customers to customize and visualize products in 3D, enhancing the shopping experience. * **Create AR ads:** Design interactive and attention-grabbing AR ads that captivate audiences. * **Produce virtual videos:** Craft immersive and informative videos that provide detailed product views. * **Evaluate product fit with life-size XR/AR:** Enable customers to virtually try on or place products in their homes, ensuring a perfect fit. Avataar fosters a vibrant community of creators who collaborate, share ideas, and push the boundaries of spatial storytelling. By joining the community, users can access exclusive resources, connect with like-minded individuals, and stay updated on the latest advancements in the field.

site

: 25.0k

KLING AI

KLING AI is a cutting-edge video generation model developed by Kuaishou Kwai company. It can produce detailed and fluid videos at 1080p resolution and 30 frames per second, creating immersive visual experiences up to two minutes in length. The model excels in modeling intricate motion sequences and realistic physical interactions between objects, resulting in highly dynamic and lifelike scenes. From dance routines to action sequences, KLING AI blurs the line between artificial and authentic content.

site

: 0

Wizart

Wizart is a comprehensive platform that provides AI-powered visualization solutions for businesses. It offers a range of tools and services to help companies create engaging and immersive product visualizations, including a visualizer, material cloud, and vision API. With Wizart, businesses can eliminate the imagination gap and increase customer engagement by providing high-quality product content, such as renders, videos, and interactive models.

site

: 62.7k

Lumiere3D

Lumiere3D is an AI-powered 3D product video creation platform that enables businesses to create stunning and engaging product videos for marketing and sales purposes. With its intuitive video editor and advanced AI tools, Lumiere3D makes it easy for users to turn their product ideas into reality, even without any prior video editing experience. The platform offers a wide range of features, including AI-generated 3D scenes, dynamic camera movements, seamless transitions, and personalized effects, allowing users to create videos that capture the attention of their audience and leave a lasting impression.

site

: 20.4k

KERV Solutions

KERV is an AI-powered video and creative technology company that offers ad performance solutions, publisher revenue opportunities, in-show monetization solutions, and data and measurement services. Their patented image recognition and product correlation technology enable deeper relationships between publishers, brands, and consumers. KERV's AI technology makes any video explorable and shoppable with unrivaled speed and precision, delivering real business outcomes. They provide intelligent video solutions, active attention indexing, greater speed and precision, 1st party data insights, and brand safety measures.

site

: 30.1k

AudioShake

AudioShake is a cloud-based audio processing platform that uses artificial intelligence (AI) to separate audio into its component parts, such as vocals, music, and effects. This technology can be used for a variety of applications, including mixing and mastering, localization and captioning, interactive audio, and sync licensing.

site

: 35.6k

MagiScan

MagiScan is an AI-powered 3D scanner app for iOS and Android devices. It allows users to scan objects and create high-quality 3D models. The app is simple to use and affordable, making it a great option for both professionals and hobbyists. MagiScan has a variety of features that make it a powerful tool for 3D scanning. These features include: * **High-quality scanning:** MagiScan uses AI to generate high-quality 3D models. The app's algorithms can automatically remove noise and artifacts from scans, resulting in models that are accurate and detailed. * **Multiple export formats:** MagiScan can export 3D models in a variety of formats, including USDZ, GTLF, GLB, OBJ, STL, FBX, and PLY. This makes it easy to use MagiScan models in a variety of applications. * **Affordable:** MagiScan is one of the most affordable 3D scanner apps on the market. This makes it a great option for users who are on a budget. MagiScan is a versatile tool that can be used for a variety of applications. These applications include: * **E-commerce:** MagiScan can be used to create 3D models of products for e-commerce websites. This can help businesses to increase sales by providing customers with a more immersive shopping experience. * **3D printing:** MagiScan can be used to create 3D models for 3D printing. This can be useful for prototyping, manufacturing, and other applications. * **Education:** MagiScan can be used to create 3D models for educational purposes. This can help students to learn about 3D modeling and design. * **Entertainment:** MagiScan can be used to create 3D models for entertainment purposes. This can be useful for creating video games, movies, and other forms of media.

site

: 21.3k

NovelistAI

NovelistAI is a cutting-edge website that harnesses the power of artificial intelligence to generate completely original novels, stories, and interactive books. With NovelistAI, you can create your own personalized reading experience by selecting from an array of genres and styles. You can also write your own stories using our intuitive AI-powered tools. Whether you're a seasoned author or just starting out, NovelistAI has something to offer everyone.

site

: 25.4k

Touring

Touring is an immersive audio guiding system that adapts to your pace, needs, and interests. It harnesses the power of AI and geolocation to work everywhere in the world. With Touring, you can explore freely, avoid crowded tours, and get a private city tour right in your pocket. The app tailors to your interests, whether you fancy the arts, love history, or are a food enthusiast. You can ask about anything you see, sync with a group, pause and dive deeper, and even pick your voice. Touring leverages generative AI, geolocation, 3D spatial information, speech synthesis, and human-curated content to produce the world's most advanced real-time audio guiding system.

site

: 5.2k

Zolak

Zolak is an AI-powered visual commerce platform that provides immersive experiences for customers. It offers a range of products including a configurator, studio, and showroom, which allow businesses to create and distribute high-quality product images, customize products in real-time, and showcase products in virtual showrooms. Zolak's platform is designed to help businesses increase conversion rates, average order value, and repeat sales, while reducing the time spent creating content.

site

: 10.8k

Vossle

Vossle is an AI-powered cloud-based SaaS platform for businesses and agencies to create web-based augmented reality experiences. Reach millions of users instantly with App-less Augmented Reality (WebAR) Experience that works on every modern smartphone browser the moment you publish! No app installs are required! No need to write a line of code or to develop costly apps. Build immersive AR experiences without installing any apps.

site

: 1.4k

Moises App

Moises App is a music application powered by AI that provides musicians with a range of tools to enhance their practice and performance. With Moises App, users can separate vocals and instruments in any song, adjust the speed and pitch, and detect chords in real time. The app also includes a smart metronome and audio speed changer, making it an ideal tool for musicians of all levels. Moises App is available as a desktop application, iOS app, and web app, making it accessible to musicians on any device.

site

: 2.7m

Inworld

Inworld is a leading AI engine for games that provides developers with the tools to create groundbreaking game mechanics, dynamic NPCs, and worlds that evolve with each action. With Inworld, developers can unlock novel gameplay, create content at scale, improve player immersion, and future-proof their AI infrastructure. Inworld's platform includes the Inworld Engine, Inworld Studio, and Inworld Core, which provide developers with the tools they need to create next-generation games.

site

: 168.4k

Kaba

Kaba is an artificial intelligence that produces software and simulations that collaborate with you. It is the first AI native operating system, designed to be a hybrid system that can boot from disk or run on top of existing systems. Kaba is rooted in privacy and security, offering advanced policy controls that give you extensive utility over your data. It is also extensible and polymorphic, providing an immersive computing environment that can learn about you and present data in ways that are tailored to your particular style of consuming information.

site

: 10.7k

Gatherly

Gatherly is an AI-powered virtual events platform that makes it easy to create engaging and immersive experiences. With Gatherly, you can design custom events, create interactive agendas, and facilitate networking opportunities, all on a platform that lets you walk around and meet new people, just like in real life. Gatherly's AI brings your ideas to reality. Whether you're planning a conference, a networking event, or a social gathering, Gatherly makes it easy to create an unforgettable experience. Have a 10x better experience with a tenth of the work.

site

: 3.8k

Meltface Typeface

Meltface Typeface is a book about the future of design in the age of AI agents, spatial computing, and ambient UX. It is written by Casey Fictum, a designer and philosopher who has been thinking about the future of technology for over 20 years. The book is divided into nine chapters, each of which explores a different aspect of the future of design. Chapter 1, "The Dawn of Ambient Intelligence," discusses the rise of AI agents and their potential to change the way we live and work. Chapter 2, "Artificial - This Thing Isn't Human," explores the challenges of designing AI agents that are both useful and ethical. Chapter 3, "Spatial - Around My Reality," discusses the potential of spatial computing to create new and immersive experiences. Chapter 4, "Ambient - There, But Not," explores the concept of ambient UX and how it can be used to create more seamless and intuitive experiences. Chapter 5, "Actioned - Do Things on Our Behalf," discusses the potential of AI agents to automate tasks and help us get things done. Chapter 6, "Philosophy for AI Agent Design," provides a philosophical framework for designing AI agents that are both ethical and effective. Chapter 7, "Frameworks for the Future of Design," provides a set of frameworks for thinking about the future of design. Chapter 8, "Guessing the Future of UX Design," speculates on what the future of UX design might look like. Chapter 9, "Finding Meaning & Purpose in the Future of Design," discusses the challenges and opportunities of designing for a future that is increasingly shaped by AI.

site

: 0

Chat-docs AI

Chat-docs AI is a revolutionary tool that empowers users to interact with their PDFs in a conversational manner. It leverages advanced AI algorithms to transform documents into intelligent entities capable of answering questions, providing information, and clarifying complex concepts in real time. With Chat-docs AI, users can engage in direct dialogue with their course materials, scientific papers, books, financial reports, legal documents, and product user manuals, unlocking a new dimension of understanding and immersion. The tool's intuitive interface and natural language processing capabilities make it accessible to everyone, regardless of technical expertise. Chat-docs AI is committed to data security and privacy, employing end-to-end encryption and strict policies to safeguard user information.

site

: 6.1k

Hashmeta AI

Hashmeta AI is a digital human AI video production service that allows users to create realistic AI avatars and use them in videos. With Hashmeta AI, users can save time and money on video production, and they can also reach a global audience with their videos. Hashmeta AI's digital humans can be used for a variety of purposes, including education, marketing, and customer service.

site

: 322

Ideacadabra

Ideacadabra is an AI-powered content creation tool that helps creators generate ideas, titles, descriptions, thumbnails, scripts, songs, hashtags, and more for their YouTube, YouTube Shorts, Instagram, TikTok, and X (Twitter) content. The AI is trained on a massive dataset of content and trends, and it uses this knowledge to generate personalized ideas for each creator. Ideacadabra also provides feedback on existing content, helping creators to improve their content's performance. The tool is easy to use and affordable, making it a great option for creators of all levels.

site

: 1.4k

Blaze

Blaze is an AI-powered writing assistant that helps teams of one create high-quality marketing content, including blog posts, social media posts, ad copy, and more. With Blaze, you can create content that is on-brand, engaging, and optimized for search engines. Blaze also offers a variety of features to help you collaborate with your team and manage your content calendar.

site

: 321.4k

20 - Open Source AI Tools

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.

github

: 508

llm-course

The LLM course is divided into three parts: 1. 🧩 **LLM Fundamentals** covers essential knowledge about mathematics, Python, and neural networks. 2. 🧑‍🔬 **The LLM Scientist** focuses on building the best possible LLMs using the latest techniques. 3. 👷 **The LLM Engineer** focuses on creating LLM-based applications and deploying them. For an interactive version of this course, I created two **LLM assistants** that will answer questions and test your knowledge in a personalized way: * 🤗 **HuggingChat Assistant**: Free version using Mixtral-8x7B. * 🤖 **ChatGPT Assistant**: Requires a premium account. ## 📝 Notebooks A list of notebooks and articles related to large language models. ### Tools | Notebook | Description | Notebook | |----------|-------------|----------| | 🧐 LLM AutoEval | Automatically evaluate your LLMs using RunPod | ![Open In Colab](img/colab.svg) | | 🥱 LazyMergekit | Easily merge models using MergeKit in one click. | ![Open In Colab](img/colab.svg) | | 🦎 LazyAxolotl | Fine-tune models in the cloud using Axolotl in one click. | ![Open In Colab](img/colab.svg) | | ⚡ AutoQuant | Quantize LLMs in GGUF, GPTQ, EXL2, AWQ, and HQQ formats in one click. | ![Open In Colab](img/colab.svg) | | 🌳 Model Family Tree | Visualize the family tree of merged models. | ![Open In Colab](img/colab.svg) | | 🚀 ZeroSpace | Automatically create a Gradio chat interface using a free ZeroGPU. | ![Open In Colab](img/colab.svg) |

github

: 32.7k

AGI-Papers

This repository contains a collection of papers and resources related to Large Language Models (LLMs), including their applications in various domains such as text generation, translation, question answering, and dialogue systems. The repository also includes discussions on the ethical and societal implications of LLMs. **Description** This repository is a collection of papers and resources related to Large Language Models (LLMs). LLMs are a type of artificial intelligence (AI) that can understand and generate human-like text. They have a wide range of applications, including text generation, translation, question answering, and dialogue systems. **For Jobs** - **Content Writer** - **Copywriter** - **Editor** - **Journalist** - **Marketer** **AI Keywords** - **Large Language Models** - **Natural Language Processing** - **Machine Learning** - **Artificial Intelligence** - **Deep Learning** **For Tasks** - **Generate text** - **Translate text** - **Answer questions** - **Engage in dialogue** - **Summarize text**

github

: 243

Awesome-AITools

This repo collects AI-related utilities. ## All Categories * All Categories * ChatGPT and other closed-source LLMs * AI Search engine * Open Source LLMs * GPT/LLMs Applications * LLM training platform * Applications that integrate multiple LLMs * AI Agent * Writing * Programming Development * Translation * AI Conversation or AI Voice Conversation * Image Creation * Speech Recognition * Text To Speech * Voice Processing * AI generated music or sound effects * Speech translation * Video Creation * Video Content Summary * OCR(Optical Character Recognition)

github

: 3.5k

lobe-chat

Lobe Chat is an open-source, modern-design ChatGPT/LLMs UI/Framework. Supports speech-synthesis, multi-modal, and extensible ([function call][docs-functionc-call]) plugin system. One-click **FREE** deployment of your private OpenAI ChatGPT/Claude/Gemini/Groq/Ollama chat application.

github

: 32.6k

blog

这是一个程序员关于 ChatGPT 学习过程的记录，其中包括了 ChatGPT 的使用技巧、相关工具和资源的整理，以及一些个人见解和思考。 **使用技巧** * **充值 OpenAI API**：可以通过 https://beta.openai.com/account/api-keys 进行充值，支持信用卡和 PayPal。 * **使用专梯**：推荐使用稳定的专梯，可以有效提高 ChatGPT 的访问速度和稳定性。 * **使用魔法**：可以通过 https://my.x-air.app:666/#/register?aff=32853 访问 ChatGPT，无需魔法即可访问。 * **下载各种 apk**：可以通过 https://apkcombo.com 下载各种安卓应用的 apk 文件。 * **ChatGPT 官网**：ChatGPT 的官方网站是 https://ai.com。 * **Midjourney**：Midjourney 是一个生成式 AI 图像平台，可以通过 https://midjourney.com 访问。 * **文本转视频**：可以通过 https://www.d-id.com 将文本转换为视频。 * **国内大模型**：国内也有很多大模型，如阿里巴巴的通义千问、百度文心一言、讯飞星火、阿里巴巴通义听悟等。 * **查看 OpenAI 状态**：可以通过 https://status.openai.com/ 查看 OpenAI 的服务状态。 * **Canva 画图**：Canva 是一个在线平面设计平台，可以通过 https://www.canva.cn 进行画图。 **相关工具和资源** * **文字转语音**：可以通过 https://modelscope.cn/models?page=1&tasks=text-to-speech&type=audio 找到文字转语音的模型。 * **可好好玩玩的项目**： * https://github.com/sunner/ChatALL * https://github.com/labring/FastGPT * https://github.com/songquanpeng/one-api * **个人博客**： * https://baoyu.io/ * https://gorden-sun.notion.site/527689cd2b294e60912f040095e803c5?v=4f6cc12006c94f47aee4dc909511aeb5 * **srt 2 lrc 歌词**：可以通过 https://gotranscript.com/subtitle-converter 将 srt 格式的字幕转换为 lrc 格式的歌词。 * **5 种速率限制**：OpenAI API 有 5 种速率限制：RPM（每分钟请求数）、RPD（每天请求数）、TPM（每分钟 tokens 数量）、TPD（每天 tokens 数量）、IPM（每分钟图像数量）。 * **扣子平台**：coze.cn 是一个扣子平台，可以提供各种扣子。 * **通过云函数免费使用 GPT-3.5**：可以通过 https://juejin.cn/post/7353849549540589587 免费使用 GPT-3.5。 * **不蒜子统计网页基数**：可以通过 https://busuanzi.ibruce.info/ 统计网页的基数。 * **视频总结和翻译网页**：可以通过 https://glarity.app/zh-CN 总结和翻译视频。 * **视频翻译和配音工具**：可以通过 https://github.com/jianchang512/pyvideotrans 翻译和配音视频。 * **文字生成音频**：可以通过 https://www.cnblogs.com/jijunjian/p/18118366 将文字生成音频。 * **memo ai**：memo.ac 是一个多模态 AI 平台，可以将视频链接、播客链接、本地音视频转换为文字，支持多语言转录后翻译，还可以将文字转换为新的音频。 * **视频总结工具**：可以通过 https://summarize.ing/ 总结视频。 * **可每天免费玩玩**：可以通过 https://www.perplexity.ai/ 每天免费玩玩。 * **Suno.ai**：Suno.ai 是一个 AI 语言模型，可以通过 https://bibigpt.co/ 访问。 * **CapCut**：CapCut 是一个视频编辑软件，可以通过 https://www.capcut.cn/ 下载。 * **Valla.ai**：Valla.ai 是一个多模态 AI 模型，可以通过 https://www.valla.ai/ 访问。 * **Viggle.ai**：Viggle.ai 是一个 AI 视频生成平台，可以通过 https://viggle.ai 访问。 * **使用免费的 GPU 部署文生图大模型**：可以通过 https://www.cnblogs.com/xuxiaona/p/18088404 部署文生图大模型。 * **语音转文字**：可以通过 https://speech.microsoft.com/portal 将语音转换为文字。 * **投资界的 ai**：可以通过 https://reportify.cc/ 了解投资界的 ai。 * **抓取小视频 app 的各种信息**：可以通过 https://github.com/NanmiCoder/MediaCrawler 抓取小视频 app 的各种信息。 * **马斯克 Grok1 开源**：马斯克的 Grok1 模型已经开源，可以通过 https://github.com/xai-org/grok-1 访问。 * **ChatALL**：ChatALL 是一个跨端支持的聊天机器人，可以通过 https://github.com/sunner/ChatALL 访问。 * **零一万物**：零一万物是一个 AI 平台，可以通过 https://www.01.ai/cn 访问。 * **智普**：智普是一个 AI 语言模型，可以通过 https://chatglm.cn/ 访问。 * **memo ai 下载**：可以通过 https://memo.ac/ 下载 memo ai。 * **ffmpeg 学习**：可以通过 https://www.ruanyifeng.com/blog/2020/01/ffmpeg.html 学习 ffmpeg。 * **自动生成文章小工具**：可以通过 https://www.cognition-labs.com/blog 生成文章。 * **简易商城**：可以通过 https://www.cnblogs.com/whuanle/p/18086537 搭建简易商城。 * **物联网**：可以通过 https://www.cnblogs.com/xuxiaona/p/18088404 学习物联网。 * **自定义表单、自定义列表、自定义上传和下载、自定义流程、自定义报表**：可以通过 https://www.cnblogs.com/whuanle/p/18086537 实现自定义表单、自定义列表、自定义上传和下载、自定义流程、自定义报表。 **个人见解和思考** * ChatGPT 是一个强大的工具，可以用来提高工作效率和创造力。 * ChatGPT 的使用门槛较低，即使是非技术人员也可以轻松上手。 * ChatGPT 的发展速度非常快，未来可能会对各个行业产生深远的影响。 * 我们应该理性看待 ChatGPT，既要看到它的优点，也要意识到它的局限性。 * 我们应该积极探索 ChatGPT 的应用场景，为社会创造价值。

github

: 68

LLMUnity

LLM for Unity enables seamless integration of Large Language Models (LLMs) within the Unity engine, allowing users to create intelligent characters for immersive player interactions. The tool supports major LLM models, runs locally without internet access, offers fast inference on CPU and GPU, and is easy to set up with a single line of code. It is free for both personal and commercial use, tested on Unity 2021 LTS, 2022 LTS, and 2023. Users can build multiple AI characters efficiently, use remote servers for processing, and customize model settings for text generation.

github

: 386

Awesome-AISourceHub

Awesome-AISourceHub is a repository that collects high-quality information sources in the field of AI technology. It serves as a synchronized source of information to avoid information gaps and information silos. The repository aims to provide valuable resources for individuals such as AI book authors, enterprise decision-makers, and tool developers who frequently use Twitter to share insights and updates related to AI advancements. The platform emphasizes the importance of accessing information closer to the source for better quality content. Users can contribute their own high-quality information sources to the repository by following specific steps outlined in the contribution guidelines. The repository covers various platforms such as Twitter, public accounts, knowledge planets, podcasts, blogs, websites, YouTube channels, and more, offering a comprehensive collection of AI-related resources for individuals interested in staying updated with the latest trends and developments in the AI field.

github

: 679

Mastering-GitHub-Copilot-for-Paired-Programming

Mastering GitHub Copilot for AI Paired Programming is a comprehensive course designed to equip you with the skills and knowledge necessary to harness the power of GitHub Copilot, an AI-driven coding assistant. Through a series of engaging lessons, you will learn how to seamlessly integrate GitHub Copilot into your workflow, leveraging its autocompletion, customizable features, and advanced programming techniques. This course is tailored to provide you with a deep understanding of AI-driven algorithms and best practices, enabling you to enhance code quality and accelerate your coding skills. By embracing the transformative power of AI paired programming, you will gain the tools and confidence needed to succeed in today's dynamic software development landscape.

github

: 4.3k

lionagi

LionAGI is a powerful intelligent workflow automation framework that introduces advanced ML models into any existing workflows and data infrastructure. It can interact with almost any model, run interactions in parallel for most models, produce structured pydantic outputs with flexible usage, automate workflow via graph based agents, use advanced prompting techniques, and more. LionAGI aims to provide a centralized agent-managed framework for "ML-powered tools coordination" and to dramatically lower the barrier of entries for creating use-case/domain specific tools. It is designed to be asynchronous only and requires Python 3.10 or higher.

github

: 259

storm

STORM is a LLM system that writes Wikipedia-like articles from scratch based on Internet search. While the system cannot produce publication-ready articles that often require a significant number of edits, experienced Wikipedia editors have found it helpful in their pre-writing stage. **Try out our [live research preview](https://storm.genie.stanford.edu/) to see how STORM can help your knowledge exploration journey and please provide feedback to help us improve the system 🙏!**

github

: 4.4k

redbox-copilot

Redbox Copilot is a retrieval augmented generation (RAG) app that uses GenAI to chat with and summarise civil service documents. It increases organisational memory by indexing documents and can summarise reports read months ago, supplement them with current work, and produce a first draft that lets civil servants focus on what they do best. The project uses a microservice architecture with each microservice running in its own container defined by a Dockerfile. Dependencies are managed using Python Poetry. Contributions are welcome, and the project is licensed under the MIT License.

github

: 65

gpt-researcher

GPT Researcher is an autonomous agent designed for comprehensive online research on a variety of tasks. It can produce detailed, factual, and unbiased research reports with customization options. The tool addresses issues of speed, determinism, and reliability by leveraging parallelized agent work. The main idea involves running 'planner' and 'execution' agents to generate research questions, seek related information, and create research reports. GPT Researcher optimizes costs and completes tasks in around 3 minutes. Features include generating long research reports, aggregating web sources, an easy-to-use web interface, scraping web sources, and exporting reports to various formats.

github

: 12.5k

TensorRT-Model-Optimizer

The NVIDIA TensorRT Model Optimizer is a library designed to quantize and compress deep learning models for optimized inference on GPUs. It offers state-of-the-art model optimization techniques including quantization and sparsity to reduce inference costs for generative AI models. Users can easily stack different optimization techniques to produce quantized checkpoints from torch or ONNX models. The quantized checkpoints are ready for deployment in inference frameworks like TensorRT-LLM or TensorRT, with planned integrations for NVIDIA NeMo and Megatron-LM. The tool also supports 8-bit quantization with Stable Diffusion for enterprise users on NVIDIA NIM. Model Optimizer is available for free on NVIDIA PyPI, and this repository serves as a platform for sharing examples, GPU-optimized recipes, and collecting community feedback.

github

: 220

falkon

Falkon is a Python implementation of the Falkon algorithm for large-scale, approximate kernel ridge regression. The code is optimized for scalability to large datasets with tens of millions of points and beyond. Full kernel matrices are never computed explicitly so that you will not run out of memory on larger problems. Preconditioned conjugate gradient optimization ensures that only few iterations are necessary to obtain good results. The basic algorithm is a Nyström approximation to kernel ridge regression, which needs only three hyperparameters: 1. The number of centers M - this controls the quality of the approximation: a higher number of centers will produce more accurate results at the expense of more computation time, and higher memory requirements. 2. The penalty term, which controls the amount of regularization. 3. The kernel function. A good default is always the Gaussian (RBF) kernel (`falkon.kernels.GaussianKernel`).

github

: 172

KULLM

KULLM (구름) is a Korean Large Language Model developed by Korea University NLP & AI Lab and HIAI Research Institute. It is based on the upstage/SOLAR-10.7B-v1.0 model and has been fine-tuned for instruction. The model has been trained on 8×A100 GPUs and is capable of generating responses in Korean language. KULLM exhibits hallucination and repetition phenomena due to its decoding strategy. Users should be cautious as the model may produce inaccurate or harmful results. Performance may vary in benchmarks without a fixed system prompt.

github

: 527

awesome-generative-ai

Awesome Generative AI is a curated list of modern Generative Artificial Intelligence projects and services. Generative AI technology creates original content like images, sounds, and texts using machine learning algorithms trained on large data sets. It can produce unique and realistic outputs such as photorealistic images, digital art, music, and writing. The repo covers a wide range of applications in art, entertainment, marketing, academia, and computer science.

github

: 5.1k

galah

Galah is an LLM-powered web honeypot designed to mimic various applications and dynamically respond to arbitrary HTTP requests. It supports multiple LLM providers, including OpenAI. Unlike traditional web honeypots, Galah dynamically crafts responses for any HTTP request, caching them to reduce repetitive generation and API costs. The honeypot's configuration is crucial, directing the LLM to produce responses in a specified JSON format. Note that Galah is a weekend project exploring LLM capabilities and not intended for production use, as it may be identifiable through network fingerprinting and non-standard responses.

github

: 285

ragdoll-studio

Ragdoll Studio is a platform offering web apps and libraries for interacting with Ragdoll, enabling users to go beyond fine-tuning and create flawless creative deliverables, rich multimedia, and engaging experiences. It provides various modes such as Story Mode for creating and chatting with characters, Vector Mode for producing vector art, Raster Mode for producing raster art, Video Mode for producing videos, Audio Mode for producing audio, and 3D Mode for producing 3D objects. Users can export their content in various formats and share their creations on the community site. The platform consists of a Ragdoll API and a front-end React application for seamless usage.

github

: 156

agent

Stately Agent is a library for building stateful, interactive agents using OpenAI's GPT-3 API. With Stately Agent, you can create agents that can remember past conversations, track state, and generate text that is both informative and engaging.

github

: 66

20 - OpenAI Gpts

Immersive Experience Designer

This GPT Helps to Brainstorm Immersive Experience Ideas Using the TISECT Method

gpt

: 200+

The Godfather GPT

Immersive character role-play from The Godfather series

gpt

: 30+

Able-Nature's Echo.

Guides users through beautiful landscapes with spatial audio for immersion.

gpt

: 10+

Science History Content Maker

My goal is to produce content that highlights significant historical events, technological advancements, and the anniversaries of notable figures in science and technology.

gpt

: 30+

Actor 'Scene' Writer

I'll help you craft scenes to produce for your demo reel or for scene study in acting class!

gpt

: 20+

Psychiatry Education Assistant

An academic assistant for psychiatrists, creating educational content and practice questions. (Not for use in clinical decision making, verify all information, as model may produce errors)

gpt

: 100+

Plang help & code generator

Help you understand what plang is and to generate plang code. ChatGPT still is not familiar with the language so it might produce wrong code. It should be simple to fix, for help come to our Discussion - https://github.com/orgs/PLangHQ/discussions or Discord -https://discord.gg/A8kYUymsDD

gpt

: 10+

Voice/Style/Tone AI Prompt Snippet Generator

Analyzes your writing and produces a prompt snippet you can use in any other prompt to guide AI in replicating your voice, style, and tone. Just provide the text in the prompt box or in a document (don't use a link or image). You don't need to write any additional prompt language with your text.

gpt

: 10K+