Best AI tools for< Join Video Conferences >
20 - AI tool Sites
Groupthink
Groupthink is an AI-powered meeting assistant that helps teams have more productive and efficient meetings. It offers features such as real-time meeting notes, task detection, meeting recaps, and the ability to introspect with LLM chat as the meeting happens. Groupthink also integrates with popular video conferencing platforms such as Zoom, Microsoft Teams, and Google Meet.
VideoAI
VideoAI is an AI-powered platform that revolutionizes video creation by leveraging cutting-edge AI technologies. It offers features such as AI video generation, video style transfer, high-quality outputs, and a user-friendly interface. Users can create a wide range of videos, apply custom styles, and ensure high-quality outputs with the help of advanced AI algorithms. VideoAI provides a seamless experience for both beginners and professionals, empowering users to unleash their video potential with ease and creativity.
Translate.Video
Translate.Video is an AI-powered multi-speaker video translation tool that offers features like voice cloning, text-to-speech, and speaker diarization. It allows users to translate videos to over 75 languages with just one click, making content creation and localization efficient and accessible. The tool also provides plugins for popular design software like Photoshop, Illustrator, and Figma, enabling users to accelerate creative translation. Translate.Video aims to simplify the process of captioning, subtitling, and dubbing, catering to influencers, enterprises, and content creators looking to reach a global audience.
Kapwing
Kapwing is a modern video creation platform that helps teams make great content faster. It offers a suite of AI-powered tools and templates to automate tedious tasks, streamline the video creation process, and ensure brand consistency. With Kapwing, teams can create, edit, and share videos in real-time, making it easy to collaborate and produce high-quality content.
VividHubs.ai
VividHubs.ai is an AI application that allows users to create romantic AI kissing videos using just two photos. The advanced AI technology of VividHubs.ai enables users to generate heartwarming videos of two people kissing, perfect for all relationships and occasions. With customizable scenarios and realistic AI-powered animations, VividHubs.ai provides a unique and special way to bring loved ones together through virtual AI kissings.
OpenGPT
OpenGPT is a community for Open AI enthusiasts. It provides access to various AI tools such as GPT Store, OpenGPTs, Open Chat, Open Draw, and Open Video. Users can submit their GPTs and earn credits for free access to advanced AI models like Google Gemini Pro, ChatGPT4, DALL.E.3, and Imagen2.
Samurai AI
Samurai AI is an AI-powered read-it-later app that provides users with concise and insightful summaries of articles, YouTube videos, and TED talks. The app is designed to save users time and help them get the most out of their reading and viewing experiences. Samurai AI is still in development, but it is expected to be released soon on iOS and Android devices.
Deciphr
Deciphr is an AI tool designed to automate podcast content workflow solutions. It can turn any audio, video, or text into unlimited B2B content in less than 8 minutes. Trusted by marketers across industries, Deciphr generates SEO articles, meeting minutes, webinar summaries, newsletters, and more with the help of AI technology. It offers a comprehensive solution for content creation and management, making the process efficient and seamless for users.
Thumbly
Thumbly is an AI-powered tool that helps content creators generate clickbait YouTube thumbnails, compelling titles, and analyze thumbnail effectiveness. With Thumbly, users can create custom thumbnails in seconds, boost their clicks and viewer engagement, and enhance their productivity. The tool is easy to use, requires no specific skills or expertise, and offers affordable solutions for content creators of all levels.
Klipify
Klipify is an AI-powered video creation tool that empowers content creators to effortlessly transform long videos into viral shorts. It offers professional-friendly tools for extracting clips from videos, an integrated calendar for auto-content posting, and an AI video translator for multilingual accessibility. Klipify aims to streamline the content creation process and enhance digital presence through innovative features and automation.
OSSA.AI
OSSA.AI is an AI tool designed to make influence accessible to everyone by simplifying short-form content creation. It is used by top content creators like Liza Ivanovna to save time, increase social media engagement, and create unique videos that resonate with their audiences. The platform, founded by social media powerhouse @Colewherld, offers script-to-video creation, content diversity, and ready-to-upload videos optimized for engagement.
Crayo
Crayo is an AI-powered tool that helps users create short videos quickly and easily. With Crayo, users can generate captions, effects, background music, and even voiceovers for their videos, all with just a few clicks. Crayo is perfect for users who want to create engaging and shareable videos for social media, marketing, or any other purpose.
PixVerse
PixVerse is an AI-powered video creation tool that allows users to easily create stunning videos using advanced AI technology. With features like Text to Video, Image to Video, and Character to Video, PixVerse enables users to bring their creative ideas to life in a matter of minutes. Whether you want to create magical creatures, daily life scenes, fantastical tales, animated animals, space explorations, cheeky memes, majestic scenery, or showcase disaster strikes, PixVerse has got you covered. The tool is designed to unleash creative potential for everyone, offering a seamless and intuitive user experience. Join the PixVerse community and start creating captivating videos today!
SeaArt AI
SeaArt AI is a free AI art generator that allows users to create unique and realistic images from text prompts. The platform offers a wide range of AI-powered tools, including AI face swap, AI filters, AI portrait, AI makeup, AI image upscaler, sketch to img, remove background, txt2img, and more. With SeaArt AI, users can easily create stunning images for personal or commercial use.
Replika
Replika is an AI companion application that provides emotional support and companionship to users. It uses sophisticated neural network machine learning algorithms to engage in conversations and mimic users' texting styles. Replika aims to create a safe and nurturing environment for users to express themselves and build meaningful relationships with their AI companions. The application has garnered a large user base and positive feedback for its ability to provide emotional support and companionship, especially during challenging times like the pandemic.
Latercut
Latercut is an AI-powered video editing tool designed for social media content creators. It allows users to create engaging and professional-looking videos in a matter of minutes. With features like faceless video editing, voiceover, gaming templates, storytelling tools, and quiz engagement options, Latercut simplifies the video creation process. Integrated with popular social media platforms like Instagram, TikTok, and YouTube, Latercut is the go-to tool for short creators looking to enhance their content. Join the community of creators and start producing high-quality videos effortlessly.
Otter.ai
Otter.ai is an AI meeting assistant application that provides users with the ability to record audio, write notes, automatically capture slides, and generate meeting summaries. Users can collaborate with teammates in real-time, add comments, highlight key points, and assign action items. Otter.ai helps companies and organizations to write notes and summarize meetings 30 times faster. The application also offers features like automated slide capture and automated meeting notes, which can be connected to Google or Microsoft calendar to join and record meetings on platforms like Zoom, Microsoft Teams, and Google Meet. Otter.ai aims to streamline meeting processes and enhance productivity by leveraging AI technology.
ToneShift
ToneShift is an AI-powered platform that allows users to clone voices, separate music, and join a community of voices. With ToneShift, users can transform recordings into versatile voices for various purposes, separate vocals and instrumentals from songs to create new remixes and mashups, and join a community to discover new tones, contribute their creations, and collaborate with others.
AI Tools Up
AI Tools Up is a website that provides a directory of AI tools and software. The site includes a variety of tools for different purposes, such as copywriting, productivity, design, developer tools, research, marketing, video editing, and SEO. AI Tools Up also includes a blog with articles on AI trends and best practices.
Swinghub
Swinghub is the ultimate non-monogamous social network app. It is packed with powerful features to help you get started, including the ability to see who's near you, who's looked at you, and what events are happening around you. Swinghub also has a number of safety features in place, including AI technology to prevent catfish profiles and AI moderation to ensure that nothing illegal or outside the community guidelines is being posted.
20 - Open Source AI Tools
gpupixel
GPUPixel is a real-time, high-performance image and video filter library written in C++11 and based on OpenGL/ES. It incorporates a built-in beauty face filter that achieves commercial-grade beauty effects. The library is extremely easy to compile and integrate with a small size, supporting platforms including iOS, Android, Mac, Windows, and Linux. GPUPixel provides various filters like skin smoothing, whitening, face slimming, big eyes, lipstick, and blush. It supports input formats like YUV420P, RGBA, JPEG, PNG, and output formats like RGBA and YUV420P. The library's performance on devices like iPhone and Android is optimized, with low CPU usage and fast processing times. GPUPixel's lib size is compact, making it suitable for mobile and desktop applications.
verl
veRL is a flexible and efficient reinforcement learning training framework designed for large language models (LLMs). It allows easy extension of diverse RL algorithms, seamless integration with existing LLM infrastructures, and flexible device mapping. The framework achieves state-of-the-art throughput and efficient actor model resharding with 3D-HybridEngine. It supports popular HuggingFace models and is suitable for users working with PyTorch FSDP, Megatron-LM, and vLLM backends.
ColossalAI
Colossal-AI is a deep learning system for large-scale parallel training. It provides a unified interface to scale sequential code of model training to distributed environments. Colossal-AI supports parallel training methods such as data, pipeline, tensor, and sequence parallelism and is integrated with heterogeneous training and zero redundancy optimizer.
LLaMA-Factory
LLaMA Factory is a unified framework for fine-tuning 100+ large language models (LLMs) with various methods, including pre-training, supervised fine-tuning, reward modeling, PPO, DPO and ORPO. It features integrated algorithms like GaLore, BAdam, DoRA, LongLoRA, LLaMA Pro, LoRA+, LoftQ and Agent tuning, as well as practical tricks like FlashAttention-2, Unsloth, RoPE scaling, NEFTune and rsLoRA. LLaMA Factory provides experiment monitors like LlamaBoard, TensorBoard, Wandb, MLflow, etc., and supports faster inference with OpenAI-style API, Gradio UI and CLI with vLLM worker. Compared to ChatGLM's P-Tuning, LLaMA Factory's LoRA tuning offers up to 3.7 times faster training speed with a better Rouge score on the advertising text generation task. By leveraging 4-bit quantization technique, LLaMA Factory's QLoRA further improves the efficiency regarding the GPU memory.
awesome-llms-fine-tuning
This repository is a curated collection of resources for fine-tuning Large Language Models (LLMs) like GPT, BERT, RoBERTa, and their variants. It includes tutorials, papers, tools, frameworks, and best practices to aid researchers, data scientists, and machine learning practitioners in adapting pre-trained models to specific tasks and domains. The resources cover a wide range of topics related to fine-tuning LLMs, providing valuable insights and guidelines to streamline the process and enhance model performance.
nlp-phd-global-equality
This repository aims to promote global equality for individuals pursuing a PhD in NLP by providing resources and information on various aspects of the academic journey. It covers topics such as applying for a PhD, getting research opportunities, preparing for the job market, and succeeding in academia. The repository is actively updated and includes contributions from experts in the field.
agents
The LiveKit Agent Framework is designed for building real-time, programmable participants that run on servers. Easily tap into LiveKit WebRTC sessions and process or generate audio, video, and data streams. The framework includes plugins for common workflows, such as voice activity detection and speech-to-text. Agents integrates seamlessly with LiveKit server, offloading job queuing and scheduling responsibilities to it. This eliminates the need for additional queuing infrastructure. Agent code developed on your local machine can scale to support thousands of concurrent sessions when deployed to a server in production.
rlhf_trojan_competition
This competition is organized by Javier Rando and Florian Tramèr from the ETH AI Center and SPY Lab at ETH Zurich. The goal of the competition is to create a method that can detect universal backdoors in aligned language models. A universal backdoor is a secret suffix that, when appended to any prompt, enables the model to answer harmful instructions. The competition provides a set of poisoned generation models, a reward model that measures how safe a completion is, and a dataset with prompts to run experiments. Participants are encouraged to use novel methods for red-teaming, automated approaches with low human oversight, and interpretability tools to find the trojans. The best submissions will be offered the chance to present their work at an event during the SaTML 2024 conference and may be invited to co-author a publication summarizing the competition results.
InternVL
InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM. It is a vision-language foundation model that can perform various tasks, including: **Visual Perception** - Linear-Probe Image Classification - Semantic Segmentation - Zero-Shot Image Classification - Multilingual Zero-Shot Image Classification - Zero-Shot Video Classification **Cross-Modal Retrieval** - English Zero-Shot Image-Text Retrieval - Chinese Zero-Shot Image-Text Retrieval - Multilingual Zero-Shot Image-Text Retrieval on XTD **Multimodal Dialogue** - Zero-Shot Image Captioning - Multimodal Benchmarks with Frozen LLM - Multimodal Benchmarks with Trainable LLM - Tiny LVLM InternVL has been shown to achieve state-of-the-art results on a variety of benchmarks. For example, on the MMMU image classification benchmark, InternVL achieves a top-1 accuracy of 51.6%, which is higher than GPT-4V and Gemini Pro. On the DocVQA question answering benchmark, InternVL achieves a score of 82.2%, which is also higher than GPT-4V and Gemini Pro. InternVL is open-sourced and available on Hugging Face. It can be used for a variety of applications, including image classification, object detection, semantic segmentation, image captioning, and question answering.
HAMi
HAMi is a Heterogeneous AI Computing Virtualization Middleware designed to manage Heterogeneous AI Computing Devices in a Kubernetes cluster. It allows for device sharing, device memory control, device type specification, and device UUID specification. The tool is easy to use and does not require modifying task YAML files. It includes features like hard limits on device memory, partial device allocation, streaming multiprocessor limits, and core usage specification. HAMi consists of components like a mutating webhook, scheduler extender, device plugins, and in-container virtualization techniques. It is suitable for scenarios requiring device sharing, specific device memory allocation, GPU balancing, low utilization optimization, and scenarios needing multiple small GPUs. The tool requires prerequisites like NVIDIA drivers, CUDA version, nvidia-docker, Kubernetes version, glibc version, and helm. Users can install, upgrade, and uninstall HAMi, submit tasks, and monitor cluster information. The tool's roadmap includes supporting additional AI computing devices, video codec processing, and Multi-Instance GPUs (MIG).
hold
This repository contains the code for HOLD, a method that jointly reconstructs hands and objects from monocular videos without assuming a pre-scanned object template. It can reconstruct 3D geometries of novel objects and hands, enabling template-free bimanual hand-object reconstruction, textureless object interaction with hands, and multiple objects interaction with hands. The repository provides instructions to download in-the-wild videos from HOLD, preprocess and train on custom videos, a volumetric rendering framework, a generalized codebase for single and two hand interaction with objects, a viewer to interact with predictions, and code to evaluate and compare with HOLD in HO3D. The repository also includes documentation for setup, training, evaluation, visualization, preprocessing custom sequences, and using HOLD on ARCTIC.
hackingBuddyGPT
hackingBuddyGPT is a framework for testing LLM-based agents for security testing. It aims to create common ground truth by creating common security testbeds and benchmarks, evaluating multiple LLMs and techniques against those, and publishing prototypes and findings as open-source/open-access reports. The initial focus is on evaluating the efficiency of LLMs for Linux privilege escalation attacks, but the framework is being expanded to evaluate the use of LLMs for web penetration-testing and web API testing. hackingBuddyGPT is released as open-source to level the playing field for blue teams against APTs that have access to more sophisticated resources.
data-juicer
Data-Juicer is a one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs. It is a systematic & reusable library of 80+ core OPs, 20+ reusable config recipes, and 20+ feature-rich dedicated toolkits, designed to function independently of specific LLM datasets and processing pipelines. Data-Juicer allows detailed data analyses with an automated report generation feature for a deeper understanding of your dataset. Coupled with multi-dimension automatic evaluation capabilities, it supports a timely feedback loop at multiple stages in the LLM development process. Data-Juicer offers tens of pre-built data processing recipes for pre-training, fine-tuning, en, zh, and more scenarios. It provides a speedy data processing pipeline requiring less memory and CPU usage, optimized for maximum productivity. Data-Juicer is flexible & extensible, accommodating most types of data formats and allowing flexible combinations of OPs. It is designed for simplicity, with comprehensive documentation, easy start guides and demo configs, and intuitive configuration with simple adding/removing OPs from existing configs.
unitxt
Unitxt is a customizable library for textual data preparation and evaluation tailored to generative language models. It natively integrates with common libraries like HuggingFace and LM-eval-harness and deconstructs processing flows into modular components, enabling easy customization and sharing between practitioners. These components encompass model-specific formats, task prompts, and many other comprehensive dataset processing definitions. The Unitxt-Catalog centralizes these components, fostering collaboration and exploration in modern textual data workflows. Beyond being a tool, Unitxt is a community-driven platform, empowering users to build, share, and advance their pipelines collaboratively.
cogai
The W3C Cognitive AI Community Group focuses on advancing Cognitive AI through collaboration on defining use cases, open source implementations, and application areas. The group aims to demonstrate the potential of Cognitive AI in various domains such as customer services, healthcare, cybersecurity, online learning, autonomous vehicles, manufacturing, and web search. They work on formal specifications for chunk data and rules, plausible knowledge notation, and neural networks for human-like AI. The group positions Cognitive AI as a combination of symbolic and statistical approaches inspired by human thought processes. They address research challenges including mimicry, emotional intelligence, natural language processing, and common sense reasoning. The long-term goal is to develop cognitive agents that are knowledgeable, creative, collaborative, empathic, and multilingual, capable of continual learning and self-awareness.
VLMEvalKit
VLMEvalKit is an open-source evaluation toolkit of large vision-language models (LVLMs). It enables one-command evaluation of LVLMs on various benchmarks, without the heavy workload of data preparation under multiple repositories. In VLMEvalKit, we adopt generation-based evaluation for all LVLMs, and provide the evaluation results obtained with both exact matching and LLM-based answer extraction.
nnstreamer
NNStreamer is a set of Gstreamer plugins that allow Gstreamer developers to adopt neural network models easily and efficiently and neural network developers to manage neural network pipelines and their filters easily and efficiently.
fluid
Fluid is an open source Kubernetes-native Distributed Dataset Orchestrator and Accelerator for data-intensive applications, such as big data and AI applications. It implements dataset abstraction, scalable cache runtime, automated data operations, elasticity and scheduling, and is runtime platform agnostic. Key concepts include Dataset and Runtime. Prerequisites include Kubernetes version > 1.16, Golang 1.18+, and Helm 3. The tool offers features like accelerating remote file accessing, machine learning, accelerating PVC, preloading dataset, and on-the-fly dataset cache scaling. Contributions are welcomed, and the project is under the Apache 2.0 license with a vendor-neutral approach.
scalene
Scalene is a high-performance CPU, GPU, and memory profiler for Python that provides detailed information and runs faster than many other profilers. It incorporates AI-powered proposed optimizations, allowing users to generate optimization suggestions by clicking on specific lines or regions of code. Scalene separates time spent in Python from native code, highlights hotspots, and identifies memory usage per line. It supports GPU profiling on NVIDIA-based systems and detects memory leaks. Users can generate reduced profiles, profile specific functions using decorators, and suspend/resume profiling for background processes. Scalene is available as a pip or conda package and works on various platforms. It offers features like profiling at the line level, memory trends, copy volume reporting, and leak detection.
20 - OpenAI Gpts
NO DUMB QUESTIONS
Join as the Third Chair guest with Destin Sandlin and Matt Whitman in a new podcast episode of 🧮𝗡𝗗𝗤✝️ - Game
Riddle Brawl
Join Riddle Brawl! Solve image riddles, unlock the passphrases, and compete to become the ultimate Champion. Are you up for the challenge? Let's begin! 🕵️♂️
Ai Doc
Join millions of students, researchers and professionals to instantly answer questions and understand research with Al
GPT Builder V2.4 (by GB)
Craft and refine GPTs. Join our Reddit community: https://www.reddit.com/r/GPTreview/
Spicey Clap Back (supported by GB)
Clever assistant for sharp 'clap backs', keeps details private. Join our Reddit community: https://www.reddit.com/r/GPTreview/
Tales from AIsteros
Interpret AI and technology news trough blend of fantasy and modern tech mixed with wit, join a game to sit on AI-ron Throne, checkout Medium publication V.03 2023-11-26
Prototyping GPT
John De Prototyper, expert in diverse product prototyping methods and industry insights. Join our Reddit community: https://www.reddit.com/r/GPTreview/
Bizarre Insults (supported by GB)
Generates quirky, non-profane insults with a secretive, grumpy tone. Join our Reddit community: https://www.reddit.com/r/GPTreview/.
Soulful Escapes: Travel and Discover
A Journey with a Friend: Ava is more than a guide; she's a companion who adds depth to your travels with her knowledge, and humor. Join Ava for a Souful Escape. Another Zen Experience by Dave Lalande
Abraham Lincoln
I am Abraham Lincoln, interpreting today's world with historical insight. Born from primary sources and multimodal, join me in a unique conversational journey.
Nature guard
Moim zadaniem jest promowanie świadomości i angażowanie użytkowników w konkretne działania, które przyczyniają się do ochrony środowiska naturalnego.
GCP-BigQueryGPT
BigQueryGPT aids in mastering BigQuery SQL with concise, practical examples. Tailored for all skill levels, it simplifies complex queries, offering clear explanations and optimized solutions for efficient learning and query troubleshooting.
SQL Code Helper
Assists with SQL programming by providing code examples, debugging tips, and best practices.
Web3 GPT
A Web3 expert providing in-depth knowledge on blockchain, cryptocurrencies, and more.