Best AI tools for< Upload Video >
20 - AI tool Sites

Supertranslate
Supertranslate is an AI-powered tool that allows users to automatically add English subtitles to videos in any language. It is powered by OpenAI's Whisper, the world's most accurate speech-to-text engine. The tool offers a fast and efficient way to generate subtitles, with features such as intuitive subtitle editing capabilities. Users can upload videos, generate subtitles, and download .srt/.vtt files with ease. Supertranslate ensures high-quality subtitle generation using OpenAI-Whisper technology.

Video Upscaler
Video Upscaler is an online video enhancement platform that utilizes advanced AI algorithms to automatically enhance the quality of videos in just seconds. It offers a simple and effective solution for users to upscale their videos to 4K resolution without any loss of detail or quality. The platform is user-friendly, affordable, and constantly updating its models to provide the highest quality results across various categories.

AI Story Generator for YouTube Shorts
The AI Story Generator for YouTube Shorts is an innovative tool that allows users to create personalized characters and motion for their videos. With this tool, users can easily generate engaging stories for their YouTube Shorts content. The tool provides consistent and custom characters and motion options, making it easy for users to create high-quality videos. Users have the option to upload their own character or use the tool's default options. The tool simplifies the video creation process and helps users bring their creative ideas to life.

Ankara AI
Ankara AI is an automated video narration and commentary application that utilizes AI technology to generate narrations for videos in over 25 different languages. Users can upload a video, select a voice, and provide a narration prompt to create professional narrations effortlessly. The application ensures user privacy by not retaining uploaded videos but securely storing anonymized prompts and script results to enhance narration quality. Ankara AI also offers a variety of voices to choose from, catering to different preferences and needs.

HappySRT
HappySRT is an AI-powered online tool that specializes in generating subtitles and editing SRT files for videos. It simplifies the process of creating accurate subtitles for YouTube videos by automatically generating them from uploaded files or YouTube links. Users can benefit from its seamless integration with YouTube, efficient workflow, and impeccable accuracy. HappySRT offers a range of pricing plans to cater to different user needs, from individuals to businesses and industries.

Filmora
Wondershare Filmora is a powerful and easy-to-use video editing software that is perfect for both beginners and experienced users. With a wide range of features and tools, Filmora makes it easy to create professional-looking videos with just a few clicks. Whether you're a filmmaker, a marketer, or just someone who wants to share your stories with the world, Filmora has everything you need to get started.

Speax AI
Speax AI is a cutting-edge AI technology company that specializes in revolutionizing dubbing processes. The platform offers instant and accurate translations in over 29 languages, allowing users to upload videos and receive perfectly dubbed content in minutes. With advanced AI capabilities, Speax AI ensures seamless voice synchronization and cultural accuracy at affordable rates, making it easy for users to connect with global audiences effortlessly. Additionally, the platform provides smart translation and paraphrasing algorithms for elevated content precision, as well as voice cloning features for multilingual voice replication. Speax AI also offers background music and sound effects to enhance video content, all while maintaining the highest quality standards at competitive prices.

AutoReels
AutoReels is an AI-powered tool designed to help users create engaging faceless videos for social media platforms like YouTube, TikTok, and Instagram. The tool leverages advanced AI algorithms to craft unique videos that stand out, ensuring privacy and creativity by never showing the user's face. AutoReels simplifies the video creation process, allowing users to generate, schedule, and upload videos on autopilot. With features like automated video creation, subtitles customization, and scheduling posts for social media, AutoReels is a time-saving solution for social media teams, small business owners, content creators, and influencers.

Lipsyncer.ai
Lipsyncer.ai is an AI application that allows users to create AI lip-sync videos automatically. Users can upload videos, images, or audio files to synchronize lip movements with any audio. The application saves time by eliminating the need for manual video editing, making it ideal for businesses, advertising agencies, YouTubers, influencers, and marketing agencies. Lipsyncer.ai offers high-quality lip-syncing, multilingual text-to-speech presenters, and a pay-as-you-go pricing model. The application is integrated into popular design programs and e-commerce systems, providing digital efficiency to users' workflows.

BiteSyzed
BiteSyzed is an AI-powered video repurposing tool that transforms long videos into viral clips 10 times faster. The platform uses cutting-edge AI technology to automatically analyze and edit raw footage, extract captivating moments, and create cohesive video clips. Users can upload videos from YouTube, export clips in different aspect ratios, and share them with their audience effortlessly. Bitesyzed simplifies the video editing process by automating the creation of viral clips with AI-generated descriptions and hashtags, saving time and resources. The application is designed to help users create more engaging video content with minimal effort, catering to a wide range of users from content creators to marketers.

SendShort
SendShort is an AI-powered online tool designed to help users create viral short videos effortlessly. With features like AI-generated clips, faceless videos, auto B-roll, subtitles, voiceovers, and more, SendShort streamlines the process of turning long videos into engaging short clips suitable for various social media platforms. It offers a user-friendly interface that allows users to upload videos or paste YouTube links, and the AI takes care of the rest, from editing to scheduling. SendShort aims to empower creators, marketers, and businesses to enhance their video content creation and reach a wider audience with minimal effort and maximum impact.

Sound Effect Generator
The Sound Effect Generator is an AI-powered tool that allows users to create custom sound effects instantly. It uses cutting-edge AI Text to Sound Effect technology to transform ideas into high-quality sound effects. Perfect for creators, developers, and sound designers, the generator offers a free sound effect library with thousands of AI-generated sound effects. Users can fine-tune duration and audio quality, support multiple languages, and even upload videos to add AI-generated sound effects. The tool combines professional sound design with AI technology to provide a unique and creative audio experience.

OSSA.AI
OSSA.AI is an AI tool designed to make influence accessible to everyone by simplifying short-form content creation. It is used by top content creators like Liza Ivanovna to save time, increase social media engagement, and create unique videos that resonate with their audiences. The platform, founded by social media powerhouse @Colewherld, offers script-to-video creation, content diversity, and ready-to-upload videos optimized for engagement.

Taption
Taption is an AI-powered platform that offers automatic transcription, translation, and subtitle generation services for audio and video content in over 40 languages. It provides embedded bilingual subtitles, labeled transcripts, and translations. Users can upload videos, transcribe from YouTube, edit transcripts, analyze video content, translate subtitles, and export files in various formats. Taption's AI analysis feature helps in summarizing videos, generating topics, creating YouTube chapters, and more. The platform also includes a collaborative team feature and an advanced editing platform for precise video editing and synchronization.

Bluedot
Bluedot is a productivity application that offers features such as screen recordings, meetings, collections, and automation. Users can install a Chrome extension to enhance their experience. The application allows users to upload videos and perform various tasks efficiently. By signing up, users agree to the Terms of Service and Privacy Policy.

AI Video Narration
The AI Video Narration tool is an online application that allows users to easily add narration to their videos. Users can upload their video files, customize the narration style in multiple languages, and download the narrated videos. The tool supports popular video formats like MP4, WEBM, and MOV, with a maximum file size limit of 100MB. It offers a quick and efficient way to enhance videos with professional narration, making it ideal for content creators, marketers, educators, and anyone looking to improve their video content.

Video Face Swap
Video Face Swap is a free online AI tool that allows users to effortlessly swap faces in videos using cutting-edge artificial intelligence algorithms. Users can upload a video with a face and a photo with a target face to start the face swap process. The tool supports multiple face swaps, GIF face swaps, and batch face swaps, enabling users to create entertaining and creative content. With features like fast and accurate face swapping, enhanced creativity, 100% free service, support for various formats, and user-friendly interface, Video Face Swap provides a secure and private platform for users to experiment with face swapping in videos.

Slick
Slick is an AI-powered video editing tool that helps you create and edit viral short videos. With Slick, you can add trendy captions, cut silences and umms, snap b-rolls, add sound effects, use magic zooms, and more. Slick supports all aspect ratios and up to 4k resolution. You can also add custom background music and sound effects, and remove filler words in one click. Slick is available in over 30 languages, including English, French, Spanish, German, Hindi, and more. New caption styles are added every week, and all captions are 100% customizable. With Slick, you can trim and extend clips, and adjust clip duration. All of these features are available without lifting a finger, thanks to Slick's AI technology.

Targum
Targum is a super fast AI-based video translation service that allows users to translate any video from any language to any language in a matter of seconds. Users can paste a link to a video from Twitter, TikTok, Instagram, or Reddit, or they can upload a video file or drag and drop it onto the Targum website. Targum also allows users to record a video from a mobile device. Once a video has been uploaded, Targum will automatically translate it to the user's desired language. Targum is a valuable tool for anyone who needs to translate videos for personal or professional use.

X-Me
X-Me is an AI tool that allows users to generate personalized AI avatar videos effortlessly. Users can create AI avatars that mimic famous personalities like AI Trump, AI Musk, AI Johnson, Al Kard, and AI Gaga by inputting text in multiple languages. The tool supports 147 languages, offers zero customization fees, and requires zero training. With X-Me, users can upload a selfie video, enter text, and generate AI avatar videos that reflect their face, voice, and story. The platform is known for its efficient, fast, and user-friendly approach to creating realistic digital human videos without the need for complex model training processes.
20 - Open Source AI Tools

videodb-python
VideoDB Python SDK allows you to interact with the VideoDB serverless database. Manage videos as intelligent data, not files. It's scalable, cost-efficient & optimized for AI applications and LLM integration. The SDK provides functionalities for uploading videos, viewing videos, streaming specific sections of videos, searching inside a video, searching inside multiple videos in a collection, adding subtitles to a video, generating thumbnails, and more. It also offers features like indexing videos by spoken words, semantic indexing, and future indexing options for scenes, faces, and specific domains like sports. The SDK aims to simplify video management and enhance AI applications with video data.

PromptClip
PromptClip is a tool that allows developers to create video clips using LLM prompts. Users can upload videos from various sources, prompt the video in natural language, use different LLM models, instantly watch the generated clips, finetune the clips, and add music or image overlays. The tool provides a seamless way to extract specific moments from videos based on user queries, making video editing and content creation more efficient and intuitive.

MiKaPo
MiKaPo is a web-based tool that allows users to pose MMD models in real-time using video input. It utilizes technologies such as Mediapipe for 3D key points detection, Babylon.js for 3D scene rendering, babylon-mmd for MMD model viewing, and Vite+React for the web framework. Users can upload videos and images, select different environments, and choose models for posing. MiKaPo also supports camera input and Ollama (electron version). The tool is open to feature requests and pull requests, with ongoing development to add VMD export functionality.

vnve
VNVE is a Visual Novel Video Editor that allows users to create visual novel videos in their browser with AI-powered rapid creation. It offers a low-cost production solution for converting textual content into videos, creating interactive videos for gaming experiences, and making video teasers for novels and short video dramas. The tool is a pure front-end Typescript implementation powered by PixiJS + WebCodecs, and users can also create videos programmatically using the npm package. VNVE is tailored specifically for visual novels, focusing on text content and simplifying the video creation process for users.

LLM_Notebooks
LLM_Notebooks is a repository supporting The Machine Learning Engineer YouTube channel. It contains materials related to various topics such as Generative AI, MLOps, ML projects, Azure Projects, Google VertexAi, ML Tricks, and more. The repository includes notebooks and code in Python and C#, with a focus on Python. The videos on the channel cover a wide range of topics in English and Spanish, organized into playlists based on general themes. The repository links are provided in the video descriptions for easy access. The creator uploads videos regularly and encourages viewers to subscribe, like, and leave constructive comments. The repository serves as a valuable resource for learning and exploring machine learning concepts and tools.

AI-LLM-ML-CS-Quant-Readings
AI-LLM-ML-CS-Quant-Readings is a repository dedicated to taking notes on Artificial Intelligence, Large Language Models, Machine Learning, Computer Science, and Quantitative Finance. It contains a wide range of resources, including theory, applications, conferences, essentials, foundations, system design, computer systems, finance, and job interview questions. The repository covers topics such as AI systems, multi-agent systems, deep learning theory and applications, system design interviews, C++ design patterns, high-frequency finance, algorithmic trading, stochastic volatility modeling, and quantitative investing. It is a comprehensive collection of materials for individuals interested in these fields.

AI-LLM-ML-CS-Quant-Overview
AI-LLM-ML-CS-Quant-Overview is a repository providing overview notes on AI, Large Language Models (LLM), Machine Learning (ML), Computer Science (CS), and Quantitative Finance. It covers various topics such as LangGraph & Cursor AI, DeepSeek, MoE (Mixture of Experts), NVIDIA GTC, LLM Essentials, System Design, Computer Systems, Big Data and AI in Finance, Econometrics and Statistics Conference, C++ Design Patterns and Derivatives Pricing, High-Frequency Finance, Machine Learning for Algorithmic Trading, Stochastic Volatility Modeling, Quant Job Interview Questions, Distributed Systems, Language Models, Designing Machine Learning Systems, Designing Data-Intensive Applications (DDIA), Distributed Machine Learning, and The Elements of Quantitative Investing.

llmops-duke-aipi
LLMOps Duke AIPI is a course focused on operationalizing Large Language Models, teaching methodologies for developing applications using software development best practices with large language models. The course covers various topics such as generative AI concepts, setting up development environments, interacting with large language models, using local large language models, applied solutions with LLMs, extensibility using plugins and functions, retrieval augmented generation, introduction to Python web frameworks for APIs, DevOps principles, deploying machine learning APIs, LLM platforms, and final presentations. Students will learn to build, share, and present portfolios using Github, YouTube, and Linkedin, as well as develop non-linear life-long learning skills. Prerequisites include basic Linux and programming skills, with coursework available in Python or Rust. Additional resources and references are provided for further learning and exploration.

Awesome-RL-based-LLM-Reasoning
This repository is dedicated to enhancing Language Model (LLM) reasoning with reinforcement learning (RL). It includes a collection of the latest papers, slides, and materials related to RL-based LLM reasoning, aiming to facilitate quick learning and understanding in this field. Starring this repository allows users to stay updated and engaged with the forefront of RL-based LLM reasoning.

Whisper-TikTok
Discover Whisper-TikTok, an innovative AI-powered tool that leverages the prowess of Edge TTS, OpenAI-Whisper, and FFMPEG to craft captivating TikTok videos. Whisper-TikTok effortlessly generates accurate transcriptions from audio files and integrates Microsoft Edge Cloud Text-to-Speech API for vibrant voiceovers. The program orchestrates the synthesis of videos using a structured JSON dataset, generating mesmerizing TikTok content in minutes.

FunClip
FunClip is an open-source, locally deployable automated video editing tool that utilizes the FunASR Paraformer series models from Alibaba DAMO Academy for speech recognition in videos. Users can select text segments or speakers from the recognition results and click the clip button to obtain the corresponding video segments. FunClip integrates advanced features such as the Paraformer-Large model for accurate Chinese ASR, SeACo-Paraformer for customized hotword recognition, CAM++ speaker recognition model, Gradio interactive interface for easy usage, support for multiple free edits with automatic SRT subtitles generation, and segment-specific SRT subtitles.

FunClip
FunClip is an open-source, locally deployed automated video clipping tool that leverages Alibaba TONGYI speech lab's FunASR Paraformer series models for speech recognition on videos. Users can select text segments or speakers from recognition results to obtain corresponding video clips. It integrates industrial-grade models for accurate predictions and offers hotword customization and speaker recognition features. The tool is user-friendly with Gradio interaction, supporting multi-segment clipping and providing full video and target segment subtitles. FunClip is suitable for users looking to automate video clipping tasks with advanced AI capabilities.

KrillinAI
KrillinAI is a video subtitle translation and dubbing tool based on AI large models, featuring speech recognition, intelligent sentence segmentation, professional translation, and one-click deployment of the entire process. It provides a one-stop workflow from video downloading to the final product, empowering cross-language cultural communication with AI. The tool supports multiple languages for input and translation, integrates features like automatic dependency installation, video downloading from platforms like YouTube and Bilibili, high-speed subtitle recognition, intelligent subtitle segmentation and alignment, custom vocabulary replacement, professional-level translation engine, and diverse external service selection for speech and large model services.

Dataset
DL3DV-10K is a large-scale dataset of real-world scene-level videos with annotations, covering diverse scenes with different levels of reflection, transparency, and lighting. It includes 10,510 multi-view scenes with 51.2 million frames at 4k resolution, and offers benchmark videos for novel view synthesis (NVS) methods. The dataset is designed to facilitate research in deep learning-based 3D vision and provides valuable insights for future research in NVS and 3D representation learning.

langfuse-docs
Langfuse Docs is a repository for langfuse.com, built on Nextra. It provides guidelines for contributing to the documentation using GitHub Codespaces and local development setup. The repository includes Python cookbooks in Jupyter notebooks format, which are converted to markdown for rendering on the site. It also covers media management for images, videos, and gifs. The stack includes Nextra, Next.js, shadcn/ui, and Tailwind CSS. Additionally, there is a bundle analysis feature to analyze the production build bundle size using @next/bundle-analyzer.

swift-chat
SwiftChat is a fast and responsive AI chat application developed with React Native and powered by Amazon Bedrock. It offers real-time streaming conversations, AI image generation, multimodal support, conversation history management, and cross-platform compatibility across Android, iOS, and macOS. The app supports multiple AI models like Amazon Bedrock, Ollama, DeepSeek, and OpenAI, and features a customizable system prompt assistant. With a minimalist design philosophy and robust privacy protection, SwiftChat delivers a seamless chat experience with various features like rich Markdown support, comprehensive multimodal analysis, creative image suite, and quick access tools. The app prioritizes speed in launch, request, render, and storage, ensuring a fast and efficient user experience. SwiftChat also emphasizes app privacy and security by encrypting API key storage, minimal permission requirements, local-only data storage, and a privacy-first approach.

aiograpi
aiograpi is an asynchronous Instagram API wrapper for Python that allows users to interact with various Instagram functionalities such as retrieving public data of users, posts, stories, followers, and following users, managing proxy servers and challenge resolver, login by different methods, managing messages and threads, downloading and uploading various types of content, working with insights, likes, comments, and more. It is designed for testing or research purposes rather than production business use.

SheetCopilot
SheetCopilot is an assistant agent that manipulates spreadsheets by following user commands. It leverages Large Language Models (LLMs) to interact with spreadsheets like a human expert, enabling non-expert users to complete tasks on complex software such as Google Sheets and Excel via a language interface. The tool observes spreadsheet states, polishes generated solutions based on external action documents and error feedback, and aims to improve success rate and efficiency. SheetCopilot offers a dataset with diverse task categories and operations, supporting operations like entry & manipulation, management, formatting, charts, and pivot tables. Users can interact with SheetCopilot in Excel or Google Sheets, executing tasks like calculating revenue, creating pivot tables, and plotting charts. The tool's evaluation includes performance comparisons with leading LLMs and VBA-based methods on specific datasets, showcasing its capabilities in controlling various aspects of a spreadsheet.

vigenair
ViGenAiR is a tool that harnesses the power of Generative AI models on Google Cloud Platform to automatically transform long-form Video Ads into shorter variants, targeting different audiences. It generates video, image, and text assets for Demand Gen and YouTube video campaigns. Users can steer the model towards generating desired videos, conduct A/B testing, and benefit from various creative features. The tool offers benefits like diverse inventory, compelling video ads, creative excellence, user control, and performance insights. ViGenAiR works by analyzing video content, splitting it into coherent segments, and generating variants following Google's best practices for effective ads.
20 - OpenAI Gpts

Merch on Demand Upload Assistant
Structures Amazon Merch on Demand listings with SEO-optimized, focusing on design appeal and marketability. Upload design to begin.

Academic Hook Test
Upload your manuscript introduction. Get 'Reviewer 2' grade feedback in return.๐

11:11 Eternal Wisdom Portal 11:11
Upload a picture of your hand, your aura, or your handwriting. I'll draw the tarot cards (you can upload a photo as well) and read your destiny through Tarot, Palmistry, Runes, Numerology, Graphology, Aura Reading, and more.

ใใชใใฎๆ็ใๆก็นใใพใใใ๐ณWe grade your food
Upload a photo of your food!ใใชใใฎๆ็ใAIใๆก็น

Birth Chart Analysis & Astrologist
Upload your birth chart and get a personalized astrology. Discover your life path, numerology, and more.

RedlineGPT
Upload a jpg/png (<5MB, <2000px) for architectural drawing feedback. Note: This tool is not adept at calculations, counting, and can't guarantee code compliance. Consider IP issues before uploading.

Home Inspector
Upload a picture of your home wall, floor, window, driveway, roof, HVAC, and get an instant opinion.

Art Style Explorer ๐๏ธ
Upload or paste an image to gain insights and generate new images inspired by its style

Process Map Optimizer
Upload your process map and I will analyse and suggest improvements

WALL COLOR GPT
Upload a room image, get a custom wall color palette and visual representation.