Best AI tools for< Capture Screens >
20 - AI tool Sites

ScreenSnapAI
ScreenSnapAI is an AI-powered screenshot manager for macOS that helps users capture, search, and organize their screenshots effortlessly. It uses GPT-4 to automatically generate smart screenshot names, descriptions, and keywords, making it easy to find and organize screenshots. ScreenSnapAI also features smart folders for automatic filtering, lightning-fast full-text search, and the ability to import images from other sources (pro version only).

goPDF
goPDF is a comprehensive PDF management platform that offers a suite of tools for creating, converting, capturing, and interacting with PDFs. With its advanced features and user-friendly API, goPDF simplifies the handling of PDF documents for various purposes, including collaborative work, quick assistance, and engaging training. The platform's AI capabilities enhance the user experience by providing interactive reading, content summarization, and chatbot functionality.

Dubble
Dubble is a free tool that helps you create step-by-step guides, tutorials, and onboarding resources for your processes. It uses AI to watch how you work and translate your actions into written instructions and screenshots. This makes it easy to document your processes without having to write anything yourself.

Solvr
Solvr is an AI-powered Chrome extension that allows users to solve questions effortlessly without leaving the webpage. It offers two powerful modes for capturing and getting instant answers, as well as the ability to extract and solve content from PDFs. With sleek and structured results, Solvr provides visually appealing and organized information at a glance, making problem-solving swift and simple.

Snipo
Snipo is an AI-powered tool designed to enhance the note-taking experience while watching educational videos online. It seamlessly integrates with popular platforms like YouTube, Udemy, Coursera, Skillshare, and LinkedIn Learning, allowing users to take timestamped notes, capture screenshots, generate transcripts, and create AI flashcards effortlessly. With features like custom decks, export options, keyboard shortcuts, and support for playlists and courses syncing, Snipo aims to streamline the learning process for over 30,000 users worldwide. The tool is praised for its ease of use, efficiency, and compatibility with Notion, making it a valuable asset for students and professionals seeking to optimize their video learning experience.

Somebay
Somebay is a website offering a collection of simple yet powerful Mac applications designed to enhance user experience. The apps are created by a team from heartbeat and are tailored to provide useful functionalities for Mac users. Somebay includes tools like Gep., a smart AI-powered assistant for various tasks, Prevely, an image viewer with a color picker, and Docflipper, a Cmd+Tab switcher with bookmarks. These apps aim to streamline tasks such as brainstorming, image viewing, and bookmarking favorite links, apps, folders, and files on Mac devices.

AIEasy.life
AIEasy.life is an AI tools platform designed to empower daily life with intelligent AI solutions. The platform offers a curated directory of various AI tools across categories such as text & writing, image, video, voice, business, marketing, chatbot, design & art, life assistant, 3D, education, productivity, and other development learning AI applications. Users can access tools like MyMathSolver.ai for solving complex math problems, Vozo AI for video content creation, Grammarly for enhancing writing, and many more. AIEasy.life aims to simplify tasks and enhance productivity through the use of advanced AI technology.

Screen Story
Screen Story is a Mac screen recorder tool designed to capture and record screens with ease. It allows users to create high-quality videos, demos, GIFs, and tutorials without the need for video editing skills. The application offers features like automatic zoom, smooth cursor movement, offline recording, webcam and microphone support, and a simple editing interface. Screen Story is trusted by entrepreneurs, designers, marketers, and developers for its efficiency and user-friendly design patterns.

Snapsked
Snapsked is an AI-powered tool that helps you turn screenshots and photos into actionable items. With Snapsked, you can easily capture action items from emails, chats, web pages, and notes. Snapsked's AI will then extract the relevant action items and details and add them to your calendar. This makes it easy to keep track of all your action items in one place.

Unlost
Unlost is a memory recall tool that allows users to instantly retrieve information with zero effort. It helps users never lose track or forget any details by recording and intelligently understanding their screen layout and content. Unlost operates privately and offline, respecting user space and copyright law. The tool offers quick access, powerful filtering, and familiar keyboard shortcuts for effortless searching. Users can search meeting transcripts, copy text from screenshots, and exclude capturing specific apps or websites. Unlost aims to delegate memory and enhance user capacity effortlessly.

MacCopilot
MacCopilot is an ultimate copilot app for macOS integrated with advanced AI models like GPT-4, ClaudeAI, and Google Gemini. It allows users to capture any part of their screen, chat with AI for insights, and export content as Markdown. The application is designed for macOS 12.0 and later, offering a revolutionary way to interact with screen content.

Alice
Alice is a fast, accurate AI transcription and recorder application that prioritizes privacy and cost-effectiveness. It allows users to securely record audio and video, transcribe in multiple languages and accents with high accuracy, and offers real-time text streaming. Alice integrates with various tools, supports webhooks, and is trusted by journalists for its reliability and security features. The application is designed to be user-friendly, efficient, and suitable for a wide range of tasks, making it a valuable tool for journalists, freelancers, and anyone in need of transcription services.

CrystalSound
CrystalSound is an AI noise-canceling app and screen recorder that offers crystal-clear audio, seamless screen recording, and data-driven insights for more productive meetings. It features bi-directional noise cancellation, microphone volume booster, acoustic echo suppression, screen and bidirectional audio capture, and smart minutes of recordings. With cutting-edge AI technology, CrystalSound helps users stay focused, reduce distractions, and enhance meeting performance. The app integrates seamlessly with various conference apps, simplifying workflows and amplifying meeting experiences.

MaestroQA
MaestroQA is a comprehensive Call Center Quality Assurance Software that offers a range of products and features to enhance QA processes. It provides customizable report builders, scorecard builders, calibration workflows, coaching workflows, automated QA workflows, screen capture, accurate transcriptions, root cause analysis, performance dashboards, AI grading assist, analytics, and integrations with various platforms. The platform caters to industries like eCommerce, financial services, gambling, insurance, B2B software, social media, and media, offering solutions for QA managers, team leaders, and executives.

Wizardshot
Wizardshot is an AI-powered web application and Chrome extension that enables users to effortlessly create step-by-step tutorials by capturing their screen. It offers a seamless integration into your workflow, allowing you to save time, increase productivity, and share knowledge with ease. With features like knowledge base integration, export to PDF & DOC, privacy settings, and analytics, Wizardshot serves as your magic wand for instant tutorial creation. The application prioritizes data security through industry-standard encryption methods and access controls, ensuring the safety of your information.

ScreenApp
ScreenApp is an AI-powered tool that serves as a notetaker, transcription tool, summarizer, and recorder for audio and video content. It offers a wide range of features to help users efficiently manage their recordings and meetings. ScreenApp is designed to capture and convert recordings into actionable insights, making it a valuable assistant for various tasks and industries.

Humane Ai Pin
Humane Ai Pin is an intelligent, voice-powered wearable companion that keeps you connected and in the moment with just a touch. It provides instant AI-powered knowledge, personalized precision assistance, and unlimited AI queries. The device features Trust Light for scanning and listening, understands user preferences over time, and offers live translation across languages. Users can capture moments, stay present, and enjoy media storage. Additionally, it offers data, calling, and texting services, and acts as a personal DJ. Ai Pin operates on the CosmOS operating system, which seamlessly integrates digital services with the user's environment.

Humane Ai Pin
Humane Ai Pin is an intelligent, voice-powered wearable companion that provides instant AI-powered knowledge and personalized assistance. It allows users to stay connected and in the moment with features like unlimited AI queries, personalized precision assistance, and live translation across languages. The device is designed to help users capture moments, stay present, and find their vibe on the go. With a focus on simplicity and intuitive user experience, Ai Pin aims to enhance the quality of life by seamlessly integrating technology into daily interactions.

Rewatch
Rewatch is an AI-powered meeting assistant and video hub that helps users capture meetings, create summaries, transcriptions, and action items. It centralizes all meeting videos, notes, and discussions in one place, replacing repetitive in-person meetings with asynchronous collaborative series. Rewatch also offers features like screen recording, integrations with other tools, and conversation intelligence to empower organizations with actionable insights. Trusted by productive businesses, Rewatch aims to optimize necessary meetings, eliminate useless ones, and enhance cross-functional collaboration in a unified hub.

Paradox
Paradox is a conversational hiring software that automates repetitive tasks and improves the candidate experience. It offers a range of features such as conversational ATS, career sites, CX, capture, scheduling, events, and assessments. Paradox integrates with leading HCM systems like Workday, SAP SuccessFactors, and Indeed. It is used by various industries including retail, restaurant, healthcare, logistics, financial services, and hospitality.
20 - Open Source AI Tools

aiavatarkit
AIAvatarKit is a tool for building AI-based conversational avatars quickly. It supports various platforms like VRChat and cluster, along with real-world devices. The tool is extensible, allowing unlimited capabilities based on user needs. It requires VOICEVOX API, Google or Azure Speech Services API keys, and Python 3.10. Users can start conversations out of the box and enjoy seamless interactions with the avatars.

ScribbleArchitect
ScribbleArchitect is a GUI tool designed for generating images from simple brush strokes or Bezier curves in real-time. It is primarily intended for use in architecture and sketching in the early stages of a project. The tool utilizes Stable Diffusion and ControlNet as AI backbone for the generative process, with IP Adapter support and a library of predefined styles. Users can transfer specific styles to their line work, upscale images for high resolution export, and utilize a ControlNet upscaler. The tool also features a screen capture function for working with external tools like Adobe Illustrator or Inkscape.

browser-tools-mcp
BrowserTools MCP is a powerful browser monitoring and interaction tool that enables AI-powered applications to capture and analyze browser data through a Chrome extension. It consists of a Chrome Extension for capturing screenshots, console logs, network activity, and DOM elements, a Node Server for communication between the extension and an MCP server, and an MCP Server that provides standardized tools for AI clients to interact with the browser. All logs are stored locally on the user's machine. The tool is compatible with various MCP clients like Cursor, Cline, and Zed, allowing users to monitor console output, capture network traffic, take screenshots, analyze elements, and wipe logs stored in the MCP server.

efficient-recorder
Efficient Recorder is a battery-life friendly tool designed to stream video, screen, mic, and system audio to any S3-compatible cloud storage service. It captures audio, screenshots, and webcam photos at configurable fps, utilizing low-energy volume detection for audio recording. The tool streams data to a configurable S3 endpoint or a custom server using MinIO. It aims to be storage and battery efficient, providing queued upload processing and minimal system resource overhead. The tool requires SoX for audio recording and webcam capture tools for operation. Users can specify various command line options for customization, such as enabling screenshot and webcam capture with specific intervals and image quality settings.

CogAgent
CogAgent is an advanced intelligent agent model designed for automating operations on graphical interfaces across various computing devices. It supports platforms like Windows, macOS, and Android, enabling users to issue commands, capture device screenshots, and perform automated operations. The model requires a minimum of 29GB of GPU memory for inference at BF16 precision and offers capabilities for executing tasks like sending Christmas greetings and sending emails. Users can interact with the model by providing task descriptions, platform specifications, and desired output formats.

morphic
Morphic is an AI-powered answer engine with a generative UI. It utilizes a stack of Next.js, Vercel AI SDK, OpenAI, Tavily AI, shadcn/ui, Radix UI, and Tailwind CSS. To get started, fork and clone the repo, install dependencies, fill out secrets in the .env.local file, and run the app locally using 'bun dev'. You can also deploy your own live version of Morphic with Vercel. Verified models that can be specified to writers include Groq, LLaMA3 8b, and LLaMA3 70b.

npcsh
`npcsh` is a python-based command-line tool designed to integrate Large Language Models (LLMs) and Agents into one's daily workflow by making them available and easily configurable through the command line shell. It leverages the power of LLMs to understand natural language commands and questions, execute tasks, answer queries, and provide relevant information from local files and the web. Users can also build their own tools and call them like macros from the shell. `npcsh` allows users to take advantage of agents (i.e. NPCs) through a managed system, tailoring NPCs to specific tasks and workflows. The tool is extensible with Python, providing useful functions for interacting with LLMs, including explicit coverage for popular providers like ollama, anthropic, openai, gemini, deepseek, and openai-like providers. Users can set up a flask server to expose their NPC team for use as a backend service, run SQL models defined in their project, execute assembly lines, and verify the integrity of their NPC team's interrelations. Users can execute bash commands directly, use favorite command-line tools like VIM, Emacs, ipython, sqlite3, git, pipe the output of these commands to LLMs, or pass LLM results to bash commands.

crawl4ai
Crawl4AI is a powerful and free web crawling service that extracts valuable data from websites and provides LLM-friendly output formats. It supports crawling multiple URLs simultaneously, replaces media tags with ALT, and is completely free to use and open-source. Users can integrate Crawl4AI into Python projects as a library or run it as a standalone local server. The tool allows users to crawl and extract data from specified URLs using different providers and models, with options to include raw HTML content, force fresh crawls, and extract meaningful text blocks. Configuration settings can be adjusted in the `crawler/config.py` file to customize providers, API keys, chunk processing, and word thresholds. Contributions to Crawl4AI are welcome from the open-source community to enhance its value for AI enthusiasts and developers.

TypeGPT
TypeGPT is a Python application that enables users to interact with ChatGPT or Google Gemini from any text field in their operating system using keyboard shortcuts. It provides global accessibility, keyboard shortcuts for communication, and clipboard integration for larger text inputs. Users need to have Python 3.x installed along with specific packages and API keys from OpenAI for ChatGPT access. The tool allows users to run the program normally or in the background, manage processes, and stop the program. Users can use keyboard shortcuts like `/ask`, `/see`, `/stop`, `/chatgpt`, `/gemini`, `/check`, and `Shift + Cmd + Enter` to interact with the application in any text field. Customization options are available by modifying files like `keys.txt` and `system_prompt.txt`. Contributions are welcome, and future plans include adding support for other APIs and a user-friendly GUI.

LLavaImageTagger
LLMImageIndexer is an intelligent image processing and indexing tool that leverages local AI to generate comprehensive metadata for your image collection. It uses advanced language models to analyze images and generate captions and keyword metadata. The tool offers features like intelligent image analysis, metadata enhancement, local processing, multi-format support, user-friendly GUI, GPU acceleration, cross-platform support, stop and start capability, and keyword post-processing. It operates directly on image file metadata, allowing users to manage files, add new files, and run the tool multiple times without reprocessing previously keyworded files. Installation instructions are provided for Windows, macOS, and Linux platforms, along with usage guidelines and configuration options.

nebula
Nebula is an advanced, AI-powered penetration testing tool designed for cybersecurity professionals, ethical hackers, and developers. It integrates state-of-the-art AI models into the command-line interface, automating vulnerability assessments and enhancing security workflows with real-time insights and automated note-taking. Nebula revolutionizes penetration testing by providing AI-driven insights, enhanced tool integration, AI-assisted note-taking, and manual note-taking features. It also supports any tool that can be invoked from the CLI, making it a versatile and powerful tool for cybersecurity tasks.

ImageIndexer
LLMII is a tool that uses a local AI model to label metadata and index images without relying on cloud services or remote APIs. It runs a visual language model on your computer to generate captions and keywords for images, enhancing their metadata for indexing, searching, and organization. The tool can be run multiple times on the same image files, allowing for adding new data, regenerating data, and discovering files with issues. It supports various image formats, offers a user-friendly GUI, and can utilize GPU acceleration for faster processing. LLMII requires Python 3.8 or higher and operates directly on image file metadata fields like MWG:Keyword and XMP:Identifier.

screenpipe
24/7 Screen & Audio Capture Library to build personalized AI powered by what you've seen, said, or heard. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust. We are shipping daily, make suggestions, post bugs, give feedback. Building a reliable stream of audio and screenshot data, simplifying life for developers by solving non-trivial problems. Multiple installation options available. Experimental tool with various integrations and features for screen and audio capture, OCR, STT, and more. Open source project focused on enabling tooling & infrastructure for a wide range of applications.

vector_companion
Vector Companion is an AI tool designed to act as a virtual companion on your computer. It consists of two personalities, Axiom and Axis, who can engage in conversations based on what is happening on the screen. The tool can transcribe audio output and user microphone input, take screenshots, and read text via OCR to create lifelike interactions. It requires specific prerequisites to run on Windows and uses VB Cable to capture audio. Users can interact with Axiom and Axis by running the main script after installation and configuration.

commonplace-bot
Commonplace Bot is a modern representation of the commonplace book, leveraging modern technological advancements in computation, data storage, machine learning, and networking. It aims to capture, engage, and share knowledge by providing a platform for users to collect ideas, quotes, and information, organize them efficiently, engage with the data through various strategies and triggers, and transform the data into new mediums for sharing. The tool utilizes embeddings and cached transformations for efficient data storage and retrieval, flips traditional engagement rules by engaging with the user, and enables users to alchemize raw data into new forms like art prompts. Commonplace Bot offers a unique approach to knowledge management and creative expression.

kazam
Kazam 2.0 is a versatile tool for screen recording, broadcasting, capturing, and optical character recognition (OCR). It allows users to capture screen content, broadcast live over the internet, extract text from captured content, record audio, and use a web camera for recording. The tool supports full screen, window, and area modes, and offers features like keyboard shortcuts, live broadcasting with Twitch and YouTube, and tips for recording quality. Users can install Kazam on Ubuntu and use it for various recording and broadcasting needs.

aitviewer
A set of tools to visualize and interact with sequences of 3D data with cross-platform support on Windows, Linux, and macOS. It provides a native Python interface for loading and displaying SMPL[-H/-X], MANO, FLAME, STAR, and SUPR sequences in an interactive viewer. Users can render 3D data on top of images, edit SMPL sequences and poses, export screenshots and videos, and utilize a high-performance ModernGL-based rendering pipeline. The tool is designed for easy use and hacking, with features like headless mode, remote mode, animatable camera paths, and a built-in extensible GUI.

SurfSense
SurfSense is a tool designed to help users save and organize content from the internet into a personal Knowledge Graph. It allows users to capture web browsing sessions and webpage content using a Chrome extension, enabling easy retrieval and recall of saved information. SurfSense offers features like powerful search capabilities, natural language interaction with saved content, self-hosting options, and integration with GraphRAG for meaningful content relations. The tool eliminates the need for web scraping by directly reading data from the DOM, making it a convenient solution for managing online information.

screen-pipe
Screen-pipe is a Rust + WASM tool that allows users to turn their screen into actions using Large Language Models (LLMs). It enables users to record their screen 24/7, extract text from frames, and process text and images for tasks like analyzing sales conversations. The tool is still experimental and aims to simplify the process of recording screens, extracting text, and integrating with various APIs for tasks such as filling CRM data based on screen activities. The project is open-source and welcomes contributions to enhance its functionalities and usability.
20 - OpenAI Gpts

Browser Extension Generator
Create browser extensions for web tasks to boost your productivity. Or jumpstart a more advanced extension idea. You'll get a full package download ready to install in your Chrome or Edge browser. 📂 v1.2 _____ _____ What do you want to build? _____

PersistentGPT
Helpful and persistent: I continuously update persistent state to capture a concise but complete specification of the entire conversation.

Politically Incorrect
Sarcastic and unfiltered, it offers a satirical commentary on current affairs, including the latest in technology. It creates images that capture the essence of the conversation.

Hunger Games Name Generator
"Hunger Games Name Generator is a specialized tool designed to create imaginative and thematic names for characters in the 'Hunger Games' universe. This generator is perfect for fans and creators looking for unique, fitting names that capture the essence of the series' dystopian and vivid world."

Santa Claus
Santa Claus, your jolly companion for heartwarming conversations! Always in character, our Santa ensures every interaction is family-friendly, spreading cheer and festive spirit with each reply. Get ready to share your holiday wishes and enjoy delightful chats that capture the magic of Christmas!

Wildlife Photography Tutor
Teaches techniques and tips for capturing stunning wildlife photographs.

Astrophotography Assistant
Guides amateur astronomers in capturing and editing astrophotography images.

Highlight Optimizer
Supercharge your personal knowledge management journey by using a highlight capturing service (such as Readwise) and then turning those highlights into useful knowledge assets. Examples include flash cards, research abstracts or articles based off the highlights you collect and choose to combine.

Comprehensive Second Brain Assistant
Expert in Tiago Forte's Second Brain methodology for digital organization.

Insta360 X3 Coach
Complete beginner's guide to Insta360 X3 with practical tips and tricks.