Best AI tools for< open drawers >
20 - AI tool Sites
Dobb-E
Dobb-E is an open-source, general framework for learning household robotic manipulation. It is an affordable yet versatile general-purpose system for learning robotic manipulation within household settings. Dobb-E can learn a new task with only five minutes of a user showing it how to do it, thanks to a demonstration collection tool ("The Stick") built out of cheap parts and iPhones. Dobb-E has been tested in 10 homes, with a total of 109 tasks in different environments, and finally achieved a success rate of 81%.
Open Interpreter
Open Interpreter is a tool that allows users to run code on their computer using LLMs (Large Language Models). This enables users to complete tasks that would normally be difficult or impossible to do manually. Open Interpreter is still in its early stages of development, but it has the potential to revolutionize the way we use computers.
Open Translation AI
Open Translation AI is an AI-powered translation tool that provides human-like, fluent, and near-perfect translations. It utilizes advanced machine learning algorithms to analyze and translate text, capturing the nuances and context of the original language. With Open Translation AI, users can easily translate documents, websites, and other content into over 100 languages, ensuring accurate and effective communication across linguistic barriers.
Open GPT 4o
Open GPT 4o is an advanced large multimodal language model developed by OpenAI, offering real-time audiovisual responses, emotion recognition, and superior visual capabilities. It can handle text, audio, and image inputs, providing a rich and interactive user experience. GPT 4o is free for all users and features faster response times, advanced interactivity, and the ability to recognize and output emotions. It is designed to be more powerful and comprehensive than its predecessor, GPT 4, making it suitable for applications requiring voice interaction and multimodal processing.
CVF Open Access
The Computer Vision Foundation (CVF) is a non-profit organization dedicated to advancing the field of computer vision. CVF organizes several conferences and workshops each year, including the International Conference on Computer Vision (ICCV), the Conference on Computer Vision and Pattern Recognition (CVPR), and the Winter Conference on Applications of Computer Vision (WACV). CVF also publishes the International Journal of Computer Vision (IJCV) and the Computer Vision and Image Understanding (CVIU) journal. The CVF Open Access website provides access to the full text of all CVF-sponsored conference papers. These papers are available for free download in PDF format. The CVF Open Access website also includes links to the arXiv versions of the papers, where available.
arXiv
arXiv.org is a free distribution service and an open-access archive for nearly 2.4 million scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Materials on this site are not peer-reviewed by arXiv.
WordPress
WordPress is a free and open-source content management system (CMS) that allows you to create a website or blog. It is the most popular CMS in the world, powering over 43% of all websites. WordPress is easy to use, even for beginners, and it comes with a wide range of features and plugins that allow you to customize your website to your needs.
DVC
DVC is an open-source version control system for machine learning projects. It allows users to track and manage their data, models, and code in a single place. DVC also provides a number of features that make it easy to collaborate on machine learning projects, such as experiment tracking, model registration, and pipeline management.
OpenGPT
OpenGPT is a community for Open AI enthusiasts. It provides access to various AI tools such as GPT Store, OpenGPTs, Open Chat, Open Draw, and Open Video. Users can submit their GPTs and earn credits for free access to advanced AI models like Google Gemini Pro, ChatGPT4, DALL.E.3, and Imagen2.
SoraWebui
SoraWebui is an open-source web platform that simplifies video creation by allowing users to generate videos from text using OpenAI's Sora model. It provides an easy-to-use interface and one-click website deployment, making it accessible to both professionals and enthusiasts in video production and AI technology. SoraWebui also includes a simulated version of the Sora API called FakeSoraAPI, which allows developers to start developing and testing their projects in a mock environment.
Google Gemma
Google Gemma is a lightweight, state-of-the-art open language model (LLM) developed by Google. It is part of the same research used in the creation of Google's Gemini models. Gemma models come in two sizes, the 2B and 7B parameter versions, where each has a base (pre-trained) and instruction-tuned modifications. Gemma models are designed to be cross-device compatible and optimized for Google Cloud and NVIDIA GPUs. They are also accessible through Kaggle, Hugging Face, Google Cloud with Vertex AI or GKE. Gemma models can be used for a variety of applications, including text generation, summarization, RAG, and both commercial and research use.
Tomat AI
Tomat AI is a powerful tool that allows you to open and explore large CSV files without the need for code or formulas. With its intuitive drag-and-drop interface, you can easily filter, sort, and group rows, as well as merge multiple files into a single table. Tomat AI also includes a built-in AI assistant that can help you enrich, translate, analyze, and clean up your data. Whether you're a data analyst, a researcher, or a business user, Tomat AI can help you get the most out of your data.
Clarity Write
Clarity Write is an open-source SaaS script that provides a comprehensive suite of AI-powered tools to transform content creation. With its powerful AI capabilities, users can effortlessly generate high-quality content, create stunning visuals, automate coding tasks, transcribe audio and video files, and engage with AI experts via chatbots. Clarity Write also offers a vast library of over 500 professionally designed templates, a feature-rich editor for refining content, and robust admin tools for streamlined management. By leveraging the capabilities of OpenAI APIs, Clarity Write empowers users to enhance their content creation process, unlock endless creativity, and simplify their operations.
karaok-AI
karaok-AI is an open-source karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text). It uses WhisperHallu and WhisperTimeSync to extract vocals and lyrics. karaok-AI also includes kaiDJ, a minimalist and easy-to-use DJ Party Player with multi-sound cards support, two players with auto-mix between songs, and a pre-listen player. It can index thousands of songs in a single efficient database and allows for direct search and selection over all songs. Additionally, it offers playlist management with nested groups and the ability to open and save m3u and m3u8 playlists while keeping group definitions.
OpenVoiceOS
OpenVoiceOS is a community-driven, open-source voice AI platform for creating custom voice-controlled interfaces across devices with NLP, a customizable UI, and a focus on privacy and security. OpenVoiceOS is designed to provide users with a seamless and intuitive voice interface for controlling their smart home devices, playing music, setting reminders, and much more. OpenVoiceOS is open to all developers and contributors wanting to support a specific device or a platform. OpenVoiceOS is the platform to throw your ideas at if you have an experimental feature you want users to experience before landing them into any of the Linux-based open-source voice assistant projects upstream.
Cyguru
Cyguru is an all-in-one cloud-based AI Security Operation Center (SOC) that offers a comprehensive range of features for a robust and secure digital landscape. Its Security Operation Center is the cornerstone of its service domain, providing AI-Powered Attack Detection, Continuous Monitoring for Vulnerabilities and Misconfigurations, Compliance Assurance, SecPedia: Your Cybersecurity Knowledge Hub, and Advanced ML & AI Detection. Cyguru's AI-Powered Analyst promptly alerts users to any suspicious behavior or activity that demands attention, ensuring timely delivery of notifications. The platform is accessible to everyone, with up to three free servers and subsequent pricing that is more than 85% below the industry average.
Evidently AI
Evidently AI is an open-source machine learning (ML) monitoring and observability platform that helps data scientists and ML engineers evaluate, test, and monitor ML models from validation to production. It provides a centralized hub for ML in production, including data quality monitoring, data drift monitoring, ML model performance monitoring, and NLP and LLM monitoring. Evidently AI's features include customizable reports, structured checks for data and models, and a Python library for ML monitoring. It is designed to be easy to use, with a simple setup process and a user-friendly interface. Evidently AI is used by over 2,500 data scientists and ML engineers worldwide, and it has been featured in publications such as Forbes, VentureBeat, and TechCrunch.
LAION
LAION is a non-profit organization that provides datasets, tools, and models to advance machine learning research. The organization's goal is to promote open public education and encourage the reuse of existing datasets and models to reduce the environmental impact of machine learning research.
OpenCV
OpenCV is a library of programming functions mainly aimed at real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage and is now maintained by Itseez. OpenCV is cross-platform and free for use under the open-source BSD license.
Keep
Keep is an open-source AIOps platform designed for those who deal with alerts in complex environments. It offers a bidirectional integration system to keep alerts and signals in sync with high-quality integrations like AppDynamics, Datadog, Grafana, Jira, Pagerduty, and more. Keep provides a single pane of glass for alerts, advanced querying capabilities, and rule-based grouping to combat noise. With a workflow engine similar to GitHub Actions but with a user-friendly UI, users can query databases, enrich alerts, update tickets, and run scripts. The platform leverages AI for alert correlation based on past incidents and knowledge base, constantly improving with user feedback. Keep is committed to open source, offering self-hosted and cloud deployment options with unique AI-driven alert correlation and summarization.
20 - Open Source AI Tools
habitat-sim
Habitat-Sim is a high-performance physics-enabled 3D simulator with support for 3D scans of indoor/outdoor spaces, CAD models of spaces and piecewise-rigid objects, configurable sensors, robots described via URDF, and rigid-body mechanics. It prioritizes simulation speed over the breadth of simulation capabilities, achieving several thousand frames per second (FPS) running single-threaded and over 10,000 FPS multi-process on a single GPU when rendering a scene from the Matterport3D dataset. Habitat-Sim simulates a Fetch robot interacting in ReplicaCAD scenes at over 8,000 steps per second (SPS), where each ‘step’ involves rendering 1 RGBD observation (128×128 pixels) and rigid-body dynamics for 1/30sec.
aiolauncher_scripts
AIO Launcher Scripts is a collection of Lua scripts that can be used with AIO Launcher to enhance its functionality. These scripts can be used to create widget scripts, search scripts, and side menu scripts. They provide various functions such as displaying text, buttons, progress bars, charts, and interacting with app widgets. The scripts can be used to customize the appearance and behavior of the launcher, add new features, and interact with external services.
chatgpt-mobile
ChatGPT Mobile is a mobile application built using React Native, Expo & Webview. It offers an improved mobile UI/UX with features such as native drawer swipe, transparent status bar, themed bottom bar, automatic light/dark theme based on system settings, android back button functionality, startup keyboard automatic open, chat keyboard automatic dismiss, splash screen display, and a 'chatgpt is at capacity' workaround. Users can download the Android APK file from GitHub Releases to experience the application.
Awesome-Segment-Anything
The Segment Anything Model (SAM) is a powerful tool that allows users to segment any object in an image with just a few clicks. This makes it a great tool for a variety of tasks, such as object detection, tracking, and editing. SAM is also very easy to use, making it a great option for both beginners and experienced users.
langchain
LangChain is a framework for developing Elixir applications powered by language models. It enables applications to connect language models to other data sources and interact with the environment. The library provides components for working with language models and off-the-shelf chains for specific tasks. It aims to assist in building applications that combine large language models with other sources of computation or knowledge. LangChain is written in Elixir and is not aimed for parity with the JavaScript and Python versions due to differences in programming paradigms and design choices. The library is designed to make it easy to integrate language models into applications and expose features, data, and functionality to the models.
org-ai
org-ai is a minor mode for Emacs org-mode that provides access to generative AI models, including OpenAI API (ChatGPT, DALL-E, other text models) and Stable Diffusion. Users can use ChatGPT to generate text, have speech input and output interactions with AI, generate images and image variations using Stable Diffusion or DALL-E, and use various commands outside org-mode for prompting using selected text or multiple files. The tool supports syntax highlighting in AI blocks, auto-fill paragraphs on insertion, and offers block options for ChatGPT, DALL-E, and other text models. Users can also generate image variations, use global commands, and benefit from Noweb support for named source blocks.
ai-audio-datasets
AI Audio Datasets List (AI-ADL) is a comprehensive collection of datasets consisting of speech, music, and sound effects, used for Generative AI, AIGC, AI model training, and audio applications. It includes datasets for speech recognition, speech synthesis, music information retrieval, music generation, audio processing, sound synthesis, and more. The repository provides a curated list of diverse datasets suitable for various AI audio tasks.
open-webui
Open WebUI is an extensible, feature-rich, and user-friendly self-hosted WebUI designed to operate entirely offline. It supports various LLM runners, including Ollama and OpenAI-compatible APIs. For more information, be sure to check out our Open WebUI Documentation.
open-parse
Open Parse is a Python library for visually discerning document layouts and chunking them effectively. It is designed to fill the gap in open-source libraries for handling complex documents. Unlike text splitting, which converts a file to raw text and slices it up, Open Parse visually analyzes documents for superior LLM input. It also supports basic markdown for parsing headings, bold, and italics, and has high-precision table support, extracting tables into clean Markdown formats with accuracy that surpasses traditional tools. Open Parse is extensible, allowing users to easily implement their own post-processing steps. It is also intuitive, with great editor support and completion everywhere, making it easy to use and learn.
open-source-ops
This repository contains various tools, scripts, instructions, and guides that can be useful when creating open-source projects. All materials are available under the BSD-3 license.
Open-Sora-Plan
Open-Sora-Plan is a project that aims to create a simple and scalable repo to reproduce Sora (OpenAI, but we prefer to call it "ClosedAI"). The project is still in its early stages, but the team is working hard to improve it and make it more accessible to the open-source community. The project is currently focused on training an unconditional model on a landscape dataset, but the team plans to expand the scope of the project in the future to include text2video experiments, training on video2text datasets, and controlling the model with more conditions.
AeonLabs-AI-Volvo-MKII-Open-Hardware
This open hardware project aims to extend the life of Volvo P2 platform vehicles by updating them to current EU safety and emission standards. It involves designing and prototyping OEM hardware electronics that can replace existing electronics in these vehicles, using the existing wiring and without requiring reverse engineering or modifications. The project focuses on serviceability, maintenance, repairability, and personal ownership safety, and explores the advantages of using open solutions compared to conventional hardware electronics solutions.
open-saas
Open SaaS is a free and open-source React and Node.js template for building SaaS applications. It comes with a variety of features out of the box, including authentication, payments, analytics, and more. Open SaaS is built on top of the Wasp framework, which provides a number of features to make it easy to build SaaS applications, such as full-stack authentication, end-to-end type safety, jobs, and one-command deploy.
Open-DocLLM
Open-DocLLM is an open-source project that addresses data extraction and processing challenges using OCR and LLM technologies. It consists of two main layers: OCR for reading document content and LLM for extracting specific content in a structured manner. The project offers a larger context window size compared to JP Morgan's DocLLM and integrates tools like Tesseract OCR and Mistral for efficient data analysis. Users can run the models on-premises using LLM studio or Ollama, and the project includes a FastAPI app for testing purposes.
Open-Interface
Open Interface is a self-driving software that automates computer tasks by sending user requests to a language model backend (e.g., GPT-4V) and simulating keyboard and mouse inputs to execute the steps. It course-corrects by sending current screenshots to the language models. The tool supports MacOS, Linux, and Windows, and requires setting up the OpenAI API key for access to GPT-4V. It can automate tasks like creating meal plans, setting up custom language model backends, and more. Open Interface is currently not efficient in accurate spatial reasoning, tracking itself in tabular contexts, and navigating complex GUI-rich applications. Future improvements aim to enhance the tool's capabilities with better models trained on video walkthroughs. The tool is cost-effective, with user requests priced between $0.05 - $0.20, and offers features like interrupting the app and primary display visibility in multi-monitor setups.
open-ai
Open AI is a powerful tool for artificial intelligence research and development. It provides a wide range of machine learning models and algorithms, making it easier for developers to create innovative AI applications. With Open AI, users can explore cutting-edge technologies such as natural language processing, computer vision, and reinforcement learning. The platform offers a user-friendly interface and comprehensive documentation to support users in building and deploying AI solutions. Whether you are a beginner or an experienced AI practitioner, Open AI offers the tools and resources you need to accelerate your AI projects and stay ahead in the rapidly evolving field of artificial intelligence.
open-model-database
OpenModelDB is a community-driven database of AI upscaling models, providing a centralized platform for users to access and compare various models. The repository contains a collection of models and model metadata, facilitating easy exploration and evaluation of different AI upscaling solutions. With a focus on enhancing the accessibility and usability of AI models, OpenModelDB aims to streamline the process of finding and selecting the most suitable models for specific tasks or projects.
open-llms
Open LLMs is a repository containing various Large Language Models licensed for commercial use. It includes models like T5, GPT-NeoX, UL2, Bloom, Cerebras-GPT, Pythia, Dolly, and more. These models are designed for tasks such as transfer learning, language understanding, chatbot development, code generation, and more. The repository provides information on release dates, checkpoints, papers/blogs, parameters, context length, and licenses for each model. Contributions to the repository are welcome, and it serves as a resource for exploring the capabilities of different language models.
Open-SD-WebUI-Launcher
Open-SD-WebUI-Launcher is a web-based tool developed by the Astro-Artistry-Dev-Team for launching SD web user interfaces. It provides a user-friendly interface for managing and launching web applications. The tool is designed to simplify the process of launching web interfaces for SD applications, making it easier for users to access and interact with their applications. With Open-SD-WebUI-Launcher, users can quickly and easily launch their SD web interfaces without the need for complex configurations or technical knowledge. The tool offers a streamlined and intuitive interface that allows users to manage their web applications with ease.
Open-Prompt-Injection
OpenPromptInjection is an open-source toolkit for attacks and defenses in LLM-integrated applications, enabling easy implementation, evaluation, and extension of attacks, defenses, and LLMs. It supports various attack and defense strategies, including prompt injection, paraphrasing, retokenization, data prompt isolation, instructional prevention, sandwich prevention, perplexity-based detection, LLM-based detection, response-based detection, and know-answer detection. Users can create models, tasks, and apps to evaluate different scenarios. The toolkit currently supports PaLM2 and provides a demo for querying models with prompts. Users can also evaluate ASV for different scenarios by injecting tasks and querying models with attacked data prompts.
20 - OpenAI Gpts
Open Source Starter Guide
Open Source Guide for Everyone: First time contributors, maintainers, and the curious.
Open Source Alternative
Find open source alternative to any paid service you can think of
Open Data Italia bot
Fornisce informazioni sulla normativa italiana in materia di open data, con un tono professionale e divulgativo. In modo che sia più facile chiederne e/o pretenderne la pubblicazione.
Open AI API Documentation Assistant
Uses OpenAI's latest API docs to answer questions about their newly released API. This is not an official OpenAI bot.
Anything Goes Adventurer Game
This is a truly, open world game, where literally anything is possible, you can form your own story, or start to dive into one of the built mysteries and adventures. There are no two similar games and experiences.
OpenData Explorer
I'll help you access and understand open data published by central government, local authorities and public bodies. You can ask me in your native language.
CaseCracker™: Consultant Case Interview Practice
Crack open the door to your future. (Partner tip: use the iPhone app for voice chat)
Personal Financial Advisor
This Open AI tool analyzes your financial data, budgets and cashflow and suggests areas of improvement and quick insights. Drop an XLS file here or copy/paste your financial data and get insights! (Your data remains private and creator of this ChatGPT has no access to it).
GPT-Info
Extensive guide for ChatGPT models. 🛈 This software is free and open-source; anyone can redistribute it and/or modify it.