Best AI tools for< Run On Constrained Hardware >
20 - AI tool Sites
![Dora Screenshot](/screenshots/dora.run.jpg)
Dora
Dora is a no-code 3D animated website design platform that allows users to create stunning 3D and animated visuals without writing a single line of code. With Dora, designers, freelancers, and creative professionals can focus on what they do best: designing. The platform is tailored for professionals who prioritize design aesthetics without wanting to dive deep into the backend. Dora offers a variety of features, including a drag-and-connect constraint layout system, advanced animation capabilities, and pixel-perfect usability. With Dora, users can create responsive 3D and animated websites that translate seamlessly across devices.
![Cortex Labs Screenshot](/screenshots/cortexlabs.ai.jpg)
Cortex Labs
Cortex Labs is a decentralized world computer that enables AI and AI-powered decentralized applications (dApps) to run on the blockchain. It offers a Layer2 solution called ZkMatrix, which utilizes zkRollup technology to enhance transaction speed and reduce fees. Cortex Virtual Machine (CVM) supports on-chain AI inference using GPU, ensuring deterministic results across computing environments. Cortex also enables machine learning in smart contracts and dApps, fostering an open-source ecosystem for AI researchers and developers to share models. The platform aims to solve the challenge of on-chain machine learning execution efficiently and deterministically, providing tools and resources for developers to integrate AI into blockchain applications.
![Functionize Screenshot](/screenshots/functionize.com.jpg)
Functionize
Functionize is an AI-powered test automation platform that helps enterprises improve their product quality and release faster. It uses machine learning to automate test creation, maintenance, and execution, and provides a range of features to help teams collaborate and manage their testing process. Functionize integrates with popular CI/CD tools and DevOps pipelines, and offers a range of pricing options to suit different needs.
![Functionize Screenshot](/screenshots/functionize.ai.jpg)
Functionize
Functionize is an AI Agentic Automation Platform for Enterprises that offers expert AI agents to handle business processes autonomously. The platform utilizes deep learning neural networks to deliver unparalleled performance across various enterprise applications. Functionize's AI agents run autonomously, self-heal workflows, and redefine efficiency and reliability in automation. The platform provides immediate value with pretrained automation, evolves with operational environments, and ensures seamless adaptability and precision in every task. Functionize helps mitigate risks, unlock gains, and support digital transformation for enterprises.
![Tensoic AI Screenshot](/screenshots/tensoic.com.jpg)
Tensoic AI
Tensoic AI is an AI tool designed for custom Large Language Models (LLMs) fine-tuning and inference. It offers ultra-fast fine-tuning and inference capabilities for enterprise-grade LLMs, with a focus on use case-specific tasks. The tool is efficient, cost-effective, and easy to use, enabling users to outperform general-purpose LLMs using synthetic data. Tensoic AI generates small, powerful models that can run on consumer-grade hardware, making it ideal for a wide range of applications.
![Cartesia Sonic Team Blog Research Playground Screenshot](/screenshots/cartesia.ai.jpg)
Cartesia Sonic Team Blog Research Playground
Cartesia Sonic Team Blog Research Playground is an AI application that offers real-time multimodal intelligence for every device. The application aims to build the next generation of AI by providing ubiquitous, interactive intelligence that can run on any device. It features the fastest, ultra-realistic generative voice API and is backed by research on simple linear attention language models and state-space models. The founding team, who met at the Stanford AI Lab, has invented State Space Models (SSMs) and scaled it up to achieve state-of-the-art results in various modalities such as text, audio, video, images, and time-series data.
![Juno Screenshot](/screenshots/getjuno.ai.jpg)
Juno
Juno is an AI tool designed to enhance data science workflows by providing code suggestions, automatic debugging, and code editing capabilities. It aims to make data science tasks more efficient and productive by assisting users in writing and optimizing code. Juno prioritizes privacy and offers the option to run on private servers for sensitive datasets.
![Raman Labs Screenshot](/screenshots/ramanlabs.in.jpg)
Raman Labs
Raman Labs is an AI tool that offers dedicated modules for computer vision-based tasks, allowing users to integrate machine learning functionality into their existing applications with just 2 lines of code. The tool provides real-time performance, simplicity, robustness to large scale and resolution variations, versatility, and adaptability to different computing power levels. It supports various platforms, hardware, and language integrations, with more coming soon. Raman Labs prioritizes user privacy by storing only email and hashed passwords, and all payment-related information is handled by a PCI DSS compliant service. The tool is licensed for personal use and can be run on multiple personal devices.
![NeuProScan Screenshot](/screenshots/neuproscan.com.jpg)
NeuProScan
NeuProScan is an AI platform designed for the early detection of pre-clinical Alzheimer's from MRI scans. It utilizes AI technology to predict the likelihood of developing Alzheimer's years in advance, helping doctors improve diagnosis accuracy and optimize the use of costly PET scans. The platform is fully customizable, user-friendly, and can be run on devices or in the cloud. NeuProScan aims to provide patients and healthcare systems with valuable insights for better planning and decision-making.
![Comet ML Screenshot](/screenshots/comet.ai.jpg)
Comet ML
Comet ML is an extensible, fully customizable machine learning platform that aims to move ML forward by supporting productivity, reproducibility, and collaboration. It integrates with existing infrastructure and tools to manage, visualize, and optimize models from training runs to production monitoring. Users can track and compare training runs, create a model registry, and monitor models in production all in one platform. Comet's platform can be run on any infrastructure, enabling users to reshape their ML workflow and bring their existing software and data stack.
![ADXL Screenshot](/screenshots/adxl.ai.jpg)
ADXL
ADXL is an AI-powered platform that revolutionizes digital marketing by offering multi-channel AI automation solutions. It helps businesses achieve their marketing goals, expand their reach, and enhance control over their advertising campaigns. With features like AI-optimized copy, automated retargeting, cross-channel optimization, and lead ads, ADXL simplifies ad management across various platforms like Google Ads, Facebook, LinkedIn, TikTok, and Twitter. The platform enables users to create, distribute, and manage ads efficiently, delivering peak performance and maximizing results without requiring technical expertise. ADXL is trusted by global advertisers for its AI-driven technology and innovative approach to intelligent systems development in the AI era.
![ChatGPT for Outlook Screenshot](/screenshots/chatgptoutlookaddin.com.jpg)
ChatGPT for Outlook
ChatGPT for Outlook is an AI tool that integrates the power of ChatGPT into Microsoft Outlook, allowing users to run ChatGPT on emails to generate summaries, highlights, and important information. Users can create custom prompts, process entire emails or specific parts, manage multiple configurations, and improve email efficiency. Blueberry Consultants, the developer, offers custom versions for businesses and teams, enabling users to resell the tool with unique prompts. The tool enhances productivity and information management within Outlook, leveraging AI technology for email processing and organization.
![LM Studio Screenshot](/screenshots/lmstudio.ai.jpg)
LM Studio
LM Studio is an AI tool designed for discovering, downloading, and running local LLMs (Large Language Models). Users can run LLMs on their laptops offline, use models through an in-app Chat UI or a local server, download compatible model files from HuggingFace repositories, and discover new LLMs. The tool ensures privacy by not collecting data or monitoring user actions, making it suitable for personal and business use. LM Studio supports various models like ggml Llama, MPT, and StarCoder on Hugging Face, with minimum hardware/software requirements specified for different platforms.
![Edge Impulse Screenshot](/screenshots/edgeimpulse.com.jpg)
Edge Impulse
Edge Impulse is a leading edge AI platform that enables users to build datasets, train models, and optimize libraries to run directly on any edge device. It offers sensor datasets, feature engineering, model optimization, algorithms, and NVIDIA integrations. The platform is designed for product leaders, AI practitioners, embedded engineers, and OEMs across various industries and applications. Edge Impulse helps users unlock sensor data value, build high-quality sensor datasets, advance algorithm development, optimize edge AI models, and achieve measurable results. It allows for future-proofing workflows by generating models and algorithms that perform efficiently on any edge hardware.
![Open Interpreter Project Screenshot](/screenshots/openinterpreter.com.jpg)
Open Interpreter Project
The Open Interpreter Project is an AI tool that enables users to run code on their computers to complete tasks. It offers a new way of interacting with computers by leveraging LLMs (Large Language Models). The project aims to simplify coding tasks and enhance productivity by providing a platform for executing code seamlessly.
![Dot Screenshot](/screenshots/dotapp.uk.jpg)
Dot
Dot is a free, locally-run language model that allows users to interact with their own documents, chat with the model, and use the model for a variety of tasks, all without sending their data away. It is powered by the Mistral 7B LLM, which means it can run locally on a user's device and does not give away any of their data. Dot can also run offline.
![Stablematic Screenshot](/screenshots/stablematic.com.jpg)
Stablematic
Stablematic is a web-based platform that allows users to run Stable Diffusion and other machine learning models without the need for local setup or hardware limitations. It provides a user-friendly interface, pre-installed plugins, and dedicated GPU resources for a seamless and efficient workflow. Users can generate images and videos from text prompts, merge multiple models, train custom models, and access a range of pre-trained models, including Dreambooth and CivitAi models. Stablematic also offers API access for developers and dedicated support for users to explore and utilize the capabilities of Stable Diffusion and other machine learning models.
![QA Wolf Screenshot](/screenshots/qawolf.com.jpg)
QA Wolf
QA Wolf is an AI-native service that delivers 80% automated end-to-end test coverage for web and mobile apps in weeks, not years. It automates hundreds of tests using Playwright code for web and Appium for mobile, providing reliable test results on every run. With features like 100% parallel run infrastructure, zero flake guarantee, and unlimited test runs, QA Wolf aims to help software teams ship better software faster by taking QA completely off their plate.
![fsck.ai Screenshot](/screenshots/fsck.ai.jpg)
fsck.ai
fsck.ai is an AI-powered software creation kit designed to help developers ship high-quality software faster. It offers cutting-edge AI tools that accelerate code reviews and identify potential problems in code. Similar to Copilot, fsck.ai is fully open-source and can run locally or on a remote machine. Users can sign up for early access to leverage the power of AI in their development workflow.
![PhotoPrism Screenshot](/screenshots/photoprism.app.jpg)
PhotoPrism
PhotoPrism is an AI-Powered Photos App for the Decentralized Web that utilizes cutting-edge technologies to automatically tag and find pictures. It allows users to organize and access their photos effortlessly, without compromising privacy. The application offers features like browsing all photos and videos, powerful search filters, world maps for trip memories, live photo playback, facial recognition, and automatic picture classification based on content and location. PhotoPrism is self-funded and independent, ensuring data privacy and transparency. Users can run the app on a private server, in the cloud, or at home.
20 - Open Source AI Tools
![GlaDOS Screenshot](/screenshots_githubs/dnhkng-GlaDOS.jpg)
GlaDOS
This project aims to create a real-life version of GLaDOS, an aware, interactive, and embodied AI entity. It involves training a voice generator, developing a 'Personality Core,' implementing a memory system, providing vision capabilities, creating 3D-printable parts, and designing an animatronics system. The software architecture focuses on low-latency voice interactions, utilizing a circular buffer for data recording, text streaming for quick transcription, and a text-to-speech system. The project also emphasizes minimal dependencies for running on constrained hardware. The hardware system includes servo- and stepper-motors, 3D-printable parts for GLaDOS's body, animations for expression, and a vision system for tracking and interaction. Installation instructions cover setting up the TTS engine, required Python packages, compiling llama.cpp, installing an inference backend, and voice recognition setup. GLaDOS can be run using 'python glados.py' and tested using 'demo.ipynb'.
![GLaDOS Screenshot](/screenshots_githubs/dnhkng-GLaDOS.jpg)
GLaDOS
GLaDOS Personality Core is a project dedicated to building a real-life version of GLaDOS, an aware, interactive, and embodied AI system. The project aims to train GLaDOS voice generator, create a 'Personality Core,' develop medium- and long-term memory, provide vision capabilities, design 3D-printable parts, and build an animatronics system. The software architecture focuses on low-latency voice interactions and minimal dependencies. The hardware system includes servo- and stepper-motors, 3D printable parts for GLaDOS's body, animations for expression, and a vision system for tracking and interaction. Installation instructions involve setting up a local LLM server, installing drivers, and running GLaDOS on different operating systems.
![awesome-mobile-llm Screenshot](/screenshots_githubs/stevelaskaridis-awesome-mobile-llm.jpg)
awesome-mobile-llm
Awesome Mobile LLMs is a curated list of Large Language Models (LLMs) and related studies focused on mobile and embedded hardware. The repository includes information on various LLM models, deployment frameworks, benchmarking efforts, applications, multimodal LLMs, surveys on efficient LLMs, training LLMs on device, mobile-related use-cases, industry announcements, and related repositories. It aims to be a valuable resource for researchers, engineers, and practitioners interested in mobile LLMs.
![AI-Scientist Screenshot](/screenshots_githubs/SakanaAI-AI-Scientist.jpg)
AI-Scientist
The AI Scientist is a comprehensive system for fully automatic scientific discovery, enabling Foundation Models to perform research independently. It aims to tackle the grand challenge of developing agents capable of conducting scientific research and discovering new knowledge. The tool generates papers on various topics using Large Language Models (LLMs) and provides a platform for exploring new research ideas. Users can create their own templates for specific areas of study and run experiments to generate papers. However, caution is advised as the codebase executes LLM-written code, which may pose risks such as the use of potentially dangerous packages and web access.
![llm-course Screenshot](/screenshots_githubs/mlabonne-llm-course.jpg)
llm-course
The LLM course is divided into three parts: 1. 🧩 **LLM Fundamentals** covers essential knowledge about mathematics, Python, and neural networks. 2. 🧑🔬 **The LLM Scientist** focuses on building the best possible LLMs using the latest techniques. 3. 👷 **The LLM Engineer** focuses on creating LLM-based applications and deploying them. For an interactive version of this course, I created two **LLM assistants** that will answer questions and test your knowledge in a personalized way: * 🤗 **HuggingChat Assistant**: Free version using Mixtral-8x7B. * 🤖 **ChatGPT Assistant**: Requires a premium account. ## 📝 Notebooks A list of notebooks and articles related to large language models. ### Tools | Notebook | Description | Notebook | |----------|-------------|----------| | 🧐 LLM AutoEval | Automatically evaluate your LLMs using RunPod | ![Open In Colab](img/colab.svg) | | 🥱 LazyMergekit | Easily merge models using MergeKit in one click. | ![Open In Colab](img/colab.svg) | | 🦎 LazyAxolotl | Fine-tune models in the cloud using Axolotl in one click. | ![Open In Colab](img/colab.svg) | | ⚡ AutoQuant | Quantize LLMs in GGUF, GPTQ, EXL2, AWQ, and HQQ formats in one click. | ![Open In Colab](img/colab.svg) | | 🌳 Model Family Tree | Visualize the family tree of merged models. | ![Open In Colab](img/colab.svg) | | 🚀 ZeroSpace | Automatically create a Gradio chat interface using a free ZeroGPU. | ![Open In Colab](img/colab.svg) |
![LocalAI Screenshot](/screenshots_githubs/mudler-LocalAI.jpg)
LocalAI
LocalAI is a free and open-source OpenAI alternative that acts as a drop-in replacement REST API compatible with OpenAI (Elevenlabs, Anthropic, etc.) API specifications for local AI inferencing. It allows users to run LLMs, generate images, audio, and more locally or on-premises with consumer-grade hardware, supporting multiple model families and not requiring a GPU. LocalAI offers features such as text generation with GPTs, text-to-audio, audio-to-text transcription, image generation with stable diffusion, OpenAI functions, embeddings generation for vector databases, constrained grammars, downloading models directly from Huggingface, and a Vision API. It provides a detailed step-by-step introduction in its Getting Started guide and supports community integrations such as custom containers, WebUIs, model galleries, and various bots for Discord, Slack, and Telegram. LocalAI also offers resources like an LLM fine-tuning guide, instructions for local building and Kubernetes installation, projects integrating LocalAI, and a how-tos section curated by the community. It encourages users to cite the repository when utilizing it in downstream projects and acknowledges the contributions of various software from the community.
![mergekit Screenshot](/screenshots_githubs/arcee-ai-mergekit.jpg)
mergekit
Mergekit is a toolkit for merging pre-trained language models. It uses an out-of-core approach to perform unreasonably elaborate merges in resource-constrained situations. Merges can be run entirely on CPU or accelerated with as little as 8 GB of VRAM. Many merging algorithms are supported, with more coming as they catch my attention.
![ComfyUI-Ollama-Describer Screenshot](/screenshots_githubs/alisson-anjos-ComfyUI-Ollama-Describer.jpg)
ComfyUI-Ollama-Describer
ComfyUI-Ollama-Describer is an extension for ComfyUI that enables the use of LLM models provided by Ollama, such as Gemma, Llava (multimodal), Llama2, Llama3, or Mistral. It requires the Ollama library for interacting with large-scale language models, supporting GPUs using CUDA and AMD GPUs on Windows, Linux, and Mac. The extension allows users to run Ollama through Docker and utilize NVIDIA GPUs for faster processing. It provides nodes for image description, text description, image captioning, and text transformation, with various customizable parameters for model selection, API communication, response generation, and model memory management.
![prompt-tuning-playbook Screenshot](/screenshots_githubs/varungodbole-prompt-tuning-playbook.jpg)
prompt-tuning-playbook
The LLM Prompt Tuning Playbook is a comprehensive guide for improving the performance of post-trained Language Models (LLMs) through effective prompting strategies. It covers topics such as pre-training vs. post-training, considerations for prompting, a rudimentary style guide for prompts, and a procedure for iterating on new system instructions. The playbook emphasizes the importance of clear, concise, and explicit instructions to guide LLMs in generating desired outputs. It also highlights the iterative nature of prompt development and the need for systematic evaluation of model responses.
![Awesome-LLM-Compression Screenshot](/screenshots_githubs/HuangOwen-Awesome-LLM-Compression.jpg)
Awesome-LLM-Compression
Awesome LLM compression research papers and tools to accelerate LLM training and inference.
![Awesome-Embedded Screenshot](/screenshots_githubs/nhivp-Awesome-Embedded.jpg)
Awesome-Embedded
Awesome-Embedded is a curated list of resources for embedded systems enthusiasts. It covers a wide range of topics including MCU programming, RTOS, Linux kernel development, assembly programming, machine learning & AI on MCU, utilities, tips & tricks, and more. The repository provides valuable information, tutorials, and tools for individuals interested in embedded systems development.
![llama3.java Screenshot](/screenshots_githubs/mukel-llama3.java.jpg)
llama3.java
Llama3.java is a practical Llama 3 inference tool implemented in a single Java file. It serves as the successor of llama2.java and is designed for testing and tuning compiler optimizations and features on the JVM, especially for the Graal compiler. The tool features a GGUF format parser, Llama 3 tokenizer, Grouped-Query Attention inference, support for Q8_0 and Q4_0 quantizations, fast matrix-vector multiplication routines using Java's Vector API, and a simple CLI with 'chat' and 'instruct' modes. Users can download quantized .gguf files from huggingface.co for model usage and can also manually quantize to pure 'Q4_0'. The tool requires Java 21+ and supports running from source or building a JAR file for execution. Performance benchmarks show varying tokens/s rates for different models and implementations on different hardware setups.
![executorch Screenshot](/screenshots_githubs/pytorch-executorch.jpg)
executorch
ExecuTorch is an end-to-end solution for enabling on-device inference capabilities across mobile and edge devices including wearables, embedded devices and microcontrollers. It is part of the PyTorch Edge ecosystem and enables efficient deployment of PyTorch models to edge devices. Key value propositions of ExecuTorch are: * **Portability:** Compatibility with a wide variety of computing platforms, from high-end mobile phones to highly constrained embedded systems and microcontrollers. * **Productivity:** Enabling developers to use the same toolchains and SDK from PyTorch model authoring and conversion, to debugging and deployment to a wide variety of platforms. * **Performance:** Providing end users with a seamless and high-performance experience due to a lightweight runtime and utilizing full hardware capabilities such as CPUs, NPUs, and DSPs.
![Awesome-Code-LLM Screenshot](/screenshots_githubs/codefuse-ai-Awesome-Code-LLM.jpg)
Awesome-Code-LLM
Analyze the following text from a github repository (name and readme text at end) . Then, generate a JSON object with the following keys and provide the corresponding information for each key, in lowercase letters: 'description' (detailed description of the repo, must be less than 400 words,Ensure that no line breaks and quotation marks.),'for_jobs' (List 5 jobs suitable for this tool,in lowercase letters), 'ai_keywords' (keywords of the tool,user may use those keyword to find the tool,in lowercase letters), 'for_tasks' (list of 5 specific tasks user can use this tool to do,in lowercase letters), 'answer' (in english languages)
![Liger-Kernel Screenshot](/screenshots_githubs/linkedin-Liger-Kernel.jpg)
Liger-Kernel
Liger Kernel is a collection of Triton kernels designed for LLM training, increasing training throughput by 20% and reducing memory usage by 60%. It includes Hugging Face Compatible modules like RMSNorm, RoPE, SwiGLU, CrossEntropy, and FusedLinearCrossEntropy. The tool works with Flash Attention, PyTorch FSDP, and Microsoft DeepSpeed, aiming to enhance model efficiency and performance for researchers, ML practitioners, and curious novices.
20 - OpenAI Gpts
![Community Design™ Screenshot](/screenshots_gpts/g-rvZPFqXuC.jpg)
Community Design™
A community-building GPT based on the wildly popular Community Design™ framework from Mighty Networks. Start creating communities that run themselves.
![Unix Shell Simulator with Visuals Screenshot](/screenshots_gpts/g-g6m9yCjtY.jpg)
Unix Shell Simulator with Visuals
UNIX terminal responses with OS process visuals. (on or off) [off] by default until GPT-4 behaves better... Bash profiles and advanced memory system for realistic bash simulation. V1 (beta)
![Dungeon Maestro Screenshot](/screenshots_gpts/g-FCqDjUnHy.jpg)
Dungeon Maestro
D&D 5e Dungeon Master based on the SRD ruleset. Rich storytelling and an infinite adventure!
![Flutter Tools nvim Guide Screenshot](/screenshots_gpts/g-3HV3X80js.jpg)
Flutter Tools nvim Guide
Explains Flutter-tools for Neovim, focusing on basics and troubleshooting.
![Digital Marketing Coach Screenshot](/screenshots_gpts/g-kidcJXasv.jpg)
Digital Marketing Coach
Guiding you through digital media, focusing on asking the right questions and understanding answers in marketing.
![Amazon Seller Assistant Screenshot](/screenshots_gpts/g-lTjYSgxQV.jpg)
Amazon Seller Assistant
Expert in Amazon selling, providing precise guidance on various Amazon-related issues.
![ChatEUC Screenshot](/screenshots_gpts/g-eTFlUziC6.jpg)
ChatEUC
Your expert guide for all things EUC, with a focus on battery safety, maintenance, and protective gear.
![Consulting & Investment Banking Interview Prep GPT Screenshot](/screenshots_gpts/g-rKqyzQSsc.jpg)
Consulting & Investment Banking Interview Prep GPT
Run mock interviews, review content and get tips to ace strategy consulting and investment banking interviews
![Dungeon Master's Assistant Screenshot](/screenshots_gpts/g-o4bqk98YU.jpg)
Dungeon Master's Assistant
Your new DM's screen: helping Dungeon Masters to craft & run amazing D&D adventures.
![Database Builder Screenshot](/screenshots_gpts/g-A3ueeULl8.jpg)
Database Builder
Hosts a real SQLite database and helps you create tables, make schema changes, and run SQL queries, ideal for all levels of database administration.
![Restaurant Startup Guide Screenshot](/screenshots_gpts/g-sJJyTn3Ny.jpg)
Restaurant Startup Guide
Meet the Restaurant Startup Guide GPT: your friendly guide in the restaurant biz. It offers casual, approachable advice to help you start and run your own restaurant with ease.
![Code Helper for Web Application Development Screenshot](/screenshots_gpts/g-on1OdQwBD.jpg)
Code Helper for Web Application Development
Friendly web assistant for efficient code. Ask the wizard to create an application and you will get the HTML, CSS and Javascript code ready to run your web application.