Best AI tools for< Install Custom Kernels >
20 - AI tool Sites
FastBots.ai
FastBots.ai is an AI chatbot builder that allows users to create custom chatbots trained on their own data. These chatbots can be integrated into websites to provide customer support, sales assistance, and other services. FastBots.ai is easy to use and requires no coding. It supports a wide range of content types, including text, PDFs, and YouTube videos. FastBots.ai also offers a variety of features, such as customization options, chat history storage, and Zapier integration.
AnythingLLM
AnythingLLM is an all-in-one AI application designed for everyone. It offers a suite of tools for working with LLM (Large Language Models), documents, and agents in a fully private environment. Users can install AnythingLLM on their desktop for Windows, MacOS, and Linux, enabling flexible one-click installation and secure, fully private operation without internet connectivity. The application supports custom models, including enterprise models like GPT-4, custom fine-tuned models, and open-source models like Llama and Mistral. AnythingLLM allows users to work with various document formats, such as PDFs and word documents, providing tailored solutions with locally running defaults for privacy.
Axiom.ai
Axiom.ai is a no-code browser automation tool that allows users to automate website actions and repetitive tasks on any website or web app. It is a Chrome Extension that is simple to install and free to try. Once installed, users can pin Axiom to the Chrome Toolbar and click on the icon to open and close. Users can build custom bots or use templates to automate actions like clicking, typing, and scraping data from websites. Axiom.ai can be integrated with Zapier to trigger automations based on external events.
AnythingYou.AI
AnythingYou.AI is an AI tool that generates beautiful profile pictures using AI avatars. Users can create custom AI avatars by uploading 10-20 selfies, and the tool will train a custom model for them immediately. The generated avatar images are high-quality and realistic, created using innovative technologies like Stable Diffusion and DreamBooth. Users can easily create avatars without the need for subscriptions or app installs, and get their avatar images in just 2 hours. The tool ensures user privacy by using images only for model training and deleting them immediately after avatar generation.
Cmd J – ChatGPT for Chrome
Cmd J – ChatGPT for Chrome is a Chrome extension that allows users to use ChatGPT on any tab without having to copy and paste. It offers a variety of features to help users improve their writing, generate blog posts, crush coding issues, boost their social engagement, and fix code bugs faster. The extension is easy to use and can be accessed with a simple keyboard shortcut.
Package
Package is a generative AI rendering tool that helps homeowners envision different renovation styles, receive recommended material packages, and streamline procurement with just one click. It offers a wide range of design packages curated by experts, allowing users to customize items to fit their specific style. Package also provides 3D renderings, material management, and personalized choices, making it easy for homeowners to bring their design ideas to life.
Cascadeur
Cascadeur is a standalone 3D software that lets you create keyframe animation, as well as clean up and edit any imported ones. Thanks to its AI-assisted and physics tools you can dramatically speed up the animation process and get high quality results. It works with .FBX, .DAE and .USD files making it easy to integrate into any animation workflow.
Hoop.dev
Hoop.dev is an AI application that provides live AI data masking in Rails console sessions. It offers shield Rails console access, automated employee onboarding & off-boarding, and AI data masking to protect customer data with a plug & play PII filter. The application enables compliant access without disrupting speed, automates HIPAA, SOC 1/2, PCI, GDPR, & other security controls, and reduces Rails Console use by finding repeated operations and turning Ruby scripts into repeatable no-code UIs.
GptPanda
GptPanda is a free AI-assistant application designed for Slack users to enhance teamwork and productivity. It integrates seamlessly into Slack workspaces, offering unlimited requests and support in multiple languages. Users can communicate with GptPanda in personal messages or corporate chats, allowing it to assist with daily tasks, answer questions, and manage workspaces efficiently. The application prioritizes user data security through encryption and provides 24/7 customer support for any inquiries or issues.
Meow Apps
Meow Apps is a collection of powerful WordPress plugins designed to supercharge websites with AI capabilities, optimization features, and more. Created by Jordy Meow, a software engineer and photographer based in Tokyo, the plugins aim to enhance productivity and user experience on WordPress platforms. With a focus on optimization, imagery, and AI integration, Meow Apps offers a range of tools to elevate content, automate social posts, clean databases, manage media files, and add AI features like chatbots and content generation. The plugins are known for their friendly user interface, extensive features, and support for databases of all sizes. Meow Apps strives for perfection by providing high-quality tools that can transform the WordPress experience for users.
Visual Studio Marketplace
The Visual Studio Marketplace is a platform where users can find and publish extensions for Visual Studio family of products, such as Visual Studio, Visual Studio Code, and Azure DevOps. It offers a wide range of extensions to enhance development workflows and productivity. Users can explore and install various tools, themes, and integrations to customize their development environment.
Tactiq
Tactiq is a live transcription and AI summary tool for Google Meet, Zoom, and MS Teams. It provides real-time transcriptions, speaker identification, and AI-powered insights to help users focus on the meeting and take effective notes. Tactiq also offers one-click AI actions, such as generating meeting summaries, crafting follow-up emails, and formatting project updates, to streamline post-meeting workflows.
Colorcinch
Colorcinch is an online photo editor and AI cartoonizer that allows users to easily edit and transform their photos into artwork. It offers a wide range of features, including background removal, image cropping and resizing, color adjustment, and the ability to add filters and effects. Colorcinch also has a large library of stock photography, graphics, and icons that users can use to enhance their photos. The platform is available online and offline, making it easy for users to access their projects from anywhere.
Machinet
Machinet is an AI Agent designed for full-stack software developers. It serves as an AI-based IDE that assists developers in various tasks, such as code generation, terminal access, front-end debugging, architecture suggestions, refactoring, and mentoring. The tool aims to enhance productivity and streamline the development workflow by providing intelligent assistance and support throughout the coding process. Machinet prioritizes security and privacy, ensuring that user data is encrypted, secure, and never stored for training purposes.
Shakespeare Toolbar
Shakespeare Toolbar is an AI-powered writing tool that helps you write better and faster. It is available as a Chrome extension and can be used on any website. With Shakespeare Toolbar, you can rephrase emails, summarize documents, write social media posts, and more. It supports over 10 languages and is available for a one-time purchase of $49.
GPTConsole
GPTConsole is an AI-powered platform that helps developers build production-ready applications faster and more efficiently. Its AI agents can generate code for a variety of applications, including web applications, AI applications, and landing pages. GPTConsole also offers a range of features to help developers build and maintain their applications, including an AI agent that can learn your entire codebase and answer your questions, and a CLI tool for accessing agents directly from the command line.
Stable Diffusion
Stable Diffusion is an AI art generation tool that allows users to create high-quality images from text descriptions. It offers a user-friendly platform for both beginners and experts to explore AI art creation without deep technical knowledge. The tool excels in producing complex, detailed, and customizable images, making it ideal for artists, designers, and anyone looking to integrate AI into their creative process. Stable Diffusion provides unprecedented creative freedom through features like image generation, inpainting, outpainting, and text-guided image-to-image translation.
Pinokio
Pinokio is a browser that enables users to easily install, run, and control various AI applications on their computer with just one click. It provides a platform for exploring, learning, and sharing scripts developed by the community, allowing users to access a wide range of AI tools and applications effortlessly.
Remodel AI
Remodel AI is an innovative AI application that allows users to renovate their homes with ease. By simply taking photos of their home's interior or exterior, users can instantly visualize fully remodeled versions, new flooring, different walls, and more. The app leverages artificial intelligence to provide various interior design styles and architecture options for users to choose from. With features like interior and exterior remodeling, new flooring installation, wall painting, landscaping visualization, and object reskinning, Remodel AI offers a comprehensive solution for home renovation enthusiasts. The app has received accolades for its user-friendly interface and ability to transform home design ideas into reality.
75 Wbet Com Daftar : OLX500
75 Wbet Com Daftar : OLX500 is an online platform offering a variety of casino games, including slots, live casino, poker, and more. Users can access popular games like Sweet Bonanza, Mahjong Ways, and Gates of Olympus. The platform also provides guidance on how to install the app on Android devices. With a focus on responsible gambling, 75 Wbet Com Daftar : OLX500 aims to enhance the gaming experience for its users.
20 - Open Source AI Tools
BitMat
BitMat is a Python package designed to optimize matrix multiplication operations by utilizing custom kernels written in Triton. It leverages the principles outlined in the "1bit-LLM Era" paper, specifically utilizing packed int8 data to enhance computational efficiency and performance in deep learning and numerical computing tasks.
MInference
MInference is a tool designed to accelerate pre-filling for long-context Language Models (LLMs) by leveraging dynamic sparse attention. It achieves up to a 10x speedup for pre-filling on an A100 while maintaining accuracy. The tool supports various decoding LLMs, including LLaMA-style models and Phi models, and provides custom kernels for attention computation. MInference is useful for researchers and developers working with large-scale language models who aim to improve efficiency without compromising accuracy.
fsdp_qlora
The fsdp_qlora repository provides a script for training Large Language Models (LLMs) with Quantized LoRA and Fully Sharded Data Parallelism (FSDP). It integrates FSDP+QLoRA into the Axolotl platform and offers installation instructions for dependencies like llama-recipes, fastcore, and PyTorch. Users can finetune Llama-2 70B on Dual 24GB GPUs using the provided command. The script supports various training options including full params fine-tuning, LoRA fine-tuning, custom LoRA fine-tuning, quantized LoRA fine-tuning, and more. It also discusses low memory loading, mixed precision training, and comparisons to existing trainers. The repository addresses limitations and provides examples for training with different configurations, including BnB QLoRA and HQQ QLoRA. Additionally, it offers SLURM training support and instructions for adding support for a new model.
prime
Prime is a framework for efficient, globally distributed training of AI models over the internet. It includes features such as fault-tolerant training with ElasticDeviceMesh, asynchronous distributed checkpointing, live checkpoint recovery, custom Int8 All-Reduce Kernel, maximizing bandwidth utilization, PyTorch FSDP2/DTensor ZeRO-3 implementation, and CPU off-loading. The framework aims to optimize communication, checkpointing, and bandwidth utilization for large-scale AI model training.
lorax
LoRAX is a framework that allows users to serve thousands of fine-tuned models on a single GPU, dramatically reducing the cost of serving without compromising on throughput or latency. It features dynamic adapter loading, heterogeneous continuous batching, adapter exchange scheduling, optimized inference, and is ready for production with prebuilt Docker images, Helm charts for Kubernetes, Prometheus metrics, and distributed tracing with Open Telemetry. LoRAX supports a number of Large Language Models as the base model including Llama, Mistral, and Qwen, and any of the linear layers in the model can be adapted via LoRA and loaded in LoRAX.
llama.cpp
llama.cpp is a C++ implementation of LLaMA, a large language model from Meta. It provides a command-line interface for inference and can be used for a variety of tasks, including text generation, translation, and question answering. llama.cpp is highly optimized for performance and can be run on a variety of hardware, including CPUs, GPUs, and TPUs.
KsanaLLM
KsanaLLM is a high-performance engine for LLM inference and serving. It utilizes optimized CUDA kernels for high performance, efficient memory management, and detailed optimization for dynamic batching. The tool offers flexibility with seamless integration with popular Hugging Face models, support for multiple weight formats, and high-throughput serving with various decoding algorithms. It enables multi-GPU tensor parallelism, streaming outputs, and an OpenAI-compatible API server. KsanaLLM supports NVIDIA GPUs and Huawei Ascend NPU, and seamlessly integrates with verified Hugging Face models like LLaMA, Baichuan, and Qwen. Users can create a docker container, clone the source code, compile for Nvidia or Huawei Ascend NPU, run the tool, and distribute it as a wheel package. Optional features include a model weight map JSON file for models with different weight names.
exllamav2
ExLlamaV2 is an inference library for running local LLMs on modern consumer GPUs. It is a faster, better, and more versatile codebase than its predecessor, ExLlamaV1, with support for a new quant format called EXL2. EXL2 is based on the same optimization method as GPTQ and supports 2, 3, 4, 5, 6, and 8-bit quantization. It allows for mixing quantization levels within a model to achieve any average bitrate between 2 and 8 bits per weight. ExLlamaV2 can be installed from source, from a release with prebuilt extension, or from PyPI. It supports integration with TabbyAPI, ExUI, text-generation-webui, and lollms-webui. Key features of ExLlamaV2 include: - Faster and better kernels - Cleaner and more versatile codebase - Support for EXL2 quantization format - Integration with various web UIs and APIs - Community support on Discord
aici
The Artificial Intelligence Controller Interface (AICI) lets you build Controllers that constrain and direct output of a Large Language Model (LLM) in real time. Controllers are flexible programs capable of implementing constrained decoding, dynamic editing of prompts and generated text, and coordinating execution across multiple, parallel generations. Controllers incorporate custom logic during the token-by-token decoding and maintain state during an LLM request. This allows diverse Controller strategies, from programmatic or query-based decoding to multi-agent conversations to execute efficiently in tight integration with the LLM itself.
ztachip
ztachip is a RISCV accelerator designed for vision and AI edge applications, offering up to 20-50x acceleration compared to non-accelerated RISCV implementations. It features an innovative tensor processor hardware to accelerate various vision tasks and TensorFlow AI models. ztachip introduces a new tensor programming paradigm for massive processing/data parallelism. The repository includes technical documentation, code structure, build procedures, and reference design examples for running vision/AI applications on FPGA devices. Users can build ztachip as a standalone executable or a micropython port, and run various AI/vision applications like image classification, object detection, edge detection, motion detection, and multi-tasking on supported hardware.
hqq
HQQ is a fast and accurate model quantizer that skips the need for calibration data. It's super simple to implement (just a few lines of code for the optimizer). It can crunch through quantizing the Llama2-70B model in only 4 minutes! 🚀
vasttools
This repository contains a collection of tools that can be used with vastai. The tools are free to use, modify and distribute. If you find this useful and wish to donate your welcome to send your donations to the following wallets. BTC 15qkQSYXP2BvpqJkbj2qsNFb6nd7FyVcou XMR 897VkA8sG6gh7yvrKrtvWningikPteojfSgGff3JAUs3cu7jxPDjhiAZRdcQSYPE2VGFVHAdirHqRZEpZsWyPiNK6XPQKAg RVN RSgWs9Co8nQeyPqQAAqHkHhc5ykXyoMDUp USDT(ETH ERC20) 0xa5955cf9fe7af53bcaa1d2404e2b17a1f28aac4f Paypal PayPal.Me/cryptolabsZA
basalt
Basalt is a lightweight and flexible CSS framework designed to help developers quickly build responsive and modern websites. It provides a set of pre-designed components and utilities that can be easily customized to create unique and visually appealing web interfaces. With Basalt, developers can save time and effort by leveraging its modular structure and responsive design principles to create professional-looking websites with ease.
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable by co-designing the frontend language and the runtime system. The core features of SGLang include: - **A Flexible Front-End Language**: This allows for easy programming of LLM applications with multiple chained generation calls, advanced prompting techniques, control flow, multiple modalities, parallelism, and external interaction. - **A High-Performance Runtime with RadixAttention**: This feature significantly accelerates the execution of complex LLM programs by automatic KV cache reuse across multiple calls. It also supports other common techniques like continuous batching and tensor parallelism.
LLamaTuner
LLamaTuner is a repository for the Efficient Finetuning of Quantized LLMs project, focusing on building and sharing instruction-following Chinese baichuan-7b/LLaMA/Pythia/GLM model tuning methods. The project enables training on a single Nvidia RTX-2080TI and RTX-3090 for multi-round chatbot training. It utilizes bitsandbytes for quantization and is integrated with Huggingface's PEFT and transformers libraries. The repository supports various models, training approaches, and datasets for supervised fine-tuning, LoRA, QLoRA, and more. It also provides tools for data preprocessing and offers models in the Hugging Face model hub for inference and finetuning. The project is licensed under Apache 2.0 and acknowledges contributions from various open-source contributors.
ktransformers
KTransformers is a flexible Python-centric framework designed to enhance the user's experience with advanced kernel optimizations and placement/parallelism strategies for Transformers. It provides a Transformers-compatible interface, RESTful APIs compliant with OpenAI and Ollama, and a simplified ChatGPT-like web UI. The framework aims to serve as a platform for experimenting with innovative LLM inference optimizations, focusing on local deployments constrained by limited resources and supporting heterogeneous computing opportunities like GPU/CPU offloading of quantized models.
aif
Arno's Iptables Firewall (AIF) is a single- & multi-homed firewall script with DSL/ADSL support. It is a free software distributed under the GNU GPL License. The script provides a comprehensive set of configuration files and plugins for setting up and managing firewall rules, including support for NAT, load balancing, and multirouting. It offers detailed instructions for installation and configuration, emphasizing security best practices and caution when modifying settings. The script is designed to protect against hostile attacks by blocking all incoming traffic by default and allowing users to configure specific rules for open ports and network interfaces.
LLaMA-Factory
LLaMA Factory is a unified framework for fine-tuning 100+ large language models (LLMs) with various methods, including pre-training, supervised fine-tuning, reward modeling, PPO, DPO and ORPO. It features integrated algorithms like GaLore, BAdam, DoRA, LongLoRA, LLaMA Pro, LoRA+, LoftQ and Agent tuning, as well as practical tricks like FlashAttention-2, Unsloth, RoPE scaling, NEFTune and rsLoRA. LLaMA Factory provides experiment monitors like LlamaBoard, TensorBoard, Wandb, MLflow, etc., and supports faster inference with OpenAI-style API, Gradio UI and CLI with vLLM worker. Compared to ChatGLM's P-Tuning, LLaMA Factory's LoRA tuning offers up to 3.7 times faster training speed with a better Rouge score on the advertising text generation task. By leveraging 4-bit quantization technique, LLaMA Factory's QLoRA further improves the efficiency regarding the GPU memory.
20 - OpenAI Gpts
S22 Flip Advisor
Expert on Cat S22 FLIP rooting and custom ROMs, with a broad internet research scope.
Browser Extension Generator
Create browser extensions for web tasks to boost your productivity. Or jumpstart a more advanced extension idea. You'll get a full package download ready to install in your Chrome or Edge browser. 📂 v1.2 _____ _____ What do you want to build? _____
FlutterCraft
FlutterCraft is an AI-powered assistant that streamlines Flutter app development. It interprets user-provided descriptions to generate and compile Flutter app code, providing ready-to-install APK and iOS files. Ideal for rapid prototyping, FlutterCraft makes app development accessible and efficient.
BioinformaticsManual
Compile instructions from the web and github for bioinformatics applications. Receive line-by-line instructions and commands to get started
Ciepły montaż okien
Firma Grupa Magnum specjalizuje się w sprzedaży akcesoriów do ciepłego montażu okien i drzwi. Oferują bogaty wybór narzędzi i akcesoriów, które są niezbędne do prawidłowego montażu stolarki okiennej i drzwiowej.
Throw a Wrench In Your Plans GPT
As "Throw a Wrench in Your Plans GPT", I provide expert guidance on skilled trades and AI adoption, inspired by TWYP Media