Best AI tools for Speedup Evaluation
20 - AI tool Sites
Career Copilot
Career Copilot is an AI-powered hiring tool that helps recruiters and hiring managers find the best candidates for their open positions. The tool uses machine learning to analyze candidate profiles and identify those who are most qualified for the job. Career Copilot also provides a number of features to help recruiters streamline the hiring process, such as candidate screening, interview scheduling, and offer management.
Sereda.ai
Sereda.ai is an AI-powered platform designed to unleash a team's potential by bringing together all documents and knowledge into one place, conducting employee surveys and satisfaction ratings, facilitating performance reviews, and providing solutions to increase team productivity. The platform offers features such as a knowledge base, employee surveys, performance review tools, interactive learning courses, and an AI assistant for instant answers. Sereda.ai aims to streamline HR processes, improve employee training and evaluation, and enhance overall team productivity.
SOMA
SOMA is a Research Automation Platform that accelerates medical innovation by providing up to 100x speedup through process automation. The platform analyzes medical research articles, extracts important concepts, and identifies causal and associative relationships between them. It organizes this information into a specialized database forming a knowledge graph. Researchers can retrieve causal chains, access specific research articles, and perform tasks like concept analysis, drug repurposing, and target discovery. SOMA enhances literature review efficiency by finding relevant articles based on causal chains and keywords specified by the user. It empowers researchers to focus on their research by saving up to 95% of the time spent on pre-processing documents. The platform offers freemium access with extended functionality for 14 days and advanced features available through subscription.
CodeParrot
CodeParrot is an AI tool designed to speed up frontend development tasks by generating production-ready frontend components from Figma design files using Large Language Models. It helps developers reduce UI development time, improve code quality, and focus on more creative tasks. CodeParrot offers customization options, support for frameworks like React, Vue, and Angular, and integrates seamlessly into various workflows, making it a must-have tool for developers looking to enhance their frontend development process.
DoMore.ai
DoMore.ai is a personalized AI tools catalog that offers a wide range of AI-powered tools to enhance productivity, creativity, and efficiency. With DoMore.ai, users can access a curated collection of AI tools tailored to their specific needs and preferences. The platform provides detailed descriptions, ratings, and reviews of each tool, making it easy for users to find the right tool for the job. DoMore.ai also offers a personalized recommendation engine that suggests tools based on user preferences and usage patterns. Whether you're a creative professional, a business owner, or a student, DoMore.ai has the tools you need to achieve your goals.
V7
V7 is an AI data engine for computer vision and generative AI. It provides a multimodal automation tool that helps users label data 10x faster, power AI products via API, build AI + human workflows, and reach 99% AI accuracy. V7's platform includes features such as automated annotation, DICOM annotation, dataset management, model management, image annotation, video annotation, document processing, and labeling services.
Osium AI
Osium AI is a cutting-edge AI-powered software designed to accelerate the development of sustainable and high-performance materials and chemicals. The platform leverages proprietary technology developed by experts with 10 years of experience in AI and authors of multiple AI patents. Osium AI offers a comprehensive solution that covers every step of materials and chemicals development cycles, from formulation and characterization to scale-up and manufacturing. The software is flexible, adaptable to various R&D projects, and eliminates trial-and-error approaches, unlocking the full potential of R&D with its advanced functionalities.
AI-SYNT
AI-SYNT is a digital copy trained on your content. It lets you insert humans, products, or characters into generated scenes and aims to grow your engagement rate by up to 4x.
Promptmate
Promptmate.io is an AI-powered app builder that allows users to create customized applications based on leading AI systems. With Promptmate, users can combine different AI systems, add external data, and automate processes to streamline their workflows. The platform offers a range of features, including pre-built app templates, bulk processing, and data extenders, making it easy for users to build and deploy AI-powered applications without the need for coding.
ONNX Runtime
ONNX Runtime is a production-grade AI engine designed to accelerate machine learning training and inferencing in various technology stacks. It supports multiple languages and platforms, optimizing performance for CPU, GPU, and NPU hardware. ONNX Runtime powers AI in Microsoft products and is widely used in cloud, edge, web, and mobile applications. It also enables large model training and on-device training, offering state-of-the-art models for tasks like image synthesis and text generation.
ioni.ai
ioni.ai is an AI application that offers a ChatGPT-4 solution for customer support. It is a smart chatbot based on the latest AI technology, designed to handle general inquiries, complex questions, and user-specific requests. The application streamlines workflows with immediate responses, aims to raise CSAT scores, and includes human-in-the-loop verification for quality control. With self-learning capabilities, ioni.ai continually improves its responses to customer inquiries.
Checkr
Checkr is an employee background screening platform for companies, offering a range of services such as criminal background checks, employment verification, driving record checks, drug testing, and more. The platform aims to streamline the hiring process by providing fast and accurate background screening solutions. Checkr also focuses on fair chance hiring, helping qualified candidates with records find job opportunities while assisting companies in implementing diversity, equity, and inclusion practices.
Tolgee
Tolgee is a smart platform for painless localization, offering fast integration and dev tools for translation assistance, autonomous translation, and collaboration. It supports in-context translating, translation memory, machine translations, auto translation, and activity log features. Tolgee is used by 8500+ users worldwide and provides integrations for all major JS frameworks.
Remy
Remy is an AI-powered platform designed to help product security and compliance teams resolve security risks early. It offers a scalable design review solution that automates the identification and triage of high-impact engineering proposals, providing full visibility and reducing cost, risk, and time associated with security design reviews. Remy streamlines review processes, generates AI-based questions, and offers clear metrics and audit trails to enhance security practices. The platform is enterprise-ready, offering SSO for convenient logins, scalability, and customization to meet diverse enterprise needs.
SADESIGN RETOUCH PANEL
SADESIGN RETOUCH PANEL is a smart Photoshop Plugin with more than 600 powerful functions, fully integrated with automatic features such as mass color correction, automatic skinning, acne removal, face slimming, leg lengthening, makeup, and more. It includes valuable resource libraries and eliminates the need for additional software. The tool offers advanced technology for automated photo editing, making it a go-to solution for designers and photographers.
Automateed
Automateed is an all-in-one AI eBook creator that helps you create unique and professional-quality eBooks in minutes. With Automateed, you can generate unique book content, design beautiful eBook covers, and even get marketing tasks done for you. It's the perfect tool for authors, marketers, and anyone who wants to create high-quality eBooks quickly and easily.
Inkdrop
Inkdrop is an AI-powered tool that helps users visualize their cloud infrastructure by automatically generating interactive diagrams of cloud resources and dependencies. It provides a comprehensive overview of the infrastructure to speed up onboarding and to clarify complex resource relationships for effective troubleshooting. Documentation can be kept up to date through CI pipeline integration. Inkdrop was founded by Antoine Descamps (cofounder and CEO) and Alberto Schillaci (cofounder and CTO).
Streos
Streos is an AI-powered platform that enables users to build websites effortlessly and download them for free. The platform offers a seamless experience by generating complete websites, pages, and components based on user input. Users can easily customize and modify elements to match their vision, and deploy their website to a custom domain with just a few clicks. Streos aims to revolutionize web design by providing an intelligent and efficient AI Assistant that simplifies the website creation process.
Aftershoot
Aftershoot is an AI culling and editing software designed for professional photographers to streamline their post-processing workflow. The application leverages AI technology to assist users in culling and editing large volumes of photos efficiently, saving time and enhancing productivity. Aftershoot offers AI-assisted culling and editing features, allowing photographers to train personal AI editing profiles, speed up the culling process, and export photos seamlessly. With a focus on simplicity and automation, Aftershoot aims to empower photographers to concentrate on their creative vision and important aspects of their work.
HelpBar
HelpBar is an AI tool designed for SaaS companies to enhance user experience by providing universal search, navigation, and AI-powered answers. HelpBar allows users to easily set up self-serve support, customize brand styles, and create trigger widgets for quick access to relevant content. It helps speed up time to value, create power users, reduce support costs, and keep users engaged within the app by surfacing favorite tools and resources.
20 - Open Source AI Tools
wanda
Official PyTorch implementation of Wanda (Pruning by Weights and Activations), a simple and effective pruning approach for large language models. The pruning approach removes weights on a per-output basis, by the product of weight magnitudes and input activation norms. The repository provides support for various features such as LLaMA-2, ablation study on OBS weight update, zero-shot evaluation, and speedup evaluation. Users can replicate main results from the paper using provided bash commands. The tool aims to enhance the efficiency and performance of language models through structured and unstructured sparsity techniques.
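The scoring rule described above (weight magnitude times input activation norm, compared within each output row) can be sketched in a few lines. This is a minimal pure-Python illustration, not the repository's PyTorch code; the function name `wanda_prune`, the toy matrices, and the 50% sparsity level are assumptions made for the example.

```python
import math

def wanda_prune(weights, activations, sparsity=0.5):
    """Zero the lowest-scoring weights in each output row.

    weights:     list of rows, shape (out_features, in_features)
    activations: calibration inputs, shape (n_samples, in_features)
    """
    n_in = len(weights[0])
    # L2 norm of each input feature across the calibration samples.
    norms = [math.sqrt(sum(x[j] ** 2 for x in activations)) for j in range(n_in)]
    n_prune = int(n_in * sparsity)
    pruned = []
    for row in weights:
        # Importance score: |weight| * norm of the matching input feature.
        scores = [abs(w) * norms[j] for j, w in enumerate(row)]
        # Indices of the n_prune smallest scores within this output row.
        drop = set(sorted(range(n_in), key=lambda j: scores[j])[:n_prune])
        pruned.append([0.0 if j in drop else w for j, w in enumerate(row)])
    return pruned

# Toy data (hypothetical): input feature 1 is barely active, feature 2 is very active.
W = [[0.5, -2.0, 0.1, 1.0],
     [3.0, 0.2, -0.4, 0.05]]
X = [[1.0, 0.1, 10.0, 1.0],
     [1.0, 0.1, 10.0, 1.0]]
pruned_W = wanda_prune(W, X, sparsity=0.5)
```

In the first row the weight -2.0 is removed even though it has the largest magnitude, because its input feature is nearly inactive, while the small weight 0.1 survives because its input is large. Pure magnitude pruning would make the opposite choice.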
Consistency_LLM
Consistency Large Language Models (CLLMs) are a family of parallel decoders that reduce inference latency by decoding multiple tokens in parallel. The models are trained to perform efficient Jacobi decoding, mapping any randomly initialized token sequence to the same result as auto-regressive decoding in as few steps as possible. CLLMs have shown significant improvements in generation speed on various tasks, achieving up to 3.4 times faster generation. They integrate with other techniques for efficient Large Language Model (LLM) inference and need no draft models or architectural modifications.
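Jacobi decoding can be illustrated with a toy model: guess all n tokens at once, refresh every position in parallel from the previous guess, and stop when nothing changes. The sketch below is a simplified illustration under stated assumptions; `next_token` is a hypothetical deterministic stand-in for greedy LLM decoding, not part of the CLLM code.

```python
def next_token(prefix):
    """Hypothetical stand-in for greedy decoding: a deterministic
    function of the prefix (here, of its last two tokens)."""
    return (prefix[-1] * 3 + prefix[-2] + 1) % 11

def autoregressive_decode(prompt, n):
    """Baseline: generate n tokens one at a time."""
    seq = list(prompt)
    for _ in range(n):
        seq.append(next_token(seq))
    return seq[len(prompt):]

def jacobi_decode(prompt, n, init=None):
    """Fixed-point iteration over all n positions at once."""
    guess = list(init) if init is not None else [0] * n
    steps = 0
    while True:
        steps += 1
        # Refresh every position from the previous guess of the tokens before it.
        # With a real model, all n updates would come from one batched forward pass.
        new = [next_token(list(prompt) + guess[:i]) for i in range(n)]
        if new == guess:  # fixed point reached: identical to greedy decoding
            return guess, steps
        guess = new

prompt = [2, 5]
reference = autoregressive_decode(prompt, 6)
tokens, steps = jacobi_decode(prompt, 6)
```

Each iteration fixes at least one more leading token, so an untrained model needs up to n + 1 iterations (the last one only confirms convergence). The training objective described above aims to make the model jump to the fixed point in far fewer iterations.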
prometheus-eval
Prometheus-Eval is a repository dedicated to evaluating large language models (LLMs) on generation tasks. It provides state-of-the-art evaluator language models such as Prometheus 2 (7B & 8x7B), which assess responses in pairwise ranking formats and achieve high correlation scores with benchmarks. The repository includes tools for training, evaluating, and using these models, along with scripts for fine-tuning on custom datasets. Prometheus aims to address issues like fairness, controllability, and affordability in evaluations by simulating human judgments and proprietary LM-based assessments.
TriForce
TriForce is a training-free tool designed to accelerate long sequence generation. It supports long-context Llama models and offers both on-chip and offloading capabilities. Users can achieve a 2.2x speedup on a single A100 GPU. TriForce also provides options for offloading with tensor parallelism or without it, catering to different hardware configurations. The tool includes a baseline for comparison and is optimized for performance on RTX 4090 GPUs. Users can cite the associated paper if they find TriForce useful for their projects.
T-MAC
T-MAC is a kernel library that directly supports mixed-precision matrix multiplication without the need for dequantization by utilizing lookup tables. It aims to boost low-bit LLM inference on CPUs by offering support for various low-bit models. T-MAC achieves significant speedup compared to SOTA CPU low-bit framework (llama.cpp) and can even perform well on lower-end devices like Raspberry Pi 5. The tool demonstrates superior performance over existing low-bit GEMM kernels on CPU, reduces power consumption, and provides energy savings. It achieves comparable performance to CUDA GPU on certain tasks while delivering considerable power and energy savings. T-MAC's method involves using lookup tables to support mpGEMM and employs key techniques like precomputing partial sums, shift and accumulate operations, and utilizing tbl/pshuf instructions for fast table lookup.
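The lookup-table idea can be sketched as follows: split the low-bit weights into one-bit planes, precompute every possible partial sum for each small group of activations, then replace multiplications with table lookups followed by shift-and-accumulate. This is a minimal pure-Python illustration, not T-MAC's kernel; the group size `G = 4`, unsigned 2-bit weights, integer activations, and the function names are assumptions made for the example.

```python
G = 4  # activations per lookup group (an assumption for this sketch)

def build_tables(activations):
    """For each group of G activations, precompute the partial sum for
    every possible G-bit pattern (2**G entries per group)."""
    tables = []
    for start in range(0, len(activations), G):
        group = activations[start:start + G]
        tables.append([
            sum(a for k, a in enumerate(group) if (pattern >> k) & 1)
            for pattern in range(1 << G)
        ])
    return tables

def lut_dot(weights, activations, bits=2):
    """Dot product of unsigned `bits`-bit integer weights with integer
    activations using only table lookups, shifts, and adds."""
    tables = build_tables(activations)
    total = 0
    for b in range(bits):  # one pass per weight bit plane
        plane_sum = 0
        for g, table in enumerate(tables):
            chunk = weights[g * G:(g + 1) * G]
            # Pack bit b of each weight in the group into a table index.
            index = 0
            for k, w in enumerate(chunk):
                index |= ((w >> b) & 1) << k
            plane_sum += table[index]
        total += plane_sum << b  # shift and accumulate
    return total

weights = [3, 0, 1, 2, 2, 2, 0, 1]          # 2-bit values in 0..3
activations = [5, -3, 7, 2, -1, 4, 6, -8]   # e.g. int8 activations
result = lut_dot(weights, activations)
```

The tables depend only on the activations, so in a matrix multiplication they are built once and reused for every weight row, which is where the savings come from. The weights are never dequantized.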
LLM-Pruner
LLM-Pruner is a tool for structural pruning of large language models, allowing task-agnostic compression while retaining multi-task solving ability. It supports automatic structural pruning of various LLMs with minimal human effort. The tool is efficient, requiring only 3 minutes for pruning and 3 hours for post-training. Supported LLMs include Llama-3.1, Llama-3, Llama-2, LLaMA, BLOOM, Vicuna, and Baichuan. Updates include support for models that use grouped-query attention (GQA) and for BLOOM, as well as fine-tuning results achieving high accuracy. The tool provides step-by-step instructions for pruning, post-training, and evaluation, along with a Gradio interface for text generation. Limitations include repetitive or nonsensical tokens generated by compressed models and manual operations required for certain models.
airllm
AirLLM is a tool that optimizes inference memory usage, enabling large language models to run on low-end GPUs without quantization, distillation, or pruning. It supports models like Llama3.1 on 8GB VRAM. The tool offers model compression for up to 3x inference speedup with minimal accuracy loss. Users can specify compression levels, profiling modes, and other configurations when initializing models. AirLLM also supports prefetching and disk space management. It provides examples and notebooks for easy implementation and usage.
awesome-transformer-nlp
This repository contains a hand-curated list of great machine (deep) learning resources for Natural Language Processing (NLP) with a focus on Generative Pre-trained Transformer (GPT), Bidirectional Encoder Representations from Transformers (BERT), attention mechanism, Transformer architectures/networks, Chatbot, and transfer learning in NLP.
neural-speed
Neural Speed is an innovative library designed to support the efficient inference of large language models (LLMs) on Intel platforms through state-of-the-art (SOTA) low-bit quantization powered by Intel Neural Compressor. The work is inspired by llama.cpp and further optimized for Intel platforms with innovations presented at NeurIPS 2023.
PowerInfer
PowerInfer is a high-speed Large Language Model (LLM) inference engine designed for local deployment on consumer-grade hardware, leveraging activation locality to optimize efficiency. It features a locality-centric design, hybrid CPU/GPU utilization, easy integration with popular ReLU-sparse models, and support for various platforms. PowerInfer achieves high speed with lower resource demands and is flexible for easy deployment and compatibility with existing models like Falcon-40B, Llama2 family, ProSparse Llama2 family, and Bamboo-7B.
qserve
QServe is a serving system designed for efficient and accurate Large Language Model (LLM) inference on GPUs with W4A8KV4 quantization. It achieves higher throughput than leading industry solutions, allowing users to reach A100-level throughput on cheaper L40S GPUs. The system introduces the QoQ quantization algorithm with 4-bit weights, 8-bit activations, and a 4-bit KV cache, addressing runtime overhead challenges. QServe improves serving throughput for various LLMs through compute-aware weight reordering, register-level parallelism, and keeping fused attention memory-bound.
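To make the "W4A8" part of the name concrete, the sketch below quantizes weights to 4-bit and activations to 8-bit signed integers, accumulates in integers, and rescales once at the end. It uses plain symmetric per-tensor quantization as a generic stand-in; it is not the QoQ algorithm, and the function names and sample values are assumptions made for the example.

```python
def quantize(values, bits):
    """Symmetric quantization: map floats to signed integers of `bits` bits."""
    qmax = (1 << (bits - 1)) - 1            # 7 for 4-bit, 127 for 8-bit
    scale = max(abs(v) for v in values) / qmax or 1.0  # avoid a zero scale
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale

def w4a8_dot(weights, activations):
    """Dot product with 4-bit weights and 8-bit activations."""
    qw, sw = quantize(weights, 4)
    qa, sa = quantize(activations, 8)
    acc = sum(w * a for w, a in zip(qw, qa))  # integer multiply-accumulate
    return acc * sw * sa                      # single floating-point rescale

weights = [0.7, -0.3, 0.1, 0.0]
activations = [1.27, -0.5, 0.25, 1.0]
approx = w4a8_dot(weights, activations)
exact = sum(w * a for w, a in zip(weights, activations))
```

The inner loop touches only small integers, which is what allows integer tensor-core arithmetic on a GPU; the accuracy cost is the rounding error introduced by `quantize`.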
marlin
Marlin is a highly optimized FP16xINT4 matmul kernel designed for large language model (LLM) inference, offering close to ideal speedups up to batch sizes of 16-32 tokens. It is suitable for larger-scale serving, speculative decoding, and advanced multi-inference schemes like CoT-Majority. Marlin achieves this performance by combining various techniques and optimizations to fully leverage GPU resources, ensuring efficient computation and memory management.
Awesome-LLM-Compression
A curated list of LLM compression research papers and tools for accelerating LLM training and inference.
MiniCPM-V
MiniCPM-V is a series of end-side multimodal LLMs designed for vision-language understanding. The models take image and text inputs and produce high-quality text outputs. The series includes MiniCPM-Llama3-V 2.5, an 8B-parameter model reported to surpass proprietary models, and MiniCPM-V 2.0, a lighter model with 2B parameters. The models support over 30 languages, deploy efficiently on end-side devices, and have strong OCR capabilities. They achieve state-of-the-art performance on various benchmarks, reduce hallucinations in text generation, and process high-resolution images efficiently.
Awesome-Quantization-Papers
This repo contains a comprehensive paper list on Model Quantization for efficient deep learning, drawn from AI conferences, journals, and arXiv. As a highlight, the papers are categorized by model structure and application scenario, and the quantization methods are labeled with keywords.
Qwen
Qwen is a series of large language models developed by Alibaba Cloud. Qwen models outperform baseline models of similar size on a range of benchmark datasets, e.g., MMLU, C-Eval, GSM8K, MATH, HumanEval, MBPP, and BBH, which evaluate capabilities such as natural language understanding, mathematical problem solving, and coding. Qwen-72B achieves better performance than LLaMA2-70B on all tasks and outperforms GPT-3.5 on 7 out of 10 tasks.
6 - OpenAI GPTs
How To Make Your Computer Faster: Speed Up Your PC
A Guide To Speed Up Your Computer from Geeks On Command Computer Repair Company
Deal Architect
Designing Strategic M&A Blueprints for Success in buying, selling or merging companies. Use this GPT to simplify, speed up and improve the quality of the M&A process. With custom data: 100s of creative options in deal flow, deal structuring, financing and more. Version 2.2 - 28012024
FIX-MY-TECK
Repairing is better, and it pays off. FIX-MY-TECK gives you step-by-step guidance for repairing your own electronics, computers, and more.