Best AI tools for< Score Llms >
20 - AI tool Sites
Clearbit
Clearbit is a B2B marketing intelligence platform that provides data enrichment, scoring, routing, and buying intent signals. It is powered by artificial intelligence and is the first HubSpot Native Data Provider. Clearbit's data foundation is built on public data from the web, proprietary data, and the power of LLMs to convert unstructured information into precise and standardized data sets. This data can be used to enrich leads, contacts, and accounts, and to identify hidden buying intent. Clearbit also offers a variety of features to help businesses score and route leads, and to create better converting forms.
GPTs.Fan
GPTs.Fan is a comprehensive platform dedicated to GPT designers, providing a wealth of resources and support. It offers a vibrant community forum where designers can connect, share knowledge, and collaborate on projects. Additionally, GPTs.Fan features a curated collection of GPT-related tools, tutorials, and articles, empowering designers to stay up-to-date with the latest advancements in the field.
Code Snippets AI
Code Snippets AI is an AI-powered code snippets library for teams. It helps developers master their codebase with contextually-rich AI chats, integrated with a secure code snippets library. Developers can build new features, fix bugs, add comments, and understand their codebase with the help of Code Snippets AI. The tool is trusted by the best development teams and helps developers code smarter than ever. With Code Snippets AI, developers can leverage the power of a codebase aware assistant, helping them write clean, performance optimized code. They can also create documentation, refactor, debug and generate code with full codebase context. This helps developers spend more time creating code and less time debugging errors.
JFrog ML
JFrog ML is an AI platform designed to streamline AI development from prototype to production. It offers a unified MLOps platform to build, train, deploy, and manage AI workflows at scale. With features like Feature Store, LLMOps, and model monitoring, JFrog ML empowers AI teams to collaborate efficiently and optimize AI & ML models in production.
Loata
Loata is an AI-powered platform that serves as a learning orchestrator for adaptive text analyses. It allows users to store their notes and documents in the cloud, which are then ingested and transformed into knowledge bases. The platform features smart AI agents powered by LLMs to provide intelligent answers based on the content. With end-to-end encryption and controlled ingestion, Loata ensures the security and privacy of user data. Users can choose from different subscription plans to access varying levels of storage and query capacity, making it suitable for individuals and professionals alike.
Studious Score AI
Studious Score AI is an AI-powered platform that offers knowledge and skill evaluation services supported by reputable individuals and organizations. The platform aims to revolutionize credentialing by providing a new approach. Studious Score AI is on a mission to establish itself as the global benchmark for assessing skills and knowledge in various aspects of life. Users can explore different categories and unlock their potential through the platform's innovative evaluation methods.
How Attractive Am I
How Attractive Am I is an AI-powered tool that analyzes facial features to calculate an attractiveness score. By evaluating symmetry and proportions, the tool provides personalized beauty scores. Users can upload a photo to discover their true beauty potential. The tool ensures accuracy by providing guidelines for taking photos and offers a fun and insightful way to understand facial appeal.
AI Credit Repair
AI Credit Repair is an AI-powered application designed to help users improve their credit score by providing tools such as credit card utilization builder, public records remover, dispute letter generator, and SMS alerts. The application assists users in managing their credit card payments, disputing collections, and staying on top of their credit-related activities. By leveraging artificial intelligence technology, AI Credit Repair aims to simplify the credit repair process and empower users to build a better financial future.
Humbot
Humbot is an AI-powered tool that helps you bypass AI detection by humanizing your AI-generated text. It uses advanced technology to rewrite your text in a way that makes it undetectable by most AI detectors. With Humbot, you can easily create high-quality, original content that is indistinguishable from human-written text.
SAT Reading & Writing Question Generator
The SAT Reading & Writing Question Generator is an AI-powered tool designed to help students practice for the SAT exam. It generates a wide variety of reading and writing questions to improve students' skills and boost their confidence. With its smart algorithms, the tool provides personalized practice questions tailored to each student's needs, making exam preparation more effective and efficient. The tool is user-friendly and accessible, offering a seamless experience for students to enhance their test-taking abilities.
VMock Dashboard
VMock Dashboard is an AI-powered platform designed to provide personalized feedback on resumes. It leverages machine learning algorithms to analyze resumes and offers detailed insights on areas of improvement. Users can upload their resumes, receive instant feedback on content, formatting, and overall effectiveness, and track their progress over time. VMock Dashboard aims to help individuals enhance their resumes and increase their chances of landing their desired job opportunities.
AlphaRank
AlphaRank is an AI SEO optimizer tool designed to boost website traffic by providing advanced SEO solutions. It offers a comprehensive suite of features to audit and optimize websites, leveraging AI algorithms and a vast database of keywords and merchant insights. Trusted by top-tier partners, AlphaRank aims to enhance content relevance, user engagement, and SEO precision for businesses, bloggers, and freelancers. The tool streamlines SEO operations, provides market insights, and offers expert recommendations for technical and content improvements.
ytRank.ai
ytRank.ai is an AI-powered tool designed to help YouTube creators optimize their content strategy, outsmart the competition, and accelerate channel growth. It offers a range of powerful features such as discovering trending keywords, analyzing SEO scores, generating compelling titles, and creating optimized tags. By leveraging AI technology, ytRank.ai aims to boost channel rankings, visibility, and attract more viewers. With a focus on increasing visibility, engagement, and long-term growth, ytRank.ai is a valuable tool for YouTubers seeking to enhance their online presence.
Football Predictions AI
Football Predictions AI is a website that provides users with accurate and reliable football match predictions. Users can access a variety of prediction types, including 1x2, BTTS, Over Under, and Correct Score predictions for matches across different leagues. The site also offers live scores, articles, and settings to customize the user experience. With a focus on user privacy and data protection, Football Predictions AI aims to enhance the football betting experience for enthusiasts and fans.
Scienaptic Systems
Scienaptic Systems is an AI-powered Credit Decisioning Platform that revolutionizes the lending industry by automating credit underwriting processes, leveraging alternate data points, and offering self-learning models for instant credit decisions. The platform helps lenders identify creditworthy borrowers, streamline customer experience, and ensure fair lending practices through explainable AI models. Scienaptic's suite of AI-enabled technologies enables lenders to say 'Yes' to more borrowers at lower risk, providing a 360-degree risk assessment before approving applications. The platform integrates seamlessly with existing lending ecosystems, ensuring disruption-free deployment and better risk predictions through a single API call.
ATS Friendly
ATS Friendly is a free resume checker that uses AI to help you get your resume shortlisted by Applicant Tracking Systems (ATS). With our comprehensive resume keyword scanner, our AI powered ATS Resume Checker will definitely enhance your chances of getting filtered through Applicant Tracking Systems (ATS). Our FREE ATS Friendly Resume Checker allows you to check your resume against the job posting, before applying. Our AI powered ATS Scanner will go through the keywords, hard skills, soft skills and other requirements of the job description, and provide you with a comprehensive analysis. We have helped over 100,000 job seekers to get shortlisted and hired faster. Try our Free ATS Checker now and get hired faster!
HireFlow.net
HireFlow.net is an AI-powered platform designed to optimize resumes and enhance job prospects. The website offers a free resume checker that leverages advanced Artificial Intelligence technology to provide personalized feedback and suggestions for improving resumes. Users can also access features such as CV analysis, cover letter and resignation letter generators, and expert insights to stand out in the competitive job market.
Pangea.ai
Pangea.ai is a leading talent aggregator that helps businesses hire quality technologists by comparing data points for reliable matching. It offers a unified hiring experience in a fragmented market, making it easier to compare and decide among the numerous software development agencies and talent networks available. Pangea.ai's intelligent matching system considers over 100 data points to find the best fit for businesses, while its rigorous vetting process evaluates expertise, client satisfaction, and team health. Businesses can choose to self-serve their way to a hire or opt for Pangea.ai's white-glove matching service.
Unspam
Unspam is an email spam checker and deliverability test tool that helps businesses ensure their emails land in the inbox, not spam. It offers a range of features including email spam checking, deliverability testing, email preview, AI eye-tracking heatmap, SPF, DKIM, DMARC, and more. Unspam's mission is to help businesses improve their email deliverability and engagement, and its tools and insights are designed to help users optimize their email campaigns for maximum impact.
StealthWriter
StealthWriter is an AI-powered tool that helps users rewrite AI-generated text into human-like content. It is designed to bypass AI detectors and ensure that the content is plagiarism-free and SEO-friendly. StealthWriter offers a variety of features, including the ability to humanize AI content, bypass AI detectors, generate plagiarism-free content, optimize content for SEO, and provide flawless content quality. It also offers multiple versions of humanized text and allows users to customize the transformed content.
20 - Open Source AI Tools
bench
Bench is a tool for evaluating LLMs for production use cases. It provides a standardized workflow for LLM evaluation with a common interface across tasks and use cases. Bench can be used to test whether open source LLMs can do as well as the top closed-source LLM API providers on specific data, and to translate the rankings on LLM leaderboards and benchmarks into scores that are relevant for actual use cases.
empirical
Empirical is a tool that allows you to test different LLMs, prompts, and other model configurations across all the scenarios that matter for your application. With Empirical, you can run your test datasets locally against off-the-shelf models, test your own custom models and RAG applications, view, compare, and analyze outputs on a web UI, score your outputs with scoring functions, and run tests on CI/CD.
eval-scope
Eval-Scope is a framework for evaluating and improving large language models (LLMs). It provides a set of commonly used test datasets, metrics, and a unified model interface for generating and evaluating LLM responses. Eval-Scope also includes an automatic evaluator that can score objective questions and use expert models to evaluate complex tasks. Additionally, it offers a visual report generator, an arena mode for comparing multiple models, and a variety of other features to support LLM evaluation and development.
LLaMA-Factory
LLaMA Factory is a unified framework for fine-tuning 100+ large language models (LLMs) with various methods, including pre-training, supervised fine-tuning, reward modeling, PPO, DPO and ORPO. It features integrated algorithms like GaLore, BAdam, DoRA, LongLoRA, LLaMA Pro, LoRA+, LoftQ and Agent tuning, as well as practical tricks like FlashAttention-2, Unsloth, RoPE scaling, NEFTune and rsLoRA. LLaMA Factory provides experiment monitors like LlamaBoard, TensorBoard, Wandb, MLflow, etc., and supports faster inference with OpenAI-style API, Gradio UI and CLI with vLLM worker. Compared to ChatGLM's P-Tuning, LLaMA Factory's LoRA tuning offers up to 3.7 times faster training speed with a better Rouge score on the advertising text generation task. By leveraging 4-bit quantization technique, LLaMA Factory's QLoRA further improves the efficiency regarding the GPU memory.
nlp-llms-resources
The 'nlp-llms-resources' repository is a comprehensive resource list for Natural Language Processing (NLP) and Large Language Models (LLMs). It covers a wide range of topics including traditional NLP datasets, data acquisition, libraries for NLP, neural networks, sentiment analysis, optical character recognition, information extraction, semantics, topic modeling, multilingual NLP, domain-specific LLMs, vector databases, ethics, costing, books, courses, surveys, aggregators, newsletters, papers, conferences, and societies. The repository provides valuable information and resources for individuals interested in NLP and LLMs.
llms-tools
The 'llms-tools' repository is a comprehensive collection of AI tools, open-source projects, and research related to Large Language Models (LLMs) and Chatbots. It covers a wide range of topics such as AI in various domains, open-source models, chats & assistants, visual language models, evaluation tools, libraries, devices, income models, text-to-image, computer vision, audio & speech, code & math, games, robotics, typography, bio & med, military, climate, finance, and presentation. The repository provides valuable resources for researchers, developers, and enthusiasts interested in exploring the capabilities of LLMs and related technologies.
llms
The 'llms' repository is a comprehensive guide on Large Language Models (LLMs), covering topics such as language modeling, applications of LLMs, statistical language modeling, neural language models, conditional language models, evaluation methods, transformer-based language models, practical LLMs like GPT and BERT, prompt engineering, fine-tuning LLMs, retrieval augmented generation, AI agents, and LLMs for computer vision. The repository provides detailed explanations, examples, and tools for working with LLMs.
llms-interview-questions
This repository contains a comprehensive collection of 63 must-know Large Language Models (LLMs) interview questions. It covers topics such as the architecture of LLMs, transformer models, attention mechanisms, training processes, encoder-decoder frameworks, differences between LLMs and traditional statistical language models, handling context and long-term dependencies, transformers for parallelization, applications of LLMs, sentiment analysis, language translation, conversation AI, chatbots, and more. The readme provides detailed explanations, code examples, and insights into utilizing LLMs for various tasks.
Aidan-Bench
Aidan Bench is a tool that rewards creativity, reliability, contextual attention, and instruction following. It is weakly correlated with Lmsys, has no score ceiling, and aligns with real-world open-ended use. The tool involves giving LLMs open-ended questions and evaluating their answers based on novelty scores. Users can set up the tool by installing required libraries and setting up API keys. The project allows users to run benchmarks for different models and provides flexibility in threading options.
LLM-RGB
LLM-RGB is a repository containing a collection of detailed test cases designed to evaluate the reasoning and generation capabilities of Language Learning Models (LLMs) in complex scenarios. The benchmark assesses LLMs' performance in understanding context, complying with instructions, and handling challenges like long context lengths, multi-step reasoning, and specific response formats. Each test case evaluates an LLM's output based on context length difficulty, reasoning depth difficulty, and instruction compliance difficulty, with a final score calculated for each test case. The repository provides a score table, evaluation details, and quick start guide for running evaluations using promptfoo testing tools.
LiveBench
LiveBench is a benchmark tool designed for Language Model Models (LLMs) with a focus on limiting contamination through monthly new questions based on recent datasets, arXiv papers, news articles, and IMDb movie synopses. It provides verifiable, objective ground-truth answers for accurate scoring without an LLM judge. The tool offers 18 diverse tasks across 6 categories and promises to release more challenging tasks over time. LiveBench is built on FastChat's llm_judge module and incorporates code from LiveCodeBench and IFEval.
llm-colosseum
llm-colosseum is a tool designed to evaluate Language Model Models (LLMs) in real-time by making them fight each other in Street Fighter III. The tool assesses LLMs based on speed, strategic thinking, adaptability, out-of-the-box thinking, and resilience. It provides a benchmark for LLMs to understand their environment and take context-based actions. Users can analyze the performance of different LLMs through ELO rankings and win rate matrices. The tool allows users to run experiments, test different LLM models, and customize prompts for LLM interactions. It offers installation instructions, test mode options, logging configurations, and the ability to run the tool with local models. Users can also contribute their own LLM models for evaluation and ranking.
LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.
Awesome-LLM-in-Social-Science
This repository compiles a list of academic papers that evaluate, align, simulate, and provide surveys or perspectives on the use of Large Language Models (LLMs) in the field of Social Science. The papers cover various aspects of LLM research, including assessing their alignment with human values, evaluating their capabilities in tasks such as opinion formation and moral reasoning, and exploring their potential for simulating social interactions and addressing issues in diverse fields of Social Science. The repository aims to provide a comprehensive resource for researchers and practitioners interested in the intersection of LLMs and Social Science.
arena-hard-auto
Arena-Hard-Auto-v0.1 is an automatic evaluation tool for instruction-tuned LLMs. It contains 500 challenging user queries. The tool prompts GPT-4-Turbo as a judge to compare models' responses against a baseline model (default: GPT-4-0314). Arena-Hard-Auto employs an automatic judge as a cheaper and faster approximator to human preference. It has the highest correlation and separability to Chatbot Arena among popular open-ended LLM benchmarks. Users can evaluate their models' performance on Chatbot Arena by using Arena-Hard-Auto.
WildBench
WildBench is a tool designed for benchmarking Large Language Models (LLMs) with challenging tasks sourced from real users in the wild. It provides a platform for evaluating the performance of various models on a range of tasks. Users can easily add new models to the benchmark by following the provided guidelines. The tool supports models from Hugging Face and other APIs, allowing for comprehensive evaluation and comparison. WildBench facilitates running inference and evaluation scripts, enabling users to contribute to the benchmark and collaborate on improving model performance.
duo-attention
DuoAttention is a framework designed to optimize long-context large language models (LLMs) by reducing memory and latency during inference without compromising their long-context abilities. It introduces a concept of Retrieval Heads and Streaming Heads to efficiently manage attention across tokens. By applying a full Key and Value (KV) cache to retrieval heads and a lightweight, constant-length KV cache to streaming heads, DuoAttention achieves significant reductions in memory usage and decoding time for LLMs. The framework uses an optimization-based algorithm with synthetic data to accurately identify retrieval heads, enabling efficient inference with minimal accuracy loss compared to full attention. DuoAttention also supports quantization techniques for further memory optimization, allowing for decoding of up to 3.3 million tokens on a single GPU.
20 - OpenAI Gpts
ICH Score Assistant
A specialised GPT for calculating and understanding Intracranial hemorrhage scores
TLICS Score Assistant
Thoracolumbar Injury Classification and Severity (TLICS) system calculator
Credit Score Check
Guides on checking and monitoring credit scores, with a financial and informative tone.
Your Edu Gurus Free SAT Score Calculator & Expert
Upload your SAT score PDF to our calculator and analyze how you did and how to preform better
WM ACC Score
This Custom GPT ACC Score allows you to input a an thought and analyze where on the ACC spectrum it falls.
SEOGenius - Craft SEO titles & Effectiveness Score
Crafts SEO-friendly titles, subtitles, summaries, TLDRs, and hashtags for online content. Imagine crafting titles so SEO-friendly that Google sends you a personal thank-you note 😂
GMAT Tutor
Get 1-on-1 tutoring. Trained from official questions only (verbal, quant, data insights). Score in the 90th percentile! 🚀
MetaRottenMDB
Average score from Metacritic, Rotten Tomatoes, IMDB, TMDB for any movie or series.
Rate My {{Startup}}
I will score your Mind Blowing Startup Ideas, helping your to evaluate faster.
Raven's Progressive Matrices Test
Provides Raven's Progressive Matrices test with explanations and calculates your IQ score.
Guess the 80s TV Show
Generates images hinting at 80s TV shows, keeping score for a guessing game.
Anxiety Assessment Guide
This GPT will guide you through an anxiety questionnaire and calculate your total score.