Best AI tools for< Use Data >
20 - AI tool Sites
Maekersuite
Maekersuite is an AI-powered platform designed to assist users in researching and scripting videos. It offers a wide range of tools and features to streamline the video creation process, from generating video ideas to optimizing scripts using data and AI. The platform aims to help users create engaging and data-driven video content for various purposes such as marketing, social media, education, and business.
DEUS
DEUS is a data and artificial intelligence company that empowers organizations to advance value creation by unlocking the true value within their data and applying AI services. They offer services in data science, engineering, design, and strategy, partnering with organizations to benefit people, business, and society. DEUS also focuses on addressing wicked problems and societal challenges through human-centered artificial intelligence initiatives. They help organizations launch AI projects that create real value and partner across the product and service lifecycle.
DataLang
DataLang is a tool that allows you to chat with your databases, expose a specific set of data (using SQL) to train GPT, and then chat with it in natural language. You can also use DataLang to automatically make your SQL views available via API, share it with your privately users, or make it public.
Stepsize AI
Stepsize AI is an AI-powered reporting tool for software development teams. It analyzes issue tracker activity to generate automated weekly updates on team and project progress. Stepsize AI provides metrics with automatic commentary, project-level AI insights, and intelligent delivery risk surfacing. It offers tailored insights, complete visibility, and unified focus, helping teams stay aligned and make timely decisions.
Hook
Hook is an AI-powered platform designed to help businesses grow their revenue by accurately predicting high-value customers, reducing churn, increasing expansion revenue, and actively managing accounts. The platform uses AI to analyze usage patterns and third-party data to prioritize customers for revenue growth and upsells. With a focus on data-driven decision-making, Hook empowers revenue teams to make informed choices and drive business growth.
Velocity Explorations
Velocity Explorations is an AI tool that empowers warfighters with cutting-edge technology by enhancing existing software systems with advanced AI capabilities. The team uses data to develop impactful solutions, focusing on prototyping, iterative development, and user-centered design. Their services include AI integration, spaceport integration, and business optimization to streamline processes and improve operational efficiency. The technology offered includes secure, hosted Mattermost for DoD teams, flexible AI integration, and AI-driven content based on live audio recordings.
Ajelix BI
Ajelix BI is a powerful yet easy-to-use data visualization platform that helps small business owners and non-technical teams create professional-looking reports with a dash of AI data analytics. With Ajelix BI, you can upload your spreadsheet files and create dashboards to share with colleagues, without the need for complex setups or database integrations. Ajelix BI also offers a suite of AI-powered tools for Excel, including an Excel formula generator, VBA & Google Apps code creator, Excel template builder, file translator, and Excel & Google Sheets add-on. These tools can help you automate tasks, improve productivity, and gain insights from your data.
BRIA.ai
BRIA.ai is a visual generative AI platform that provides developers and businesses with the tools they need to build and deploy AI-powered applications. The platform includes a suite of pre-trained foundation models, APIs, and tools that can be used to generate and modify images, videos, and other visual content. BRIA.ai is committed to responsible AI practices and ensures that all of its models are trained on licensed and safe-to-use data.
Orbital Insight GO Platform
Orbital Insight is a leading geospatial data analytics platform that provides users with the ability to query the world with three basic parameters: WHAT type of activity? WHERE on earth? WHEN? The platform automates the most difficult steps of deriving insights, allowing you to answer many challenging geospatial questions. Orbital Insight's GO platform is designed for enterprise collaboration and transforms multiple geospatial data sources to accelerate and streamline team member's research, reporting, due diligence, and more.
Dataiku
Dataiku is an end-to-end platform for data and AI projects. It provides a range of capabilities, including data preparation, machine learning, data visualization, and collaboration tools. Dataiku is designed to make it easy for users to build, deploy, and manage AI projects at scale.
syntheticAIdata
syntheticAIdata is a platform that provides synthetic data for training vision AI models. Synthetic data is generated artificially, and it can be used to augment existing real-world datasets or to create new datasets from scratch. syntheticAIdata's platform is easy to use, and it can be integrated with leading cloud platforms. The company's mission is to make synthetic data accessible to everyone, and to help businesses overcome the challenges of acquiring high-quality data for training their vision AI models.
Data & Trust Alliance
The Data & Trust Alliance is a group of industry-leading enterprises focusing on the responsible use of data and intelligent systems. They develop practices to enhance trust in data and AI models, ensuring transparency and reliability in the deployment processes. The alliance works on projects like Data Provenance Standards and Assessing third-party model trustworthiness to promote innovation and trust in AI applications. Through technology and innovation adoption, they aim to leverage expertise and influence for practical solutions and broad adoption across industries.
LearnQ.ai
LearnQ.ai is a personalized AI-powered learning platform that transforms the learning journey by providing real-time data and insights to students, teachers, and administrators. It utilizes AI to detect and address learning gaps, empower teachers with student analytics, and boost students' confidence through data-driven learning. The platform offers various features such as diagnostic tests, engaging game-based learning modules, full-length practice tests, and a personalized AI tutor named Mia.
Velotix
Velotix is an AI-powered data security platform that offers groundbreaking visual data security solutions to help organizations discover, visualize, and use their data securely and compliantly. The platform provides features such as data discovery, permission discovery, self-serve data access, policy-based access control, AI recommendations, and automated policy management. Velotix aims to empower enterprises with smart and compliant data access controls, ensuring data integrity and compliance. The platform helps organizations gain data visibility, control access, and enforce policy compliance, ultimately enhancing data security and governance.
Sicara
Sicara is a data and AI expert platform that helps clients define and implement data strategies, build data platforms, develop data science products, and automate production processes with computer vision. They offer services to improve data performance, accelerate data use cases, integrate generative AI, and support ESG transformation. Sicara collaborates with technology partners to provide tailor-made solutions for data and AI challenges. The platform also features a blog, job offers, and a team of experts dedicated to enhancing productivity and quality in data projects.
Creators
Creators is a website that offers a service to create pitch decks for startups and growing businesses. They specialize in creating visually stunning and impactful pitch decks that tell the story of the business and capture the attention of investors. They use a data-driven approach to storytelling, incorporating relevant data and analytics to back up the idea and prove its potential to investors. They also use artificial intelligence to identify the most compelling way to present the information, ensuring that the pitch deck is not just informative, but also engaging. Creators has a team of expert designers who excel at transforming complex ideas into clear, understandable visuals that are both stunning and highly effective in communicating the message to potential investors.
Segmed's De-Id Playground
Segmed's De-Id Playground is an AI tool designed for de-identification of sensitive data. The application utilizes Natural Language Processing (NLP) and language models to remove any Personal Health Information (PHI) from the provided data samples. It is a demo tool and not recommended for production use. Users can reach out to Segmed for De-Id services. The tool ensures that no data is saved or stored by Segmed.ai, providing a secure environment for data cleaning.
SQL Genius
SQL Genius is an AI-powered tool that helps you translate English language queries into SQL queries. It is designed to make it easier for non-technical users to access and use SQL databases. With SQL Genius, you can simply type in your question in plain English, and the tool will automatically generate the corresponding SQL query. This can save you a lot of time and effort, and it can also help you to avoid errors.
ClearSeas.AI
ClearSeas.AI is an AI-powered market research visualization dashboard that leverages advanced AI technologies to interpret complex datasets, delivering intuitive and visually compelling insights. The platform offers services such as expert survey design, efficient survey deployment, B2B panel access, and full-service project management. ClearSeas.AI prioritizes data security by employing stringent protocols, advanced encryption technologies, and document protection measures. The platform does not use customer data for AI training, ensuring data privacy and confidentiality. ClearSeas.AI aims to illuminate insights with the power of AI, empowering users to stay ahead of market trends and make informed strategic decisions.
Mindful AI Media
Mindful AI Media is a platform offering AI-powered tools and digital solutions to optimize business operations. They provide customized solutions for businesses of all sizes, focusing on innovation, integrity, and empathy. The platform offers products like Business App Pro, Grand Central AI, and Inbox Pro to help businesses streamline operations, enhance customer experience, and drive growth. Mindful AI Media aims to empower businesses through cutting-edge technology and ethical use of data.
20 - Open Source AI Tools
dataherald
Dataherald is a natural language-to-SQL engine built for enterprise-level question answering over structured data. It allows you to set up an API from your database that can answer questions in plain English. You can use Dataherald to: * Allow business users to get insights from the data warehouse without going through a data analyst * Enable Q+A from your production DBs inside your SaaS application * Create a ChatGPT plug-in from your proprietary data
data-prep-kit
Data Prep Kit is a community project aimed at democratizing and speeding up unstructured data preparation for LLM app developers. It provides high-level APIs and modules for transforming data (code, language, speech, visual) to optimize LLM performance across different use cases. The toolkit supports Python, Ray, Spark, and Kubeflow Pipelines runtimes, offering scalability from laptop to datacenter-scale processing. Developers can contribute new custom modules and leverage the data processing library for building data pipelines. Automation features include workflow automation with Kubeflow Pipelines for transform execution.
hash
HASH is a self-building, open-source database which grows, structures and checks itself. With it, we're creating a platform for decision-making, which helps you integrate, understand and use data in a variety of different ways.
SuperAdapters
SuperAdapters is a tool designed to finetune Large Language Models (LLMs) with various adapters on different platforms. It supports models like Bloom, LLaMA, ChatGLM, Qwen, Baichuan, Mixtral, Phi, and more. Users can finetune LLMs on Windows, Linux, and Mac M1/2, handle train/test data with Terminal, File, or DataBase, and perform tasks like CausalLM and SequenceClassification. The tool provides detailed instructions on how to use different models with specific adapters for tasks like finetuning and inference. It also includes requirements for CentOS, Ubuntu, and MacOS, along with information on LLM downloads and data formats. Additionally, it offers parameters for finetuning and inference, as well as options for web and API-based inference.
log10
Log10 is a one-line Python integration to manage your LLM data. It helps you log both closed and open-source LLM calls, compare and identify the best models and prompts, store feedback for fine-tuning, collect performance metrics such as latency and usage, and perform analytics and monitor compliance for LLM powered applications. Log10 offers various integration methods, including a python LLM library wrapper, the Log10 LLM abstraction, and callbacks, to facilitate its use in both existing production environments and new projects. Pick the one that works best for you. Log10 also provides a copilot that can help you with suggestions on how to optimize your prompt, and a feedback feature that allows you to add feedback to your completions. Additionally, Log10 provides prompt provenance, session tracking and call stack functionality to help debug prompt chains. With Log10, you can use your data and feedback from users to fine-tune custom models with RLHF, and build and deploy more reliable, accurate and efficient self-hosted models. Log10 also supports collaboration, allowing you to create flexible groups to share and collaborate over all of the above features.
data-juicer
Data-Juicer is a one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs. It is a systematic & reusable library of 80+ core OPs, 20+ reusable config recipes, and 20+ feature-rich dedicated toolkits, designed to function independently of specific LLM datasets and processing pipelines. Data-Juicer allows detailed data analyses with an automated report generation feature for a deeper understanding of your dataset. Coupled with multi-dimension automatic evaluation capabilities, it supports a timely feedback loop at multiple stages in the LLM development process. Data-Juicer offers tens of pre-built data processing recipes for pre-training, fine-tuning, en, zh, and more scenarios. It provides a speedy data processing pipeline requiring less memory and CPU usage, optimized for maximum productivity. Data-Juicer is flexible & extensible, accommodating most types of data formats and allowing flexible combinations of OPs. It is designed for simplicity, with comprehensive documentation, easy start guides and demo configs, and intuitive configuration with simple adding/removing OPs from existing configs.
create-tsi
Create TSI is a generative AI RAG toolkit that simplifies the process of creating AI Applications using LlamaIndex with low code. The toolkit leverages LLMs hosted by T-Systems on Open Telekom Cloud to generate bots, write agents, and customize them for specific use cases. It provides a Next.js-powered front-end for a chat interface, a Python FastAPI backend powered by llama-index package, and the ability to ingest and index user-supplied data for answering questions.
Open_Data_QnA
Open Data QnA is a Python library that allows users to interact with their PostgreSQL or BigQuery databases in a conversational manner, without needing to write SQL queries. The library leverages Large Language Models (LLMs) to bridge the gap between human language and database queries, enabling users to ask questions in natural language and receive informative responses. It offers features such as conversational querying with multiturn support, table grouping, multi schema/dataset support, SQL generation, query refinement, natural language responses, visualizations, and extensibility. The library is built on a modular design and supports various components like Database Connectors, Vector Stores, and Agents for SQL generation, validation, debugging, descriptions, embeddings, responses, and visualizations.
mindnlp
MindNLP is an open-source NLP library based on MindSpore. It provides a platform for solving natural language processing tasks, containing many common approaches in NLP. It can help researchers and developers to construct and train models more conveniently and rapidly. Key features of MindNLP include: * Comprehensive data processing: Several classical NLP datasets are packaged into a friendly module for easy use, such as Multi30k, SQuAD, CoNLL, etc. * Friendly NLP model toolset: MindNLP provides various configurable components. It is friendly to customize models using MindNLP. * Easy-to-use engine: MindNLP simplified complicated training process in MindSpore. It supports Trainer and Evaluator interfaces to train and evaluate models easily. MindNLP supports a wide range of NLP tasks, including: * Language modeling * Machine translation * Question answering * Sentiment analysis * Sequence labeling * Summarization MindNLP also supports industry-leading Large Language Models (LLMs), including Llama, GLM, RWKV, etc. For support related to large language models, including pre-training, fine-tuning, and inference demo examples, you can find them in the "llm" directory. To install MindNLP, you can either install it from Pypi, download the daily build wheel, or install it from source. The installation instructions are provided in the documentation. MindNLP is released under the Apache 2.0 license. If you find this project useful in your research, please consider citing the following paper: @misc{mindnlp2022, title={{MindNLP}: a MindSpore NLP library}, author={MindNLP Contributors}, howpublished = {\url{https://github.com/mindlab-ai/mindnlp}}, year={2022} }
litdata
LitData is a tool designed for blazingly fast, distributed streaming of training data from any cloud storage. It allows users to transform and optimize data in cloud storage environments efficiently and intuitively, supporting various data types like images, text, video, audio, geo-spatial, and multimodal data. LitData integrates smoothly with frameworks such as LitGPT and PyTorch, enabling seamless streaming of data to multiple machines. Key features include multi-GPU/multi-node support, easy data mixing, pause & resume functionality, support for profiling, memory footprint reduction, cache size configuration, and on-prem optimizations. The tool also provides benchmarks for measuring streaming speed and conversion efficiency, along with runnable templates for different data types. LitData enables infinite cloud data processing by utilizing the Lightning.ai platform to scale data processing with optimized machines.
goodai-ltm-benchmark
This repository contains code and data for replicating experiments on Long-Term Memory (LTM) abilities of conversational agents. It includes a benchmark for testing agents' memory performance over long conversations, evaluating tasks requiring dynamic memory upkeep and information integration. The repository supports various models, datasets, and configurations for benchmarking and reporting results.
llm-zoomcamp
LLM Zoomcamp is a free online course focusing on real-life applications of Large Language Models (LLMs). Over 10 weeks, participants will learn to build an AI bot capable of answering questions based on a knowledge base. The course covers topics such as LLMs, RAG, open-source LLMs, vector databases, orchestration, monitoring, and advanced RAG systems. Pre-requisites include comfort with programming, Python, and the command line, with no prior exposure to AI or ML required. The course features a pre-course workshop and is led by instructors Alexey Grigorev and Magdalena Kuhn, with support from sponsors and partners.
awesome-llms-fine-tuning
This repository is a curated collection of resources for fine-tuning Large Language Models (LLMs) like GPT, BERT, RoBERTa, and their variants. It includes tutorials, papers, tools, frameworks, and best practices to aid researchers, data scientists, and machine learning practitioners in adapting pre-trained models to specific tasks and domains. The resources cover a wide range of topics related to fine-tuning LLMs, providing valuable insights and guidelines to streamline the process and enhance model performance.
langfuse
Langfuse is a powerful tool that helps you develop, monitor, and test your LLM applications. With Langfuse, you can: * **Develop:** Instrument your app and start ingesting traces to Langfuse, inspect and debug complex logs, and manage, version, and deploy prompts from within Langfuse. * **Monitor:** Track metrics (cost, latency, quality) and gain insights from dashboards & data exports, collect and calculate scores for your LLM completions, run model-based evaluations, collect user feedback, and manually score observations in Langfuse. * **Test:** Track and test app behaviour before deploying a new version, test expected in and output pairs and benchmark performance before deploying, and track versions and releases in your application. Langfuse is easy to get started with and offers a generous free tier. You can sign up for Langfuse Cloud or deploy Langfuse locally or on your own infrastructure. Langfuse also offers a variety of integrations to make it easy to connect to your LLM applications.
fuse-med-ml
FuseMedML is a Python framework designed to accelerate machine learning-based discovery in the medical field by promoting code reuse. It provides a flexible design concept where data is stored in a nested dictionary, allowing easy handling of multi-modality information. The framework includes components for creating custom models, loss functions, metrics, and data processing operators. Additionally, FuseMedML offers 'batteries included' key components such as fuse.data for data processing, fuse.eval for model evaluation, and fuse.dl for reusable deep learning components. It supports PyTorch and PyTorch Lightning libraries and encourages the creation of domain extensions for specific medical domains.
MetaGPT
MetaGPT is a multi-agent framework that enables GPT to work in a software company, collaborating to tackle more complex tasks. It assigns different roles to GPTs to form a collaborative entity for complex tasks. MetaGPT takes a one-line requirement as input and outputs user stories, competitive analysis, requirements, data structures, APIs, documents, etc. Internally, MetaGPT includes product managers, architects, project managers, and engineers. It provides the entire process of a software company along with carefully orchestrated SOPs. MetaGPT's core philosophy is "Code = SOP(Team)", materializing SOP and applying it to teams composed of LLMs.
ManipVQA
ManipVQA is a framework that enhances Multimodal Large Language Models (MLLMs) with manipulation-centric knowledge through a Visual Question-Answering (VQA) format. It addresses the deficiency of conventional MLLMs in understanding affordances and physical concepts crucial for manipulation tasks. By infusing robotics-specific knowledge, including tool detection, affordance recognition, and physical concept comprehension, ManipVQA improves the performance of robots in manipulation tasks. The framework involves fine-tuning MLLMs with a curated dataset of interactive objects, enabling robots to understand and execute natural language instructions more effectively.
Perplexica
Perplexica is an open-source AI-powered search engine that utilizes advanced machine learning algorithms to provide clear answers with sources cited. It offers various modes like Copilot Mode, Normal Mode, and Focus Modes for specific types of questions. Perplexica ensures up-to-date information by using SearxNG metasearch engine. It also features image and video search capabilities and upcoming features include finalizing Copilot Mode and adding Discover and History Saving features.
babilong
BABILong is a generative benchmark designed to evaluate the performance of NLP models in processing long documents with distributed facts. It consists of 20 tasks that simulate interactions between characters and objects in various locations, requiring models to distinguish important information from irrelevant details. The tasks vary in complexity and reasoning aspects, with test samples potentially containing millions of tokens. The benchmark aims to challenge and assess the capabilities of Large Language Models (LLMs) in handling complex, long-context information.
Consistency_LLM
Consistency Large Language Models (CLLMs) is a family of efficient parallel decoders that reduce inference latency by efficiently decoding multiple tokens in parallel. The models are trained to perform efficient Jacobi decoding, mapping any randomly initialized token sequence to the same result as auto-regressive decoding in as few steps as possible. CLLMs have shown significant improvements in generation speed on various tasks, achieving up to 3.4 times faster generation. The tool provides a seamless integration with other techniques for efficient Large Language Model (LLM) inference, without the need for draft models or architectural modifications.
20 - OpenAI Gpts
Palm Reader
Moved to https://chat.openai.com/g/g-KFnF7qssT-palm-reader . Interprets palm readings from user-uploaded hand images. Turned off setting to use data for OpenAi to improve model.
Face Reader
Moved to https://chat.openai.com/g/g-q6GNcOkYx-face-reader. Reads faces to tell fortunes based on Chinese face reading. Turned off setting to use data for OpenAi to improve model.
OpenStreetMap Query
Helps get map data from Open Street Map by generating Overpass Turbo queries. Ask me for mapping features like cafes, rivers or highways
InvestorUpdateAssistantGPT
This GPT assists in creating impactful investor updates for companies that have already received funding. It asks insightful questions and recommends KPIs and data that should be included, even assisting with formatting and structuring with updates. It prompts you to opt out of sharing chat data.
Notion Formula & Settings Expert
A Notion expert that can help the user by providing advanced tips about how to use Notion effectively from creating templates, using relational databases, writing Notion formulas or optimizing their workflow.
📩 メール執筆・校正アシスタント【✅セキュリティ強化済み】
メールの作成・添削・返信の提案などを行います。メールの校正は、草案をそのままコピペするだけで可能です。件名・署名・添付ファイルなどを忘れないようリマインドも行います。さらにこのGPTは、特別な設定で「Use conversation data in your GPT to improve our models」の項目をOFFにしています。そのため、うっかり機密情報が含まれるメール文を送信してしまった場合でも、ChatGPTの学習に利用されないと思われます。 ※ただし、今後の仕様変更や他の経路からの情報漏洩などのリスクもありえます。個人情報は決して書き込まないでください。
Sommelier de dados
Opa! Cole o texto da sua reportagem ou trecho para que eu possa analisá-la com base em manuais de uso de dados em textos jornalísticos.
Vulkan Graphics Programming Helper
Helps beginners understand Vulkan concepts and terminology
HouseGPT
This GPT will take a user's data and use it to construct a fake TV scene. Start by providing it with your character's Patient Profile, Diagnostic Findings, and Lab Data
ecosystem.Ai Use Case Designer v2
The use case designer is configured with the latest Data Science and Behavioral Social Science insights to guide you through the process of defining AI and Machine Learning use cases for the ecosystem.Ai platform.
Deal Architect
Designing Strategic M&A Blueprints for Success in buying, selling or merging companies. Use this GPT to simplify, speed up and improve the quality of the M&A process. With custom data - 100s of creative options in deal flow, deal structuring, financing and more. **Version 2.2 - 28012024**
AI Use Case Analyst for Sales & Marketing
Enables sales & marketing leadership to identify high-value AI use cases
Terms of Use & Privacy policy Assistant
OpenAIのTerms of UseとPrivacy policyを参照できます(2023年12月14日適用分)
Dalia
Material facts application that details the content, impact category data, and environmental rating of materials.
Metaphor API Guide - Python SDK
Teaches you how to use the Metaphor Search API using our Python SDK