Best AI tools for < build data culture >
20 - AI tool Sites
Ferdinand
Ferdinand is a chatbot that provides interactive mini-courses on data and analytics, delivered directly through Slack. It aims to make data accessible and understandable for everyone, regardless of their background or experience. Ferdinand offers a range of features to help users learn about data, including:
* Interactive mini-courses: Ferdinand's courses are designed to be engaging and easy to follow, with each lesson taking just 5-10 minutes to complete.
* Personalized learning paths: Users can choose their own learning path, based on their interests and needs.
* Progress tracking: Ferdinand tracks users' progress through the courses, so they can see how they are improving.
* Integration with Slack: Ferdinand integrates directly with Slack, so users can access the courses and track their progress without leaving the platform.
Ferdinand is a valuable tool for anyone who wants to learn more about data and analytics. It is easy to use, affordable, and effective.
GoProfiles
GoProfiles is an AI-powered people platform that helps remote and hybrid organizations build a culture of genuine connection. It combines the best parts of employee directories and peer recognition to create a new kind of people platform that makes it easy to find and connect with colleagues, celebrate achievements, and get to know the people behind the screens. GoProfiles is used by over 5,000 enterprises and startups to improve employee engagement, reduce new hire ramp time, and save money.
CustomerIQ
CustomerIQ is an AI platform that automatically discovers and quantifies themes across customer feedback channels like calls, surveys, tickets, and transcripts. It aggregates customer feedback, extracts and categorizes feature requests, pain points, preferences, and highlights related to customers. The platform helps align teams, prioritize work, and build a customer-obsessed culture. CustomerIQ accelerates development by scoping project requirements faster and providing actionable insights backed with context.
Macgence AI Training Data Services
Macgence is an AI training data services platform that offers high-quality off-the-shelf structured training data for organizations to build effective AI systems at scale. They provide services such as custom data sourcing, data annotation, data validation, content moderation, and localization. Macgence combines global linguistic, cultural, and technological expertise to create high-quality datasets for AI models, enabling faster time-to-market across the entire model value chain. With more than 5 years of experience, they support and scale AI initiatives of leading global innovators by designing custom data collection programs. Macgence specializes in handling AI training data for text, speech, image, and video data, offering cognitive annotation services to unlock the potential of unstructured textual data.
Audiense
Audiense is a leading provider of audience intelligence and social media marketing solutions. Our mission is to democratize audience insights and empower marketers with the data and tools they need to make better decisions. We offer a suite of products that provide deep insights into social media audiences, digital consumer behavior, and demand intelligence. With Audiense, you can understand your target audience, create more effective marketing campaigns, and measure the impact of your efforts.
Qubinets
Qubinets is a cloud data environment platform that provides building blocks for big data, AI, web, and mobile environments. It is an open-source, no lock-in, secured, and private platform that can be used on any cloud, including AWS, Digital Ocean, Google Cloud, and Microsoft Azure. Qubinets makes it easy to plan, build, and run data environments, and it saves time and money by reducing the grunt work of setup and provisioning.
FranzAI LLM Playground
FranzAI LLM Playground is an AI-powered tool that helps you extract, classify, and analyze unstructured text data. It leverages transformer models to provide accurate and meaningful results, enabling you to build data applications faster and more efficiently. With FranzAI, you can accelerate product and content classification, enhance data interpretation, and advance data extraction processes, unlocking key insights from your textual data.
Alfatec Elarion
Alfatec Elarion is a powerful big data and AI platform that extracts data from any source and transforms it into enlightening information to help users gain deep insights. The platform offers solutions for various industries, including hospitality, insights development, and cyberintelligence. It provides services such as data modeling, loyalty survey analytics, online reputation management, and more. With a focus on data analytics, security, databases, software development, and homeland security, Alfatec Elarion aims to be a comprehensive solution for businesses seeking to leverage data for informed decision-making.
Lume AI
Lume AI is a data mapping tool that uses artificial intelligence (AI) to automate the process of mapping data between different schemas. This can save businesses a significant amount of time and effort, as manual data mapping can be a complex and error-prone process. Lume AI's AI-powered data mapping engine can automatically generate mapping logic to move data between any two schemas, regardless of their complexity. This makes it easy to onboard new customers and partners, ingest messy data, and build data pipelines faster. Lume AI also offers a number of other features that make it a valuable tool for businesses, including the ability to review and deploy mappers, auto-maintain mappers, and handle complex data mappings with ease.
Sicara
Sicara is a data and AI expert platform that helps clients define and implement data strategies, build data platforms, develop data science products, and automate production processes with computer vision. They offer services to improve data performance, accelerate data use cases, integrate generative AI, and support ESG transformation. Sicara collaborates with technology partners to provide tailor-made solutions for data and AI challenges. The platform also features a blog, job offers, and a team of experts dedicated to enhancing productivity and quality in data projects.
OGBRAIN
OGBRAIN is a leading provider of crypto data intelligence, offering advanced analytics and on-chain insights to help users make informed decisions in the cryptocurrency market. Our platform provides a comprehensive suite of tools and resources, including real-time market data, historical analysis, and expert insights, to empower traders, investors, and researchers alike. With OGBRAIN, you can stay ahead of the curve and navigate the ever-changing crypto landscape with confidence.
Coginiti
Coginiti is a collaborative data operations platform that empowers teams to build, publish, and consume trusted data products. It offers a range of features to improve analytic consistency, increase productivity, and expedite the delivery of valuable insights. Coginiti's AI-powered capabilities include a generative AI assistant, Coginiti AI, which can help users with tasks such as data exploration, feature engineering, and model development.
Tablesmith
Tablesmith is a free, privacy-first, and intuitive spreadsheet automation tool that allows users to build reusable data flows, effortlessly sort, filter, group, format, or split data across files/sheets based on cell values. It is designed to be easy to learn and use, with a focus on privacy and cross-platform compatibility. Tablesmith also offers an AI autofill feature that suggests and fills in information based on the user's prompt.
Looker
Looker is a business intelligence platform that offers embedded analytics and AI-powered BI solutions. Leveraging Google's AI-led innovation, Looker delivers intelligent BI by combining foundational AI, cloud-first infrastructure, industry-leading APIs, and a flexible semantic layer. It allows users to build custom data experiences, transform data into integrated experiences, and create deeply integrated dashboards. Looker also provides a universal semantic modeling layer for unified, trusted data sources and offers self-service analytics capabilities through Looker and Looker Studio. Additionally, Looker features Gemini, an AI-powered analytics assistant that accelerates analytical workflows and offers a collaborative and conversational user experience.
Streamlit
Streamlit is an open-source Python library that makes it easy to create and share beautiful and interactive web apps for data science and machine learning. With Streamlit, you can quickly build custom dashboards, data exploration tools, and machine learning models without having to write complex front-end code.
Streamlit
Streamlit is an open-source Python library that makes it easy to create and share beautiful and interactive web apps for data science and machine learning. With Streamlit, you can quickly build custom dashboards, data exploration tools, and machine learning models that can be easily shared with others. Streamlit is designed to be easy to use, even for those with no web development experience. It provides a simple and intuitive API that makes it easy to create complex and interactive web apps with just a few lines of code.
Credal
Credal is an end-to-end enterprise Retrieval Augmented Generation (RAG) platform that enables businesses to build secure, access-controlled AI applications on top of their data. With Credal, businesses can connect their company's data securely, sync permissions from source systems, and deploy generative AI with full security and compliance. Credal also provides a range of pre-built UI frameworks and low-code/no-code tooling to help businesses implement workflows without developer support.
Granica
Granica is an AI Infrastructure Platform that provides data management solutions for generative and traditional AI teams. Its products include Granica Screen for data privacy, Granica Crunch for data compression, and Granica Chronicle AI for data visibility. Granica's platform helps businesses build better AI models by providing tools to store and collect training data efficiently, enhance its privacy, and gain insights into its usage. Granica is trusted by category-defining companies such as Quantum Metric, Here Technologies, and Nylas.
Columns
Columns is an AI-powered data storytelling tool that helps you turn your data into beautiful, shareable stories. With Columns, you can easily connect to your data sources, transform your data into compelling visual narratives, and share your stories with your team. Columns is the perfect tool for anyone who wants to communicate data effectively, whether you're a marketer, a data analyst, or a business leader.
Telechat
Telechat is a platform that allows users to create and deploy custom chatbots on Telegram. With Telechat, users can upload their own data, fine-tune the knowledge base, and customize the chatbot's personality. Telechat also provides a range of features to help users connect their chatbots to Telegram and other channels. Telechat is suitable for a variety of use cases, including customer support, internal knowledge bases, and community engagement.
20 - Open Source AI Tools
octopus-v4
The Octopus-v4 project aims to build the world's largest graph of language models, integrating specialized models and training Octopus models to connect nodes efficiently. The project focuses on identifying, training, and connecting specialized models. The repository includes scripts for running the Octopus v4 model, methods for managing the graph, training code for specialized models, and inference code. Environment setup instructions are provided for Linux with an NVIDIA GPU. The Octopus v4 model helps users find suitable models for tasks and reformats queries for effective processing. The project leverages Large Language Models for various domains and provides benchmark results. Users are encouraged to train and add specialized models following recommended procedures.
ClipboardConqueror
Clipboard Conqueror is a multi-platform, omnipresent copilot alternative. It currently requires a Kobold United or OpenAI-compatible back end, and it brings powerful LLM-based tools to any text field: the universal copilot you deserve. It simply works anywhere. No need to sign in, no required key. Provided you are using local AI, CC is a data-secure alternative integration, as long as you trust whatever backend you use. Special thank you to the creators of KoboldAi, KoboldCPP, llamma, openAi, and the communities that made all this possible to figure out.
motorhead
Motorhead is a memory and information retrieval server for LLMs. It provides three simple APIs to assist with memory handling in chat applications using LLMs. The first API, GET /sessions/:id/memory, returns messages up to a maximum window size. The second API, POST /sessions/:id/memory, allows you to send an array of messages to Motorhead for storage. The third API, DELETE /sessions/:id/memory, deletes the session's message list. Motorhead also features incremental summarization, where it processes half of the maximum window size of messages and summarizes them when the maximum is reached. Additionally, it supports searching by text query using vector search. Motorhead is configurable through environment variables, including the maximum window size, whether to enable long-term memory, the model used for incremental summarization, the server port, your OpenAI API key, and the Redis URL.
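The three endpoints described above can be sketched as a small Python client. This is a minimal illustration, not Motorhead's official client: the base URL, the port, the `{"messages": [...]}` payload shape, and all function names are assumptions made for this example. The `split_for_summary` helper is likewise only a hypothetical reading of the incremental summarization rule (it assumes the oldest half of the window is the part that gets summarized).

```python
import json
from urllib.request import Request

# Assumption: host and port are illustrative; the real port is set
# through Motorhead's environment variables.
BASE_URL = "http://localhost:8080"


def memory_url(session_id: str) -> str:
    """Build the /sessions/:id/memory URL for a session."""
    return f"{BASE_URL}/sessions/{session_id}/memory"


def get_memory(session_id: str) -> Request:
    """Request the stored messages (up to the maximum window size)."""
    return Request(memory_url(session_id), method="GET")


def post_memory(session_id: str, messages: list) -> Request:
    """Send an array of messages for storage.

    Assumption: the body wraps the array in a "messages" key.
    """
    body = json.dumps({"messages": messages}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    return Request(memory_url(session_id), data=body, headers=headers, method="POST")


def delete_memory(session_id: str) -> Request:
    """Delete the session's message list."""
    return Request(memory_url(session_id), method="DELETE")


def split_for_summary(messages: list, max_window: int):
    """Toy model of incremental summarization.

    Once the window is full, half of the maximum window size is handed
    to the summarizer and the rest is kept verbatim. Assumption: the
    list is ordered oldest first and the oldest half is summarized.
    """
    if len(messages) < max_window:
        return [], messages
    half = max_window // 2
    return messages[:half], messages[half:]
```

Each function only builds a request object; passing the result to `urllib.request.urlopen` would send it to a running server.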
Awesome-Code-LLM
mllm
mllm is a fast and lightweight multimodal LLM inference engine for mobile and edge devices. It is a Plain C/C++ implementation without dependencies, optimized for multimodal LLMs like fuyu-8B, and supports ARM NEON and x86 AVX2. The engine offers 4-bit and 6-bit integer quantization, making it suitable for intelligent personal agents, text-based image searching/retrieval, screen VQA, and various mobile applications without compromising user privacy.
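To make the idea of 4-bit integer quantization concrete, here is a generic sketch of symmetric quantization in plain Python. It is not mllm's actual scheme (the source does not describe its block layout or rounding); the function names and the single per-list scale are assumptions chosen for illustration.

```python
def quantize_4bit(values):
    """Map floats to signed 4-bit integers in [-8, 7] with one shared scale.

    Illustrative only: real engines typically quantize in small blocks
    and pack two 4-bit values per byte.
    """
    max_abs = max(abs(v) for v in values) or 1.0  # avoid a zero scale
    scale = max_abs / 7  # largest magnitude maps to +/-7
    quantized = [max(-8, min(7, round(v / scale))) for v in values]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate floats from the 4-bit integers."""
    return [q * scale for q in quantized]
```

Each weight then needs 4 bits plus a share of the scale instead of 32 bits, at the cost of a rounding error of at most half a quantization step per value.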
simple-openai
Simple-OpenAI is a Java library that provides a simple way to interact with the OpenAI API. It offers consistent interfaces for various OpenAI services like Audio, Chat Completion, Image Generation, and more. The library uses CleverClient for HTTP communication, Jackson for JSON parsing, and Lombok to reduce boilerplate code. It supports asynchronous requests and provides methods for synchronous calls as well. Users can easily create objects to communicate with the OpenAI API and perform tasks like text-to-speech, transcription, image generation, and chat completions.
fabric
Fabric is an open-source framework for augmenting humans using AI. It provides a structured approach to breaking down problems into individual components and applying AI to them one at a time. Fabric includes a collection of pre-defined Patterns (prompts) that can be used for a variety of tasks, such as extracting the most interesting parts of YouTube videos and podcasts, writing essays, summarizing academic papers, creating AI art prompts, and more. Users can also create their own custom Patterns. Fabric is designed to be easy to use, with a command-line interface and a variety of helper apps. It is also extensible, allowing users to integrate it with their own AI applications and infrastructure.
awesome-generative-ai
A curated list of Generative AI projects, tools, artworks, and models
nlp-llms-resources
The 'nlp-llms-resources' repository is a comprehensive resource list for Natural Language Processing (NLP) and Large Language Models (LLMs). It covers a wide range of topics including traditional NLP datasets, data acquisition, libraries for NLP, neural networks, sentiment analysis, optical character recognition, information extraction, semantics, topic modeling, multilingual NLP, domain-specific LLMs, vector databases, ethics, costing, books, courses, surveys, aggregators, newsletters, papers, conferences, and societies. The repository provides valuable information and resources for individuals interested in NLP and LLMs.
meet-libai
The 'meet-libai' project aims to promote and popularize the cultural heritage of the Chinese poet Li Bai by constructing a knowledge graph of Li Bai and training a specialized AI agent using large models. The project includes features such as data preprocessing, knowledge graph construction, question-answering system development, and visual exploration of the graph structure. It also provides code implementations for large models and retrieval-augmented generation (RAG).
SeaLLMs
SeaLLMs are a family of language models optimized for Southeast Asian (SEA) languages. They were pre-trained from Llama-2, on a tailored publicly-available dataset, which comprises texts in Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai 🇹🇭, Malay 🇲🇾, Khmer 🇰🇭, Lao 🇱🇦, Tagalog 🇵🇭 and Burmese 🇲🇲. The SeaLLM-chat underwent supervised finetuning (SFT) and specialized self-preferencing DPO using a mix of public instruction data and a small number of queries used by SEA language native speakers in natural settings, which adapt to the local cultural norms, customs, styles and laws in these areas. SeaLLM-13b models exhibit superior performance across a wide spectrum of linguistic tasks and assistant-style instruction-following capabilities relative to comparable open-source models. Moreover, they outperform ChatGPT-3.5 in non-Latin languages, such as Thai, Khmer, Lao, and Burmese.
Awesome-Segment-Anything
Awesome-Segment-Anything is a powerful tool for segmenting and extracting information from various types of data. It provides a user-friendly interface to easily define segmentation rules and apply them to text, images, and other data formats. The tool supports both supervised and unsupervised segmentation methods, allowing users to customize the segmentation process based on their specific needs. With its versatile functionality and intuitive design, Awesome-Segment-Anything is ideal for data analysts, researchers, content creators, and anyone looking to efficiently extract valuable insights from complex datasets.
lobe-chat
Lobe Chat is an open-source, modern-design ChatGPT/LLMs UI/Framework. It supports speech synthesis, multi-modal input, and an extensible (function call) plugin system. One-click FREE deployment of your private OpenAI ChatGPT/Claude/Gemini/Groq/Ollama chat application.
MiniCPM
MiniCPM is a series of open-source on-device large models jointly developed by ModelBest and the Tsinghua University Natural Language Processing Laboratory. The main language model, MiniCPM-2B, has only 2.4 billion (2.4B) non-embedding parameters, with a total of 2.7B parameters.
- After SFT, MiniCPM-2B performs similarly to Mistral-7B on public comprehensive evaluation sets (better in Chinese, mathematics, and code capabilities), and outperforms models such as Llama2-13B, MPT-30B, and Falcon-40B overall.
- After DPO, MiniCPM-2B also surpasses many representative open-source large models such as Llama2-70B-Chat, Vicuna-33B, Mistral-7B-Instruct-v0.1, and Zephyr-7B-alpha on MTBench, the current evaluation set closest to the user experience.
- Based on MiniCPM-2B, an on-device multi-modal large model, MiniCPM-V 2.0, is constructed. It achieves the best performance among models below 7B in multiple test benchmarks, and surpasses larger models such as Qwen-VL-Chat 9.6B, CogVLM-Chat 17.4B, and Yi-VL 34B on the OpenCompass leaderboard. MiniCPM-V 2.0 also demonstrates leading OCR capabilities, approaching Gemini Pro in scene text recognition.
- After Int4 quantization, MiniCPM can be deployed for inference on mobile phones, with a streaming output speed slightly higher than human speech speed. MiniCPM-V likewise demonstrates deployment of multi-modal large models on mobile phones.
- Parameter-efficient fine-tuning runs on a single 1080/2080 GPU, and full-parameter fine-tuning runs on a single 3090/4090 GPU. A single machine can continuously train MiniCPM, so the cost of secondary development is relatively low.
MegaDetector
MegaDetector is an AI model that identifies animals, people, and vehicles in camera trap images (which also makes it useful for eliminating blank images). This model is trained on several million images from a variety of ecosystems. MegaDetector is just one of many tools that aim to make conservation biologists more efficient with AI. If you want to learn about other ways to use AI to accelerate camera trap workflows, check out our survey of the field, affectionately titled "Everything I know about machine learning and camera traps".
20 - OpenAI Gpts
Data Engineer
A Data Engineer assistant offering advice on data pipelines and data-related tasks.
Data Engineer Consultant
Guides in data engineering tasks with a focus on practical solutions.
Tech Guru
Meet Tech Guru, your go-to AI for data engineering, coding expertise, and graph databases. It combines humor, reliability, and approachability to simplify tech with a personal touch.
Poke Competitive Pro Guide
A Pokémon competitive build expert, sourcing data from Smogon for single and double battles.
Data Science Copilot
Data science co-pilot specializing in statistical modeling and machine learning.
Data Dynamo
A friendly data science coach offering practical, useful, and accurate advice.
Alas Data Analytics Student Mentor
Hello, I am Alas Academy's AI mentor for Data Analytics. You can ask me any question :)
Data Analytics Specialist
Leading Big Data Analytics tool, blending advanced technology with OpenAI's expertise.