Best AI tools for< Contribute Data >
20 - AI tool Sites
Towards Data Science
Towards Data Science is a Medium publication dedicated to sharing concepts, ideas, and codes in the field of data science. It provides a platform for data scientists, researchers, and practitioners to connect, learn, and contribute to the advancement of the field.
Datafitai
Datafitai is a community platform for ChatGPT prompting, where users can find and share top-rated prompts for various topics such as marketing, coding, finance, writing, gaming, and art. The platform aims to improve the accuracy of ChatGPT responses by providing high-quality prompts. Users can explore, share, and contribute to a wide range of prompts to enhance their ChatGPT experience.
Quick, Draw!
Quick, Draw! is a game built with machine learning. You draw, and a neural network tries to guess what you're drawing. Of course, it doesn't always work. But the more you play with it, the more it will learn. So far we have trained it on a few hundred concepts, and we hope to add more over time. We made this as an example of how you can use machine learning in fun ways.
RejuveAI
RejuveAI is a decentralized token-based system that aims to democratize longevity globally. The Longevity App allows users to monitor essential health metrics, enhance lifespan, and earn RJV tokens. The application leverages revolutionary AI technology for human body analysis, collaborating with researchers, clinics, and data enthusiasts to combat aging. By harnessing sophisticated AI models and neural nets, Bayesian nets, simulations, and the AGI engine, RejuveAI targets in-depth representation of living systems. The platform offers exclusive discounts on travel, supplements, medical tests, and longevity therapies, making innovative outcomes affordable and accessible.
Neurochain AI
Neurochain AI is a decentralized AI-as-a-Service (DeAIAS) network that provides an innovative solution for building, launching, and using AI-powered decentralized applications (dApps). It offers a community-driven approach to AI development, incentivizing contributors with $NCN rewards. The platform aims to address challenges in the centralized AI landscape by democratizing AI development and leveraging global computing resources. Neurochain AI also features a community-powered content generation engine and is developing its own independent blockchain. The team behind Neurochain AI includes experienced professionals in infrastructure, cryptography, computer science, and AI research.
Allen Institute for AI (AI2)
The Allen Institute for AI (AI2) is a leading research institute dedicated to advancing artificial intelligence technologies for the common good. They focus on Natural Language Processing, Computer Vision, and AI applications for the environment. AI2 collaborates with diverse teams to tackle challenging problems in AI research, aiming to create world-changing AI solutions. The institute promotes diversity, equity, and inclusion in the research community, and offers opportunities for individuals to contribute to impactful AI projects.
ONNX
ONNX is an open standard for machine learning interoperability, providing a common format to represent machine learning models. It defines a set of operators and a file format for AI developers to use models across various frameworks, tools, runtimes, and compilers. ONNX promotes interoperability, hardware access, and community engagement.
AIhub
AIhub is a platform that connects the AI community and the world by providing news articles, opinions, and education related to artificial intelligence. It serves as a hub for sharing information, resources, and contributing to the advancement of AI technologies. The platform covers a wide range of topics such as AI research, machine learning, robotics, and the societal impact of AI.
Sarvam AI
Sarvam AI is an AI application focused on leading transformative research in AI to develop, deploy, and distribute Generative AI applications in India. The platform aims to build efficient large language models for India's diverse linguistic culture and enable new GenAI applications through bespoke enterprise models. Sarvam AI is also developing an enterprise-grade platform for developing and evaluating GenAI apps, while contributing to open-source models and datasets to accelerate AI innovation.
Comfy Org
Comfy Org is an open-source AI tooling platform dedicated to advancing and democratizing AI technology. The platform offers tools like node manager, node registry, CLI, automated testing, and public documentation to support the ComfyUI ecosystem. Comfy Org aims to make state-of-the-art AI models accessible to a wider audience by fostering an open-source and community-driven approach. The team behind Comfy Org consists of individuals passionate about developing and maintaining various components of the platform, ensuring a reliable and secure environment for users to explore and contribute to AI tooling.
Golem
Golem is an AI chat application that provides a new ChatGPT experience. It offers a beautiful and user-friendly design, ensuring delightful interactions. Users can chat with a Large Language Model (LLM) securely, with data stored locally or on their personal cloud. Golem is open-source, allowing contributions and use as a reference for Nuxt 3 projects.
Fritz AI
Fritz AI is an AI tool that scans and ranks all AI tools, apps, and websites based on a set of criteria to determine the best and most ethical options. They provide technical guides, reviews, and tutorials to help users get started with machine learning. Fritz AI focuses on ethics, functionality, user experience, and innovation when evaluating tools. Users can contribute tool suggestions and collaborate with the Fritz AI team. The platform also offers beginner-friendly guides, consulting services, and promotes ethical use of AI and machine learning technologies.
DeepLearning.AI
DeepLearning.AI is an AI education platform offering courses and resources to help individuals start or advance their careers in artificial intelligence. Founded by renowned AI expert Andrew Ng, the platform provides a wide range of courses, specializations, newsletters, and community forums to help learners build a strong foundation in machine learning and AI skills. Subscribers can access the latest AI news, insights, and events, and benefit from the expertise of industry leaders. With a focus on practical learning and real-world applications, DeepLearning.AI aims to empower individuals to harness the power of AI and contribute to the rapidly evolving field.
Replicate
Replicate is an AI tool that allows users to run and fine-tune open-source models, deploy custom models at scale, and generate images, text, videos, music, and speech with just one line of code. It provides a platform for the community to contribute and explore thousands of production-ready AI models, enabling users to push the boundaries of AI beyond academic papers and demos. With features like fine-tuning models, deploying custom models, and scaling on Replicate, users can easily create and deploy AI solutions for various tasks.
Devin AI
Devin AI, developed by Cognition Labs, is the world's first fully autonomous AI software engineer. It streamlines software development by handling complex tasks, allowing engineers to focus on more challenging aspects. Devin AI possesses advanced programming skills, can manage complex tasks, understands and learns contextually, integrates with developer tools, and offers collaborative features. It can build and deploy applications, detect and fix bugs, contribute to open-source projects, train AI models, and handle GitHub repositories. Devin AI has demonstrated strong performance in issue resolution, surpassing previous AI models. It is currently in early access, with plans for future enhancements and integration with various development tools and platforms.
Pl@ntNet
Pl@ntNet is a citizen science project available as an application that helps you identify plants from your photos. It is a collaborative project that brings together scientists, naturalists, and citizens from all over the world to collect and share data on plant diversity. The app uses artificial intelligence to identify plants from photos, and the data collected is used to create a global database of plant diversity. Pl@ntNet is free to use and is available in over 20 languages.
Easy Save AI
Easy Save AI is a comprehensive directory of Digital Marketing AI tools available online and curated by a digital marketing expert, Muritala Yusuf. Easy Save AI's primary objective is to ensure that AI is accessible to everyone. You can conveniently utilize our website to discover new AI tools and services or locate specific ones based on your requirements by Using our easy-to-use filter on the home page. AI technology is constantly progressing, and experts are continuously developing sophisticated models for various applications. Our directory includes an array of AI tools such as AI copywriters, text and image generators, AI transcription, SEO automation tools, and more. There is something suitable for every individual! Our website is committed to offering user-friendly AI tools and resources that can contribute to the success of you and your business in the digital era. We meticulously evaluate and curate each tool to ensure they possess valuable features and are accessible to both novices and experts. With the Easy Save AI platform, you can locate the AI tools you require and save valuable time and money. We sometimes have discounts on AI Tools and we always specify on the product page for you to use.
Alignerr
Alignerr is a platform powered by Labelbox that offers subject matter experts the opportunity to align AI models by creating high-quality data in their field of expertise. The platform aims to build the future of Generative AI by enabling experts to contribute to tasks such as coding improvement, data science synthesis, basic math and chemistry comprehension, and creative writing. Alignerr provides a transparent pay structure and allows individuals to work from home on their own schedule, earning up to $150/hr. Contributors can play a pivotal role in shaping the future of artificial intelligence by working on tasks that involve rating or ranking assignments, open rewrite tasks, and multi-modal assignments. The platform emphasizes the responsible development of AI technologies and offers flexibility for professionals to balance work with personal life effortlessly.
Dreamseer
Dreamseer is an innovative AI-powered platform that delves into the realm of dreams, offering a novel approach to self-discovery and personal growth. By harnessing the power of data science and blending it with the art of dream interpretation, Dreamseer empowers users to explore the depths of their unconscious minds, gain insights into their emotions and motivations, and embark on a transformative journey of self-evolution. The platform fosters a collaborative community spirit, where users can contribute their dreams to a collective Dreamverse, co-creating a shared narrative and enriching the understanding of the human psyche. Dreamseer's commitment to privacy and data security ensures that users' dreams remain confidential and protected.
LlamaIndex
LlamaIndex is a leading data framework designed for building LLM (Large Language Model) applications. It allows enterprises to turn their data into production-ready applications by providing functionalities such as loading data from various sources, indexing data, orchestrating workflows, and evaluating application performance. The platform offers extensive documentation, community-contributed resources, and integration options to support developers in creating innovative LLM applications.
20 - Open Source AI Tools
DataEngineeringPilipinas
DataEngineeringPilipinas is a repository dedicated to data engineering resources in the Philippines. It serves as a platform for data engineering professionals to contribute and access high-quality content related to data engineering. The repository provides guidelines for contributing, including forking the repository, making changes, and submitting contributions. It emphasizes the importance of quality, relevance, and respect in the contributions made to the project. By following the guidelines and contributing to the repository, users can help build a valuable resource for the data engineering community in the Philippines and beyond.
Kiln
Kiln is an intuitive tool for fine-tuning LLM models, generating synthetic data, and collaborating on datasets. It offers desktop apps for Windows, MacOS, and Linux, zero-code fine-tuning for various models, interactive data generation, and Git-based version control. Users can easily collaborate with QA, PM, and subject matter experts, generate auto-prompts, and work with a wide range of models and providers. The tool is open-source, privacy-first, and supports structured data tasks in JSON format. Kiln is free to use and helps build high-quality AI products with datasets, facilitates collaboration between technical and non-technical teams, allows comparison of models and techniques without code, ensures structured data integrity, and prioritizes user privacy.
AI_for_Science_paper_collection
AI for Science paper collection is an initiative by AI for Science Community to collect and categorize papers in AI for Science areas by subjects, years, venues, and keywords. The repository contains `.csv` files with paper lists labeled by keys such as `Title`, `Conference`, `Type`, `Application`, `MLTech`, `OpenReviewLink`. It covers top conferences like ICML, NeurIPS, and ICLR. Volunteers can contribute by updating existing `.csv` files or adding new ones for uncovered conferences/years. The initiative aims to track the increasing trend of AI for Science papers and analyze trends in different applications.
Dataset
DL3DV-10K is a large-scale dataset of real-world scene-level videos with annotations, covering diverse scenes with different levels of reflection, transparency, and lighting. It includes 10,510 multi-view scenes with 51.2 million frames at 4k resolution, and offers benchmark videos for novel view synthesis (NVS) methods. The dataset is designed to facilitate research in deep learning-based 3D vision and provides valuable insights for future research in NVS and 3D representation learning.
data-prep-kit
Data Prep Kit is a community project aimed at democratizing and speeding up unstructured data preparation for LLM app developers. It provides high-level APIs and modules for transforming data (code, language, speech, visual) to optimize LLM performance across different use cases. The toolkit supports Python, Ray, Spark, and Kubeflow Pipelines runtimes, offering scalability from laptop to datacenter-scale processing. Developers can contribute new custom modules and leverage the data processing library for building data pipelines. Automation features include workflow automation with Kubeflow Pipelines for transform execution.
awesome-open-data-annotation
At ZenML, we believe in the importance of annotation and labeling workflows in the machine learning lifecycle. This repository showcases a curated list of open-source data annotation and labeling tools that are actively maintained and fit for purpose. The tools cover various domains such as multi-modal, text, images, audio, video, time series, and other data types. Users can contribute to the list and discover tools for tasks like named entity recognition, data annotation for machine learning, image and video annotation, text classification, sequence labeling, object detection, and more. The repository aims to help users enhance their data-centric workflows by leveraging these tools.
free-for-life
A massive list including a huge amount of products and services that are completely free! ⭐ Star on GitHub • 🤝 Contribute # Table of Contents * APIs, Data & ML * Artificial Intelligence * BaaS * Code Editors * Code Generation * DNS * Databases * Design & UI * Domains * Email * Font * For Students * Forms * Linux Distributions * Messaging & Streaming * PaaS * Payments & Billing * SSL
dcai-course
This repository serves as the website for the Introduction to Data-Centric AI class. It contains lab assignments and resources for the course. Users can contribute by opening issues or submitting pull requests. The website can be built locally using Docker and Jekyll. The design is based on Missing Semester. All contents, including source code, lecture notes, and videos, are licensed under CC BY-NC-SA 4.0.
PyAirbyte
PyAirbyte brings the power of Airbyte to every Python developer by providing a set of utilities to use Airbyte connectors in Python. It enables users to easily manage secrets, work with various connectors like GitHub, Shopify, and Postgres, and contribute to the project. PyAirbyte is not a replacement for Airbyte but complements it, supporting data orchestration frameworks like Airflow and Snowpark. Users can develop ETL pipelines and import connectors from local directories. The tool simplifies data integration tasks for Python developers.
machine-learning
Ocademy is an AI learning community dedicated to Python, Data Science, Machine Learning, Deep Learning, and MLOps. They promote equal opportunities for everyone to access AI through open-source educational resources. The repository contains curated AI courses, tutorials, books, tools, and resources for learning and creating Generative AI. It also offers an interactive book to help adults transition into AI. Contributors are welcome to join and contribute to the community by following guidelines. The project follows a code of conduct to ensure inclusivity and welcomes contributions from those passionate about Data Science and AI.
examples
This repository contains a collection of sample applications and Jupyter Notebooks for hands-on experience with Pinecone vector databases and common AI patterns, tools, and algorithms. It includes production-ready examples for review and support, as well as learning-optimized examples for exploring AI techniques and building applications. Users can contribute, provide feedback, and collaborate to improve the resource.
Robyn
Robyn is an experimental, semi-automated and open-sourced Marketing Mix Modeling (MMM) package from Meta Marketing Science. It uses various machine learning techniques to define media channel efficiency and effectivity, explore adstock rates and saturation curves. Built for granular datasets with many independent variables, especially suitable for digital and direct response advertisers with rich data sources. Aiming to democratize MMM, make it accessible for advertisers of all sizes, and contribute to the measurement landscape.
memfree
MemFree is an open-source hybrid AI search engine that allows users to simultaneously search their personal knowledge base (bookmarks, notes, documents, etc.) and the Internet. It features a self-hosted super fast serverless vector database, local embedding and rerank service, one-click Chrome bookmarks index, and full code open source. Users can contribute by opening issues for bugs or making pull requests for new features or improvements.
NeuroAI_Course
Neuromatch Academy NeuroAI Course Syllabus is a repository that contains the schedule and licensing information for the NeuroAI course. The course is designed to provide participants with a comprehensive understanding of artificial intelligence in neuroscience. It covers various topics related to AI applications in neuroscience, including machine learning, data analysis, and computational modeling. The content is primarily accessed from the ebook provided in the repository, and the course is scheduled for July 15-26, 2024. The repository is shared under a Creative Commons Attribution 4.0 International License and software elements are additionally licensed under the BSD (3-Clause) License. Contributors to the project are acknowledged and welcomed to contribute further.
chat-with-your-data-solution-accelerator
Chat with your data using OpenAI and AI Search. This solution accelerator uses an Azure OpenAI GPT model and an Azure AI Search index generated from your data, which is integrated into a web application to provide a natural language interface, including speech-to-text functionality, for search queries. Users can drag and drop files, point to storage, and take care of technical setup to transform documents. There is a web app that users can create in their own subscription with security and authentication.
gcloud-aio
This repository contains shared codebase for two projects: gcloud-aio and gcloud-rest. gcloud-aio is built for Python 3's asyncio, while gcloud-rest is a threadsafe requests-based implementation. It provides clients for Google Cloud services like Auth, BigQuery, Datastore, KMS, PubSub, Storage, and Task Queue. Users can install the library using pip and refer to the documentation for usage details. Developers can contribute to the project by following the contribution guide.
local-rag
Local RAG is an offline, open-source tool that allows users to ingest files for retrieval augmented generation (RAG) using large language models (LLMs) without relying on third parties or exposing sensitive data. It supports offline embeddings and LLMs, multiple sources including local files, GitHub repos, and websites, streaming responses, conversational memory, and chat export. Users can set up and deploy the app, learn how to use Local RAG, explore the RAG pipeline, check planned features, known bugs and issues, access additional resources, and contribute to the project.
BetaML.jl
The Beta Machine Learning Toolkit is a package containing various algorithms and utilities for implementing machine learning workflows in multiple languages, including Julia, Python, and R. It offers a range of supervised and unsupervised models, data transformers, and assessment tools. The models are implemented entirely in Julia and are not wrappers for third-party models. Users can easily contribute new models or request implementations. The focus is on user-friendliness rather than computational efficiency, making it suitable for educational and research purposes.
DB-GPT-Hub
DB-GPT-Hub is an experimental project leveraging Large Language Models (LLMs) for Text-to-SQL parsing. It includes stages like data collection, preprocessing, model selection, construction, and fine-tuning of model weights. The project aims to enhance Text-to-SQL capabilities, reduce model training costs, and enable developers to contribute to improving Text-to-SQL accuracy. The ultimate goal is to achieve automated question-answering based on databases, allowing users to execute complex database queries using natural language descriptions. The project has successfully integrated multiple large models and established a comprehensive workflow for data processing, SFT model training, prediction output, and evaluation.
DataHorse
DataHorse is an open-source tool and Python library that simplifies data science for everyone. It allows users to interact with data in plain English without requiring technical skills. Users can create graphs, modify data, and build machine learning models to make predictions. The tool is designed to help businesses and individuals quickly understand their data and make data-driven decisions with ease.
14 - OpenAI Gpts
Volunteer.bot
Welcome to Volunteer.bot, your go-to AI for volunteer opportunities and guidance. Find meaningful ways to contribute to community, environmental, and global causes. Accessible, informative, and supportive, we're here to help you make a difference
Open Source Starter Guide
Open Source Guide for Everyone: First time contributors, maintainers, and the curious.
Design System Technical Specialist
Expert in Technical Design System Foundations and Components
Birdwatching GPT
Expert in bird identification, habitats, and personalized birdwatching guidance.
Reaganbot
Let me explain how my brilliant policies contributed to the success of our great country.
OpenIndex.ai
Chat with all the knowledge, documents and collections contributed to the OpenIndex search engine.
Web3 GPT
A Web3 expert providing in-depth knowledge on blockchain, cryptocurrencies, and more.