![Awesome-AI-Data-Guided-Projects](/statics/github-mark.png)
Awesome-AI-Data-Guided-Projects
A curated list of data science & AI guided projects to start building your portfolio
Stars: 83
![screenshot](/screenshots_githubs/youssefHosni-Awesome-AI-Data-Guided-Projects.jpg)
A curated list of data science & AI guided projects to start building your portfolio. The repository contains guided projects covering various topics such as large language models, time series analysis, computer vision, natural language processing (NLP), and data science. Each project provides detailed instructions on how to implement specific tasks using different tools and technologies.
README:
A curated list of data science & AI guided projects to start building your portfolio
- Building a Conversational Chatbot with Langchain and Large Language Models
- Building a Conversational Chatbot with Langchain and Large Language Models
- Chat with your Document using OpenAI API & Streamlit
- Building A Falcon 4 0BChat Web Application using HuggingFace & Gardio
- Building GPT Banker Using LLaMA 2 70B
- Finetune Llama 2 On Your Local Machine Using HuggingFace Autotrain
- Building a Lex Fridman Podcast Summarization App with Whisper Jax, Azure OpenAI, and Langchain
- Creating a Veterinary Chatbot using Llama 2: Harnessing Gen AI for Pet Care
- Deploy Llama 2 on AWS SageMaker using DLC (Deep Learning Containers)
- Article/Blog Generation App using Llama2, Langchain, and Pexels
- Efficiently Train Large Language Models with LoRA and Hugging Face
- Fine-Tune Llama 2 Model in a Colab Notebook
- Guanaco Chatbot Demo with LLaMA-7B Model
- PEFT Finetune-Bloom-560m-tagger
- Finetune_Meta_OPT-6–1b_Model_bnb_peft
- Finetune Falcon-7b with BNB Self-Supervised Training
- FineTune LLaMa2 with QLoRa
- Stable Vicuna 1 3B_8bit in Google Colab
- GPT-Neo-X 20B bnb2bit Training
- MPT-Instruct-30B Model Training
- RLHF Training for CustomDataset for AnyModel
- Fine Tuning Microsoft Phi 15 B On Custom Dataset
- Finetuning OpenAI GPT3.5 Turbo
- Finetuning Mistral-7b using Autotrain Advanced
- Stock Price Prediction And Forecasting Using Stacked LSTM- Deep Learning
- Time Series Forecasting with Facebook Prophet and Python
- Forecasting Future Sales Using ARIMA and SARIMAX
- Forecasting Weather with Neural Prophet and Python
- Time Series Forecasting with XGBoost — Use Python and machine learning to predict energy consumption
- Build A Stock Prediction Web App In Python
- Time Series Forecasting with PyCaret Regression Module
- [Hourly Energy Data Time Series Analysis](](https://towardsdatascience.com/part-1-time-series-analysis-predicting-hourly-energy-consumption-of-san-diego-short-term-long-3a1dd1a589c9)
- Stock Market Performance Analysis using Python
- Exploratory Data Analysis for Time Series Data using PyCaret
- Machine Learning — Anomaly Detection via PyCaret
- An End-to-End Unsupervised Anomaly Detection
- Time Series Anomaly Detection with PyFBAD
- Build a Deep Facial Recognition App from Paper to Code
- Real-Time Sign Language Detection Web App
- Building Production-Ready Enterprise-Level Image Classifier with AWS & React
- Building Computer Vision Mobile Apps
- Building an Image Search Engine with Python and OpenCV
- Vehicle Detection, Tracking, and Speed Estimation using a Raspberry Pi & Intel Movidius NCS
- MaskFormer Segmentation Model with Hugging Face Transformers
- Age Detection with Deep Learning
- Building a Horse to Zebra CycleGAN Webapp with Streamlit
- Computer Vision App with Azure Cognitive Services
- Object Detection with Amazon Sagemaker
- Potato Disease Classification Mobile APP
- Sports Celebrity Image Classification Web App
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for Awesome-AI-Data-Guided-Projects
Similar Open Source Tools
![Awesome-AI-Data-Guided-Projects Screenshot](/screenshots_githubs/youssefHosni-Awesome-AI-Data-Guided-Projects.jpg)
Awesome-AI-Data-Guided-Projects
A curated list of data science & AI guided projects to start building your portfolio. The repository contains guided projects covering various topics such as large language models, time series analysis, computer vision, natural language processing (NLP), and data science. Each project provides detailed instructions on how to implement specific tasks using different tools and technologies.
![Awesome-AI-Data-GitHub-Repos Screenshot](/screenshots_githubs/youssefHosni-Awesome-AI-Data-GitHub-Repos.jpg)
Awesome-AI-Data-GitHub-Repos
Awesome AI & Data GitHub-Repos is a curated list of essential GitHub repositories covering the AI & ML landscape. It includes resources for Natural Language Processing, Large Language Models, Computer Vision, Data Science, Machine Learning, MLOps, Data Engineering, SQL & Database, and Statistics. The repository aims to provide a comprehensive collection of projects and resources for individuals studying or working in the field of AI and data science.
![paper-reading Screenshot](/screenshots_githubs/JackonYang-paper-reading.jpg)
paper-reading
This repository is a collection of tools and resources for deep learning infrastructure, covering programming languages, algorithms, acceleration techniques, and engineering aspects. It provides information on various online tools for chip architecture, CPU and GPU benchmarks, and code analysis. Additionally, it includes content on AI compilers, deep learning models, high-performance computing, Docker and Kubernetes tutorials, Protobuf and gRPC guides, and programming languages such as C++, Python, and Shell. The repository aims to bridge the gap between algorithm understanding and engineering implementation in the fields of AI and deep learning.
![big-AGI Screenshot](/screenshots_githubs/enricoros-big-AGI.jpg)
big-AGI
big-AGI is an AI suite designed for professionals seeking function, form, simplicity, and speed. It offers best-in-class Chats, Beams, and Calls with AI personas, visualizations, coding, drawing, side-by-side chatting, and more, all wrapped in a polished UX. The tool is powered by the latest models from 12 vendors and open-source servers, providing users with advanced AI capabilities and a seamless user experience. With continuous updates and enhancements, big-AGI aims to stay ahead of the curve in the AI landscape, catering to the needs of both developers and AI enthusiasts.
![Tutorial-of-AI-Kit-with-Raspberry-Pi-From-Zero-to-Hero Screenshot](/screenshots_githubs/Seeed-Projects-Tutorial-of-AI-Kit-with-Raspberry-Pi-From-Zero-to-Hero.jpg)
Tutorial-of-AI-Kit-with-Raspberry-Pi-From-Zero-to-Hero
This course is designed to teach you how to harness the power of AI on the Raspberry Pi, with a focus on using an AI kit for computer vision tasks. Learn to integrate AI into IoT applications, from object detection to visual recognition. Suitable for hobbyists, students, and professionals to bring AI-driven solutions to life on resource-constrained devices like the Raspberry Pi.
![llamator Screenshot](/screenshots_githubs/RomiconEZ-llamator.jpg)
llamator
LLAMATOR is a Red Teaming python-framework designed for testing chatbots and LLM-systems. It provides support for custom attacks, a wide range of attacks on RAG/Agent/Prompt in English and Russian, custom configuration of chat clients, history of attack requests and responses in Excel and CSV format, and test report document generation in DOCX format. The tool is classified under OWASP for Prompt Injection, Prompt Leakage, and Misinformation. It is supported by AI Security Lab ITMO, Raft Security, and AI Talent Hub.
![ai_all_resources Screenshot](/screenshots_githubs/nivu-ai_all_resources.jpg)
ai_all_resources
This repository is a compilation of excellent ML and DL tutorials created by various individuals and organizations. It covers a wide range of topics, including machine learning fundamentals, deep learning, computer vision, natural language processing, reinforcement learning, and more. The resources are organized into categories, making it easy to find the information you need. Whether you're a beginner or an experienced practitioner, you're sure to find something valuable in this repository.
![Awesome-Lists-and-CheatSheets Screenshot](/screenshots_githubs/wx-chevalier-Awesome-Lists-and-CheatSheets.jpg)
Awesome-Lists-and-CheatSheets
Awesome-Lists is a curated index of selected resources spanning various fields including programming languages and theories, web and frontend development, server-side development and infrastructure, cloud computing and big data, data science and artificial intelligence, product design, etc. It includes articles, books, courses, examples, open-source projects, and more. The repository categorizes resources according to the knowledge system of different domains, aiming to provide valuable and concise material indexes for readers. Users can explore and learn from a wide range of high-quality resources in a systematic way.
![MahjongCopilot Screenshot](/screenshots_githubs/latorc-MahjongCopilot.jpg)
MahjongCopilot
Mahjong Copilot is an AI assistant for the game Mahjong, based on the mjai (Mortal model) bot implementation. It provides step-by-step guidance for each move in the game, and can also be used to automatically play and join games. Mahjong Copilot supports both 3-person and 4-person Mahjong games, and is available in multiple languages.
![nuitrack-sdk Screenshot](/screenshots_githubs/3DiVi-nuitrack-sdk.jpg)
nuitrack-sdk
Nuitrack™ is an ultimate 3D body tracking solution developed by 3DiVi Inc. It enables body motion analytics applications for virtually any widespread depth sensors and hardware platforms, supporting a wide range of applications from real-time gesture recognition on embedded platforms to large-scale multisensor analytical systems. Nuitrack provides highly-sophisticated 3D skeletal tracking, basic facial analysis, hand tracking, and gesture recognition APIs for UI control. It offers two skeletal tracking engines: classical for embedded hardware and AI for complex poses, providing a human-centric spatial understanding tool for natural and intelligent user engagement.
![intel-extension-for-tensorflow Screenshot](/screenshots_githubs/intel-intel-extension-for-tensorflow.jpg)
intel-extension-for-tensorflow
Intel® Extension for TensorFlow* is a high performance deep learning extension plugin based on TensorFlow PluggableDevice interface. It aims to accelerate AI workloads by allowing users to plug Intel CPU or GPU devices into TensorFlow on-demand, exposing the computing power inside Intel's hardware. The extension provides XPU specific implementation, kernels & operators, graph optimizer, device runtime, XPU configuration management, XPU backend selection, and options for turning on/off advanced features.
![DecryptPrompt Screenshot](/screenshots_githubs/DSXiangLi-DecryptPrompt.jpg)
DecryptPrompt
This repository does not provide a tool, but rather a collection of resources and strategies for academics in the field of artificial intelligence who are feeling depressed or overwhelmed by the rapid advancements in the field. The resources include articles, blog posts, and other materials that offer advice on how to cope with the challenges of working in a fast-paced and competitive environment.
![Awesome-Lists Screenshot](/screenshots_githubs/wx-chevalier-Awesome-Lists.jpg)
Awesome-Lists
Awesome-Lists is a curated list of awesome lists across various domains of computer science and beyond, including programming languages, web development, data science, and more. It provides a comprehensive index of articles, books, courses, open source projects, and other resources. The lists are organized by topic and subtopic, making it easy to find the information you need. Awesome-Lists is a valuable resource for anyone looking to learn more about a particular topic or to stay up-to-date on the latest developments in the field.
![ai-chatbot-framework Screenshot](/screenshots_githubs/alfredfrancis-ai-chatbot-framework.jpg)
ai-chatbot-framework
An AI Chatbot framework built in Python. It allows users to easily create Natural Language conversational scenarios with no coding efforts. The tool continuously learns from conversations to improve its capabilities. It can be integrated with various channels like Messenger and Slack. Users can create AI-powered chatbots without expertise in artificial intelligence.
![KB-Builder Screenshot](/screenshots_githubs/hduchat-KB-Builder.jpg)
KB-Builder
KB Builder is an open-source knowledge base generation system based on the LLM large language model. It utilizes the RAG (Retrieval-Augmented Generation) data generation enhancement method to provide users with the ability to enhance knowledge generation and quickly build knowledge bases based on RAG. It aims to be the central hub for knowledge construction in enterprises, offering platform-based intelligent dialogue services and document knowledge base management functionality. Users can upload docx, pdf, txt, and md format documents and generate high-quality knowledge base question-answer pairs by invoking large models through the 'Parse Document' feature.
For similar tasks
![Flowise Screenshot](/screenshots_githubs/FlowiseAI-Flowise.jpg)
Flowise
Flowise is a tool that allows users to build customized LLM flows with a drag-and-drop UI. It is open-source and self-hostable, and it supports various deployments, including AWS, Azure, Digital Ocean, GCP, Railway, Render, HuggingFace Spaces, Elestio, Sealos, and RepoCloud. Flowise has three different modules in a single mono repository: server, ui, and components. The server module is a Node backend that serves API logics, the ui module is a React frontend, and the components module contains third-party node integrations. Flowise supports different environment variables to configure your instance, and you can specify these variables in the .env file inside the packages/server folder.
![nlux Screenshot](/screenshots_githubs/nluxai-nlux.jpg)
nlux
nlux is an open-source Javascript and React JS library that makes it super simple to integrate powerful large language models (LLMs) like ChatGPT into your web app or website. With just a few lines of code, you can add conversational AI capabilities and interact with your favourite LLM.
![generative-ai-go Screenshot](/screenshots_githubs/google-generative-ai-go.jpg)
generative-ai-go
The Google AI Go SDK enables developers to use Google's state-of-the-art generative AI models (like Gemini) to build AI-powered features and applications. It supports use cases like generating text from text-only input, generating text from text-and-images input (multimodal), building multi-turn conversations (chat), and embedding.
![awesome-langchain-zh Screenshot](/screenshots_githubs/microxxx-awesome-langchain-zh.jpg)
awesome-langchain-zh
The awesome-langchain-zh repository is a collection of resources related to LangChain, a framework for building AI applications using large language models (LLMs). The repository includes sections on the LangChain framework itself, other language ports of LangChain, tools for low-code development, services, agents, templates, platforms, open-source projects related to knowledge management and chatbots, as well as learning resources such as notebooks, videos, and articles. It also covers other LLM frameworks and provides additional resources for exploring and working with LLMs. The repository serves as a comprehensive guide for developers and AI enthusiasts interested in leveraging LangChain and LLMs for various applications.
![Large-Language-Model-Notebooks-Course Screenshot](/screenshots_githubs/peremartra-Large-Language-Model-Notebooks-Course.jpg)
Large-Language-Model-Notebooks-Course
This practical free hands-on course focuses on Large Language models and their applications, providing a hands-on experience using models from OpenAI and the Hugging Face library. The course is divided into three major sections: Techniques and Libraries, Projects, and Enterprise Solutions. It covers topics such as Chatbots, Code Generation, Vector databases, LangChain, Fine Tuning, PEFT Fine Tuning, Soft Prompt tuning, LoRA, QLoRA, Evaluate Models, Knowledge Distillation, and more. Each section contains chapters with lessons supported by notebooks and articles. The course aims to help users build projects and explore enterprise solutions using Large Language Models.
![ai-chatbot Screenshot](/screenshots_githubs/vercel-ai-chatbot.jpg)
ai-chatbot
Next.js AI Chatbot is an open-source app template for building AI chatbots using Next.js, Vercel AI SDK, OpenAI, and Vercel KV. It includes features like Next.js App Router, React Server Components, Vercel AI SDK for streaming chat UI, support for various AI models, Tailwind CSS styling, Radix UI for headless components, chat history management, rate limiting, session storage with Vercel KV, and authentication with NextAuth.js. The template allows easy deployment to Vercel and customization of AI model providers.
![awesome-local-llms Screenshot](/screenshots_githubs/vince-lam-awesome-local-llms.jpg)
awesome-local-llms
The 'awesome-local-llms' repository is a curated list of open-source tools for local Large Language Model (LLM) inference, covering both proprietary and open weights LLMs. The repository categorizes these tools into LLM inference backend engines, LLM front end UIs, and all-in-one desktop applications. It collects GitHub repository metrics as proxies for popularity and active maintenance. Contributions are encouraged, and users can suggest additional open-source repositories through the Issues section or by running a provided script to update the README and make a pull request. The repository aims to provide a comprehensive resource for exploring and utilizing local LLM tools.
![Awesome-AI-Data-Guided-Projects Screenshot](/screenshots_githubs/youssefHosni-Awesome-AI-Data-Guided-Projects.jpg)
Awesome-AI-Data-Guided-Projects
A curated list of data science & AI guided projects to start building your portfolio. The repository contains guided projects covering various topics such as large language models, time series analysis, computer vision, natural language processing (NLP), and data science. Each project provides detailed instructions on how to implement specific tasks using different tools and technologies.
For similar jobs
![weave Screenshot](/screenshots_githubs/wandb-weave.jpg)
weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.
![LLMStack Screenshot](/screenshots_githubs/trypromptly-LLMStack.jpg)
LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.
![VisionCraft Screenshot](/screenshots_githubs/VisionCraft-org-VisionCraft.jpg)
VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.
![kaito Screenshot](/screenshots_githubs/Azure-kaito.jpg)
kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.
![PyRIT Screenshot](/screenshots_githubs/Azure-PyRIT.jpg)
PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.
![tabby Screenshot](/screenshots_githubs/TabbyML-tabby.jpg)
tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.
![spear Screenshot](/screenshots_githubs/isl-org-spear.jpg)
spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.
![Magick Screenshot](/screenshots_githubs/Oneirocom-Magick.jpg)
Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.