END-TO-END-GENERATIVE-AI-PROJECTS

End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects

Stars: 145

The 'END TO END GENERATIVE AI PROJECTS' repository is a collection of industry projects that use Large Language Models (LLMs) for tasks such as chatting with PDFs, image-to-speech generation, video transcription and summarization, resume tracking, text-to-SQL conversion, invoice extraction, medical chatbots, and financial stock analysis. The projects showcase the deployment of models such as Google Gemini Pro, HuggingFace models, and OpenAI GPT, together with technologies such as Langchain, Streamlit, Llama 2, and LlamaIndex. The repository aims to provide end-to-end solutions for different AI applications.

README:

END TO END GENERATIVE AI PROJECTS

Awesome Projects Collection on LLM Models

🌟🧑‍💻End to End Generative AI Industry Projects on LLM Models, RAG, AI Agents, AI Chatbot, MultiModals with Deployment 👩‍💻🌟

S.No	Project Name	Description	Tech Stack
1	Multi-PDFs 📚ChatApp AI Agent 🤖	Chat seamlessly with multiple PDFs using Langchain, Google Gemini Pro, and a FAISS vector DB, deployed with Streamlit. Get instant, accurate responses from the Google Gemini language model. 📚💬 Transform your PDF experience now! 🔥✨	`F/w`: Langchain, `Model`: Google Gemini Pro, `Vector DB`: FAISS, `Deployment`: Streamlit
2	🖼️Image to Speech GenAI Tool Using LLM 🌟♨️	AI tool that generates a short audio story based on the context of an uploaded image by prompting a GenAI LLM model.	`F/w`: Langchain, `Model`: HuggingFace Models, OpenAI GPT-3.5, `Deployment`: Streamlit, Hugging Face Spaces
3	Youtube Video Transcribe Summarizer LLM App	End To End Youtube Video Transcribe Summarizer LLM App With Google Gemini Pro providing detailed notes based on YouTube video transcripts. With the power of AI, you can now convert video transcripts into comprehensive study materials.	Google Gemini Pro
4	End to end RAG LLM App	Step-by-Step Guide to Building a RAG LLM App with Llama 2 and LlamaIndex	Llama 2 and LlamaIndex
5	Resume ATS Tracking LLM Project	This is a project aiming to optimize the recruitment process. It integrates an advanced Applicant Tracking System with Google Gemini Pro, streamlining resume parsing, keyword matching, and candidate evaluation for an efficient end-to-end solution in talent acquisition.	Google Gemini Pro
6	End To End Text To SQL LLM App Along With Querying SQL Database	The "Text to SQL LLM App with Google Gemini Pro" is a software application that facilitates the conversion of natural language queries into SQL commands. It also enables querying SQL databases directly using the generated SQL commands.	Using Google Gemini Pro
7	End To End Multi Language Invoice Extractor Project	MultiLanguage Invoice Extractor 💼✨ Discover the power of MultiLanguage Invoice Extractor! This Streamlit app, powered by Google Gemini Pro Vision AI, makes extracting information from invoice images a breeze. Upload images, add prompts, and get detailed responses effortlessly, with multi-language support.	Using Google Gemini Pro
8	PDF Document Question Answering LLM System		Langchain,Cassandra,Astra DB,Vector Database
9	Fine Tune LLAMA 2 With Custom Dataset		Using `LoRA` And `QLoRA` Techniques
10	End to End RAG LLM App: Indexing & Querying Multiple Pdf's		Using Llamaindex and OpenAI
11	Real Time Financial Stock Analysis		Using `CrewAI`, `Groq`, `LangChain` & some other APIs like `browserless, Serper and SEC EDGAR API`
12	Medical ChatBot	The Llama2 Medical Bot is a powerful tool designed to provide medical information by answering user queries using state-of-the-art language models and vector stores. The bot runs on a decent CPU machine with a minimum of 16GB of RAM.	Using `Llama2` and `Sentence Transformers`. Powered by `Langchain` and `Chainlit`
13	Medical Mixture-of Experts LLM	Medical Mixture of Experts LLM using Mergekit.	MergeKit
14	Haystack and Mistral 7B RAG Implementation	Haystack and Mistral 7B RAG Implementation. It is based on completely open-source stack.	Haystack-and-Mistral-7B-RAG
15	Power QnA Chatbot	Question Answer Generation App using Mistral 7B, Langchain, and FastAPI.	Mistral 7B, Langchain, and FastAPI.
16	RAG	Gemma-7B-RAG-using-Ollama	Gemma-7B-RAG-using-Ollama
17	On-device LLM Inference	On-device LLM Inference using Mediapipe LLM Inference API.	Using Mediapipe LLM Inference API.
18	Personal Voice Assistant using OpenAI
19	Fast Fine Tuning and DPO Training of LLMs using Unsloth
20	Groq Chat App	Groq Chat App built using Groq API and Streamlit.	Groq API and Streamlit.
21	Medical RAG using Bio-Mistral-7B	This is a RAG implementation using an open-source stack. BioMistral 7B has been used to build this app along with PubMedBert as the embedding model, Qdrant as a self-hosted vector DB, and Langchain & Llama CPP as orchestration frameworks.	RAG implementation, BioMistral 7B, PubMedBert, Qdrant, Langchain & Llama CPP
22	End-to-End RAG Implementation-using Amazon Bedrock		Amazon Bedrock
23	Faster Stable Diffusion using SSD-1B	Faster Stable Diffusion using SSD-1B. A gradio app inside for demo.	Stable Diffusion using SSD-1B, Gradio
24	Phi-2 Fine-Tuning	Phi-2 Fine Tuning to build a mental health GPT.	Phi-2-Fine-Tuning
25	Medical RAG Using Meditron-7B-LLM	Medical RAG QA App using Meditron 7B LLM, Qdrant Vector Database, and PubMedBERT Embedding Model.	Meditron 7B LLM, Qdrant, PubMedBERT
26	Fastest Image Generation using LCM LoRA.		LoRA
27	HyDE based RAG using NVIDIA NIM		HyDE based RAG, NVIDIA NIM
28	Building Intelligent Systems Using Visdum-AI		Visdum-AI
29	Zephyr 7B beta RAG Demo inside a Gradio App	Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta.	Zephyr 7B beta, RAG, Gradio, BGE Embeddings, ChromaDB
30	LangChain Expression Language	Intro to LangChain Expression Language.	LCEL
31	Fine Tuning Multimodal LLM	Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.	Multimodal LLM "Idefics 9B"
32	RAG Tool using Haystack, Mistral and Chainlit	RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.	RAG, Haystack, Mistral, Chainlit
33	Prompt Compression Using LLMLingua	Prompt Compression using LLMLingua. It helps reduce token cost and latency.	Prompt Compression, LLMLingua
34	Stream Diffusion in Colab
35	Multimodal-RAG Using Langchain	Multimodal-RAG-using-Langchain	RAG, Langchain
36	Secure-AI-LLM Chatbots Using Prompt Injection Prevention Techniques	Prompt Injection & Prevention techniques. Secure your AI Chatbots built using LLMs.	Prompt Injection, LLMs
37	GGUF Quantization Of any LLM	GGUF-Quantization-of-any-LLM	GGUF-Quantization
38	Deltamon Anime Using LoRA	Deltamon-Anime-using-LoRA	LoRA
39	Evaluation of LLMs and RAGs	Evaluation-of-LLMs-and-RAGs. A complete guide to evaluate LLMs and RAGs covering theory and code based approaches.	LLMs, RAGs
40	Unsloth Fine-Tuning	Unsloth-Fine-Tuning	Unsloth
41	SLIM Models by LLMWare	SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.	SLIM Models, LLMWare, AI Agents, Function Calls, Streamlit App
42	Small Multimodal Vision Model	Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.	Small Multimodal Vision Model "Imp-v1-3b", Phi-2, Siglip
43	Langsmith Implementation	Langsmith-Implementation	Langsmith
44	Langserve Implementation	Langserve-Implementation	Langserve
45	Multimodal AI App using Llava 7B and Gradio	Multimodal AI App using Llava 7B and Gradio	Llava 7B, Gradio
46	Perplexity Lite	Perplexity Lite using Langgraph, Tavily, and GPT-4.	LangGraph, Tavily and GPT-4.
47	Generative-AI-LLM-Projects	Gen AI End To End Large Language Model Projects	30+ Gen AI End To End Large Language Model Projects With Latest OpenSource Models, Fine Tuning
48	MusicAI	Custom Music Generation with Transformers and PyTorch	Transformers, PyTorch
49	Audio Summarization App using Gemini LLM	Audio Summarization App using Gemini LLM	Gemini 1.5, LLM
50	Fine Tune Multimodal LLM "Idefics 2" using QLoRA	Fine Tune Multimodal LLM "Idefics 2" using QLoRA.	Multimodal LLM "Idefics 2", QLoRA
51	Llama 3 ORPO FineTuning	Llama 3 ORPO Fine Tuning on A100 in Colab Pro.	Llama 3 ORPO
52	RAG using Llama3, Langchain and ChromaDB	This project uses Llama3, Langchain, and ChromaDB to build a Retrieval Augmented Generation (RAG) system. The system lets you ask questions about your documents, even if the information wasn't included in the training data for the Large Language Model (LLM). Retrieval Augmented Generation works by first performing a retrieval step when presented with a question. This step fetches relevant documents from a vector database in which the documents have been indexed.	RAG using Llama3, Langchain and ChromaDB
53	LLAMA-3 70B LLM with NVIDIA	Meet LLAMA3 Chat AI App! 🚀 Meta Unveils Llama 3, the Most Powerful Open Source Model Yet. Chat seamlessly with the LLAMA3 chatbot. Get instant, accurate responses from the Llama3 open-source language model. 📚💬	LLAMA-3 70B LLM with NVIDIA, Streamlit UI
54	Efficiently fine-tune Llama 3 with PyTorch FSDP and Q-Lora	Efficiently fine-tune Llama 3 with PyTorch FSDP and Q-Lora	Llama 3 with PyTorch FSDP and Q-Lora, Fine Tuning
55	🌟META LLAMA3 GENAI Real World UseCases End To End Implementation Guides📚	LLAMA3 GENAI UseCases	Llama3, FineTuning, Deployment, RAG, Langchain
56	⭐Meta's LLaMA3-Quantization🦌💎💫	LLaMA3-Quantization is the official implementation of the paper "How Good Are Low-bit Quantized LLAMA3 Models?". It evaluates 10 existing post-training quantization and LoRA-finetuning methods on LLaMA3 at 1-8 bits and on diverse datasets to comprehensively reveal LLaMA3's low-bit quantization performance.	Quantization, GenerativeAI, llama3-meta-ai
57	Ollama-UseCases🌟	This repo brings numerous use cases from the Open Source Ollama	Ollama
58	AI Agents💫	Design Patterns for Multi-Agent Frameworks like Autogen, LangGraph, Taskweaver, CrewAI, etc.	Multi-Agent Frameworks like Autogen, LangGraph, Taskweaver, CrewAI
59	RAG with LlamaIndex and NVIDIA	RAG with LlamaIndex and NVIDIA	RAG with LlamaIndex and NVIDIA
60	Quantize LLM using AWQ	Quantize LLM using AWQ	Quantize LLM using AWQ
61	LLMs Inference and Fine Tuning	Estimate Memory Consumption of LLMs Inference and Fine Tuning	LLMs Inference and Fine Tuning
62	Phi-3 LLM by Microsoft	Phi-3 LLM by Microsoft Implementation	Phi-3 LLM
63	🔥Advanced RAG💫🌟	Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using Langchain, OpenAI GPTs, META LLAMA3, and Agents.	Advanced Retrieval-Augmented Generation (RAG), Langchain, OpenAI GPTs, META LLAMA3, Agents.
64	RAG-using-AWS-Bedrock-and-Azure-OpenAI	RAG-using-AWS-Bedrock-and-Azure-OpenAI	RAG, AWS-Bedrock, Azure-OpenAI, Generative AI
65	LLM SECURITY 2024	Securing LLMs Against the OWASP Top 10 Large Language Model Vulnerabilities 2024	OWASP, LLM Security, Vulnerabilities, Data Security, Cyber Security, Generative AI
66	GPT4o-API-Implementation-GPT4-RAG	Getting Started with GPT4 API, GPT4 RAG, OpenAI GPT4 Assistant, OpenAI Models	openai-api, gpt-4, large-language-models, generative-ai, gpt4-api, gpt4o
67	PaliGemma Inference and Fine Tuning	PaliGemma Inference and Fine Tuning	PaliGemma, Inference, Fine Tuning, Generative-AI
68	LLMs Evaluation	LLMs Evaluation	LLMs Evaluation, Generative AI
69	Building RAG With OpenAI GPT-4o(omni) Model Using Objectbox Vector Database	Building RAG With OpenAI GPT-4o(omni) Model Using Objectbox Vector Database	RAG, OpenAI GPT-4o(omni) Model, Objectbox Vector Database
70	PaliGemma FineTuning	PaliGemma FineTuning	PaliGemma, FineTuning
71	RAG Evaluator	A library for evaluating Retrieval-Augmented Generation (RAG) systems	RAG Evaluator, Metrics: BLEU, ROUGE, BERT, Perplexity, Diversity, Racial Bias
72	Griptape: Create Customisable Multi AI Agents from Scratch	Griptape: Create Customisable Multi AI Agents from Scratch	Agent-based-framework, Griptape, llm, Generative-ai, AIagents
73	Synthetic Data Generation using LLM	Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.	Synthetic Data Generation, LLM, Argilla, Distilabel, ChatGPT
74	Groq-Whisper Fast Transcription App	Groq-Whisper Fast Transcription App built using Groq API and Streamlit	Groq-Whisper, LLM, Streamlit
75	CrewAI AgentOps	CrewAI AgentOps: Monitor your AI Agents	Agentops, Generative-AI, Crewai, AIagents
76	Agentic RAG using Crew AI	Agentic RAG using Crew AI	RAG, Generative-AI, Crewai, AIagents, Agentic-RAG, Agentic-ai, Crewai-RAG
77	AI Agents using Crew AI	AI Agents Streamlit App using Crew AI	AI Agents, Streamlit App, GenerativeAI, Crew AI
78	Multi GPU Fine Training LLMs	Multi GPU Fine Training LLMs using DeepSpeed and Accelerate.	accelerate, gpu-computing, finetuning, deepspeed, large-language-models, generative-ai
79	LLM based Finance Agent	An intelligent agent utilizing Large Language Models (LLMs) for automated financial news retrieval and stock price prediction.	Agent-based, finance-api, LLMs, generative-ai, gemini-pro
80	Multi-Agent AI App	The Multi-Agents AI App from Scratch is a Python-based application leveraging OpenAI's GPT-4o model to perform specialized tasks through a collaborative Multi-Agent Architecture. Built with Streamlit for an intuitive web interface without any Agents frameworks/libraries, this system includes agents for Summarizing Medical Texts, Writing Research Articles, and Sanitizing Medical Data.	Agent-based, Multi-Agent Architecture, LLMs, GenerativeAI, Streamlit
81	RAG Based LLM AI Chatbot	RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Container)	RAG, LLAMA Model, LLMs, GenerativeAI, Streamlit, Qdrant
	More projects are coming!
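
Project 52 above describes the retrieval step of RAG: when a question arrives, relevant documents are fetched from an indexed store before the LLM is prompted. The snippet below is a minimal sketch of that idea only. It is not the code of any project in this list: it swaps the real embedding model and vector database (e.g. ChromaDB) for a toy bag-of-words vector and an in-memory list, and the names `embed`, `retrieve`, and `build_prompt` as well as the sample documents are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, index, k=2):
    """Return the k indexed documents most similar to the question."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]


def build_prompt(question, context_docs):
    """Put the retrieved documents in front of the question for the LLM."""
    context = "\n".join(f"- {d}" for d in context_docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical documents; the in-memory list stands in for a vector database.
documents = [
    "Invoices must be paid within 30 days of receipt.",
    "The office is closed on public holidays.",
    "Late invoices incur a 2 percent monthly fee.",
]
index = [(doc, embed(doc)) for doc in documents]

question = "When must invoices be paid?"
top_docs = retrieve(question, index, k=2)
prompt = build_prompt(question, top_docs)
print(prompt)
```

In a real app the resulting `prompt` would be sent to the LLM. Only the retrieved documents reach the model, which is how the system can answer questions about content that was never in its training data.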

©️ Awesome LLM Projects License 🪪

Distributed under the MIT License. See LICENSE for more information.

If you found these Generative AI project implementations useful, please drop a ⭐ on this repo. If you have exciting ideas, contributions are welcome! 🌟🔦💁

For Tasks:

chat with pdfs, generate audio stories, summarize video transcripts, track resumes, extract invoice information

For Jobs:

data scientist, machine learning engineer, ai researcher, nlp engineer, ai solutions architect

Alternative AI tools for END-TO-END-GENERATIVE-AI-PROJECTS

Similar Open Source Tools

AI0x0.com

AI 0x0 is a versatile AI query generation desktop floating assistant application that supports MacOS and Windows. It allows users to utilize AI capabilities in any desktop software to query and generate text, images, audio, and video data, helping them work more efficiently. The application features a dynamic desktop floating ball, floating dialogue bubbles, customizable presets, conversation bookmarking, preset packages, network acceleration, query mode, input mode, mouse navigation, deep customization of ChatGPT Next Web, support for full-format libraries, online search, voice broadcasting, voice recognition, voice assistant, application plugins, multi-model support, online text and image generation, image recognition, frosted glass interface, light and dark theme adaptation for each language model, and free access to all language models except Chat0x0 with a key.

Stars: 3.5k

AI-Competition-Collections

AI-Competition-Collections is a repository that collects and curates various experiences and tips from AI competitions. It includes posts on competition experiences in computer vision, NLP, speech, and other AI-related fields. The repository aims to provide valuable insights and techniques for individuals participating in AI competitions, covering topics such as image classification, object detection, OCR, adversarial attacks, and more.

Stars: 365

ZhiLight

ZhiLight is a highly optimized large language model (LLM) inference engine developed by Zhihu and ModelBest Inc. It accelerates the inference of models like Llama and its variants, especially on PCIe-based GPUs. ZhiLight offers significant performance advantages compared to mainstream open-source inference engines. It supports various features such as custom defined tensor and unified global memory management, optimized fused kernels, support for dynamic batch, flash attention prefill, prefix cache, and different quantization techniques like INT8, SmoothQuant, FP8, AWQ, and GPTQ. ZhiLight is compatible with OpenAI interface and provides high performance on mainstream NVIDIA GPUs with different model sizes and precisions.

Stars: 832

Awesome_Multimodel_LLM

Stars: 231

agentica

Agentica is a human-centric framework for building large language model agents. It provides functionalities for planning, memory management, tool usage, and supports features like reflection, planning and execution, RAG, multi-agent, multi-role, and workflow. The tool allows users to quickly code and orchestrate agents, customize prompts, and make API calls to various services. It supports API calls to OpenAI, Azure, Deepseek, Moonshot, Claude, Ollama, and Together. Agentica aims to simplify the process of building AI agents by providing a user-friendly interface and a range of functionalities for agent development.

Stars: 108

Prompt-Engineering-Holy-Grail

The Prompt Engineering Holy Grail repository is a curated resource for prompt engineering enthusiasts, providing essential resources, tools, templates, and best practices to support learning and working in prompt engineering. It covers a wide range of topics related to prompt engineering, from beginner fundamentals to advanced techniques, and includes sections on learning resources, online courses, books, prompt generation tools, prompt management platforms, prompt testing and experimentation, prompt crafting libraries, prompt libraries and datasets, prompt engineering communities, freelance and job opportunities, contributing guidelines, code of conduct, support for the project, and contact information.

Stars: 366

gpupixel

GPUPixel is a real-time, high-performance image and video filter library written in C++11 and based on OpenGL/ES. It incorporates a built-in beauty face filter that achieves commercial-grade beauty effects. The library is extremely easy to compile and integrate with a small size, supporting platforms including iOS, Android, Mac, Windows, and Linux. GPUPixel provides various filters like skin smoothing, whitening, face slimming, big eyes, lipstick, and blush. It supports input formats like YUV420P, RGBA, JPEG, PNG, and output formats like RGBA and YUV420P. The library's performance on devices like iPhone and Android is optimized, with low CPU usage and fast processing times. GPUPixel's lib size is compact, making it suitable for mobile and desktop applications.

Stars: 1.7k

lawyer-llama

Lawyer LLaMA is a large language model that has been specifically trained on legal data, including Chinese laws, regulations, and case documents. It has been fine-tuned on a large dataset of legal questions and answers, enabling it to understand and respond to legal inquiries in a comprehensive and informative manner. Lawyer LLaMA is designed to assist legal professionals and individuals with a variety of law-related tasks, including:

* **Legal research:** Quickly and efficiently search through vast amounts of legal information to find relevant laws, regulations, and case precedents.
* **Legal analysis:** Analyze legal issues, identify potential legal risks, and provide insights on how to proceed.
* **Document drafting:** Draft legal documents, such as contracts, pleadings, and legal opinions, with accuracy and precision.
* **Legal advice:** Provide general legal advice and guidance on a wide range of legal matters, helping users understand their rights and options.

Lawyer LLaMA is a powerful tool that can significantly enhance the efficiency and effectiveness of legal research, analysis, and decision-making. It is an invaluable resource for lawyers, paralegals, law students, and anyone else who needs to navigate the complexities of the legal system.

Stars: 751

Awesome-Knowledge-Distillation-of-LLMs

A collection of papers related to knowledge distillation of large language models (LLMs). The repository focuses on techniques to transfer advanced capabilities from proprietary LLMs to smaller models, compress open-source LLMs, and refine their performance. It covers various aspects of knowledge distillation, including algorithms, skill distillation, verticalization distillation in fields like law, medical & healthcare, finance, science, and miscellaneous domains. The repository provides a comprehensive overview of the research in the area of knowledge distillation of LLMs.

Stars: 890

rag-web-ui

RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology. It helps enterprises and individuals build intelligent Q&A systems based on their own knowledge bases. By combining document retrieval and large language models, it delivers accurate and reliable knowledge-based question-answering services. The system is designed with features like intelligent document management, advanced dialogue engine, and a robust architecture. It supports multiple document formats, async document processing, multi-turn contextual dialogue, and reference citations in conversations. The architecture includes a backend stack with Python FastAPI, MySQL + ChromaDB, MinIO, Langchain, JWT + OAuth2 for authentication, and a frontend stack with Next.js, TypeScript, Tailwind CSS, Shadcn/UI, and Vercel AI SDK for AI integration. Performance optimization includes incremental document processing, streaming responses, vector database performance tuning, and distributed task processing. The project is licensed under the Apache-2.0 License and is intended for learning and sharing RAG knowledge only, not for commercial purposes.

Stars: 2.0k

Nocode-Wep

Nocode/WEP is a forward-looking office visualization platform that includes modules for document building, web application creation, presentation design, and AI capabilities for office scenarios. It supports features such as configuring bullet comments, global article comments, multimedia content, custom drawing boards, flowchart editor, form designer, keyword annotations, article statistics, custom appreciation settings, JSON import/export, content block copying, and unlimited hierarchical directories. The platform is compatible with major browsers and aims to deliver content value, iterate products, share technology, and promote open-source collaboration.

Stars: 143

Speech-AI-Forge

Speech-AI-Forge is a project developed around TTS generation models, implementing an API Server and a WebUI based on Gradio. The project offers various ways to experience and deploy Speech-AI-Forge, including online experience on HuggingFace Spaces, one-click launch on Colab, container deployment with Docker, and local deployment. The WebUI features include TTS model functionality, speaker switch for changing voices, style control, long text support with automatic text segmentation, refiner for ChatTTS native text refinement, various tools for voice control and enhancement, support for multiple TTS models, SSML synthesis control, podcast creation tools, voice creation, voice testing, ASR tools, and post-processing tools. The API Server can be launched separately for higher API throughput. The project roadmap includes support for various TTS models, ASR models, voice clone models, and enhancer models. Model downloads can be manually initiated using provided scripts. The project aims to provide inference services and may include training-related functionalities in the future.

Stars: 1.2k

Qbot

Qbot is an AI-oriented automated quantitative investment platform that supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and reinforcement learning. It provides a full closed-loop process from data acquisition, strategy development, backtesting, simulation trading to live trading. The platform emphasizes AI strategies such as machine learning, reinforcement learning, and deep learning, combined with multi-factor models to enhance returns. Users with some Python knowledge and trading experience can easily utilize the platform to address trading pain points and gaps in the market.

Stars: 7.0k

ipex-llm

Stars: 7.6k

MindChat

MindChat is a psychological large language model designed to help individuals relieve psychological stress and resolve mental confusion, ultimately improving mental health. It aims to provide a relaxed and open conversation environment in which users can build trust and understanding. MindChat offers private, warm, safe, timely, and convenient conversations to help users overcome difficulties and challenges and achieve self-growth and development. The tool is suitable for both work and personal life scenarios, providing comprehensive psychological support and therapeutic assistance to users while strictly protecting user privacy. It combines psychological knowledge with artificial intelligence technology to contribute to a healthier, more inclusive, and more equal society.

Stars: 436

For similar tasks

rag-chatbot

rag-chatbot is a tool that allows users to chat with multiple PDFs using Ollama and LlamaIndex. It provides an easy setup for running on local machines or Kaggle notebooks. Users can leverage models from Huggingface and Ollama, process multiple PDF inputs, and chat in multiple languages. The tool offers a simple UI with Gradio, supporting chat with history and QA modes. Setup instructions are provided for both Kaggle and local environments, including installation steps for Docker, Ollama, Ngrok, and the rag_chatbot package. Users can run the tool locally and access it via a web interface. Future enhancements include adding evaluation, better embedding models, knowledge graph support, improved document processing, MLX model integration, and Corrective RAG.

Stars: 245

pdftochat

PDFToChat is a tool that allows users to chat with their PDF documents in seconds. It is powered by Together AI and Pinecone, utilizing a tech stack including Next.js, Mixtral, M2 Bert, LangChain.js, MongoDB Atlas, Bytescale, Vercel, Clerk, and Tailwind CSS. Users can deploy the tool to Vercel or any other host by setting up Together.ai, MongoDB Atlas database, Bytescale, Clerk, and Vercel. The tool enables users to interact with PDFs through chat, with future tasks including adding features like trash icon for deleting PDFs, exploring different embedding models, implementing auto scrolling, improving replies, benchmarking accuracy, researching chunking and retrieval best practices, adding demo video, upgrading to Next.js 14, adding analytics, customizing tailwind prose, saving chats in postgres DB, compressing large PDFs, implementing custom uploader, session tracking, error handling, and support for images in PDFs.

Stars: 916

llama-index

This repository, llama-index, contains a collection of apps powered by LlamaIndex. LlamaIndex is an open-source project that provides a simple interface between LLMs and external data sources like APIs, PDFs, SQL etc. It provides indices over structured and unstructured data, helping to abstract away the differences across data sources. The repository includes apps like chat-with-pdf and summarize-url, showcasing the capabilities of LlamaIndex in interacting with PDFs and summarizing URLs.

Stars: 53

papersgpt-for-zotero

PapersGPT For Zotero is an AI plugin that improves paper reading and research efficiency by integrating cutting-edge LLMs and offering seamless Zotero integration. Users can ask questions, extract insights, and converse with PDFs directly, making it a powerful research assistant for scholars, researchers, and anyone dealing with large amounts of text in PDF format. The plugin ensures privacy and data safety by using locally stored models and modules, with the ability to switch between different models easily. It provides a user-friendly interface for managing and chatting with documents within Zotero, making research tasks more streamlined and productive.

Stars: 720

For similar jobs

weave

Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

Stars: 855

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

Stars: 1.5k

VisionCraft

The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

Stars: 94

kaito

Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

Stars: 405

PyRIT

PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

Stars: 2.3k

tabby

Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features:

* Self-contained, with no need for a DBMS or cloud service.
* OpenAPI interface, easy to integrate with existing infrastructure (e.g. Cloud IDE).
* Supports consumer-grade GPUs.

Stars: 30.6k

spear

SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

Stars: 224

Magick

Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.

Stars: 675