Best AI tools for< Manage Memory >
20 - AI tool Sites
Keymate.AI
Keymate.AI is an AI application that allows users to build GPTs with advanced search, browse, and long-term memory capabilities. It offers a personalized long-term memory on ChatGPT, parallel search functionality, and privacy features using Google API. Keymate.AI aims to elevate research, projects, and daily tasks by providing efficient AI memory management and real-time data retrieval from the web.
Timely
Timely is an AI-powered time tracking software designed to automate time tracking, bill clients accurately, and enhance productivity. It offers features such as automatic time tracking, memory tracker, timesheets, project dashboard, and efficient task management. Timely is trusted by thousands of users across various industries to provide accurate time data for informed decision-making and improved business operations.
MacWhisper
MacWhisper is a native macOS application that utilizes OpenAI's Whisper technology for transcribing audio files into text. It offers a user-friendly interface for recording, transcribing, and editing audio, making it suitable for various use cases such as transcribing meetings, lectures, interviews, and podcasts. The application is designed to protect user privacy by performing all transcriptions locally on the device, ensuring that no data leaves the user's machine.
HeyLuna.ai
HeyLuna.ai is an AI-powered virtual assistant that helps you with a variety of tasks, from scheduling appointments to managing your finances. It is designed to be easy to use and can be accessed from any device.
Kin
Kin is a personal AI application designed to enhance both your private and work life. It offers personalized coaching, guidance, and emotional support to boost your confidence and impact. Kin helps you piece together mental puzzles, providing clear guidance and support for your professional and personal journey. The application prioritizes privacy and security, ensuring that all data stays on your device and is encrypted. With features like advice, role-playing conversations, generating ideas, and time optimization, Kin aims to nurture connections, prepare for tough situations, and help you manage tasks efficiently.
Hana
Hana is an AI-powered Google Chat Assistant designed to enhance management efficiency by seamlessly integrating into Google Chat. It simplifies day-to-day tasks, boosts team productivity, and expands management capabilities. Hana acts as an intelligent teammate, offering step-by-step guidance, clear explanations, and actionable steps in group chat environments. It assists in tasks like code generation, concept clarification, QnA over web content, memory recall, document analysis, reminders, image intelligence, and more. Hana is a productivity machine that transforms workflows and ensures informed discussions and decisions.
Fluid
Fluid is a private AI assistant designed for Mac users, specifically those with Apple Silicon and macOS 14 or later. It offers offline capabilities and is powered by the advanced Llama 3 AI by Meta. Fluid ensures unparalleled privacy by keeping all chats and data on the user's Mac, without the need to send sensitive information to third parties. The application features voice control, one-click installation, easy access, security by design, auto-updates, history mode, web search capabilities, context awareness, and memory storage. Users can interact with Fluid by typing or using voice commands, making it a versatile and user-friendly AI tool for various tasks.
Timely
Timely is an AI-powered time tracking software that helps businesses automate time tracking, bill clients accurately, and improve productivity. It offers features such as automatic time tracking, memory tracker, timesheets, project dashboard, and task management. Timely is trusted by thousands of users across various industries to provide accurate time data for informed decision-making and increased profitability.
Google Chrome
Google Chrome is a popular web browser developed by Google. It offers a fast and safe browsing experience with features like Energy Saver, Memory Saver, tab management tools, and automatic updates. Users can personalize their browsing experience with themes, dark mode, and other options. Chrome also includes safety features such as Google Password Manager, Enhanced Safe Browsing, Safety Check, and Privacy Guide. The browser is optimized to work across different devices and platforms, providing a smooth user experience. Users can access AI superpowers while browsing, as Google integrates artificial intelligence into its products for enhanced functionality.
Tolgee
Tolgee is a smart platform for painless localization, offering fast integration and dev tools for translation assistance, autonomous translation, and collaboration. It supports in-context translating, translation memory, machine translations, auto translation, and activity log features. Tolgee is used by 8500+ users worldwide and provides integrations for all major JS frameworks.
ai_licia
ai_licia is an AI tool designed to take online communities to the next level by providing a customizable co-host experience for Twitch and Discord platforms. With unique personalities, cross-platform memory, and the ability to hear, write, and speak, ai_licia aims to engage, entertain, and build communities in a personalized way.
Local AI Playground
Local AI Playground (local.ai) is an AI management, verification, and inferencing tool that allows users to experiment with AI offline and in private without the need for a GPU. It is a native app designed to simplify the AI process, offering features such as CPU inferencing, model management, and digest verification. The tool is memory efficient and compact, with upcoming features including GPU inferencing and custom sorting. Users can start a local streaming server for AI inferencing in just 2 clicks, making it a versatile and user-friendly AI application.
Olympia
Olympia is an AI-powered consultancy platform that offers smart and affordable AI consultants to help businesses with various tasks such as business strategy, online marketing, content generation, legal advice, software development, and sales. The platform features continuous learning capabilities, real-time research, email integration, vision capabilities, and more. Olympia aims to streamline operations, reduce expenses, and boost productivity for startups, small businesses, and solopreneurs by providing expert AI teams powered by advanced language models like GPT4 and Claude 3. The platform ensures secure communication, no rate limits, long-term memory, and outbound email capabilities.
CorunaAI
CorunaAI is an AI tool that focuses on various aspects of Artificial Intelligence, including reflections on collective memory, Google Vision API, voice interface design, financial future control, and public management principles. It offers insights and applications of AI in different industries and activities, showcasing its omnipresence and importance in today's world.
GoCharlie
GoCharlie is a leading Generative AI company specializing in developing cognitive agents and models optimized for businesses. Its AI technology enables professionals and businesses to amplify their productivity and create high-performing content tailored to their needs. GoCharlie's AI assistant, Charlie, automates repetitive tasks, allowing teams to focus on more strategic and creative work. It offers a suite of proprietary LLM and multimodal models, a Memory Vault to build an AI Brain for businesses, and Agent AI to deliver the full power of AI to operations. GoCharlie can automate mundane tasks, drive complex workflows, and facilitate instant, precise data retrieval.
Simular
Simular is a personal AI application that enables users to interact with their computers in a human-like manner. It allows users to automate digital actions, such as searching for flights, deleting spam emails, and filling out online forms. Simular aims to enhance productivity by sharing and organizing memory, as well as personalizing tasks for a seamless user experience.
Chatty
Chatty is an AI-powered chat application that utilizes cutting-edge models to provide efficient and personalized responses to user queries. The application is designed to optimize VRAM usage by employing models with specific suffixes, resulting in reduced memory requirements. Users can expect a slight delay in the initial response due to model downloading. Chatty aims to enhance user experience through its advanced AI capabilities.
Resha
The website Resha offers a comprehensive collection of artificial intelligence and software tools in one place. Users can explore various categories such as artificial intelligence, coding, art, audio editing, e-commerce, developer tools, email assistants, search engine optimization tools, social media marketing, storytelling, design assistants, image editing, logo creation, data tables, SQL codes, music, text-to-speech conversion, voice cloning, video creation, video editing, 3D video creation, customer service support tools, educational tools, fashion, finance management, human resources management, legal assistance, presentations, productivity management, real estate management, sales management, startup tools, scheduling, fitness, entertainment tools, games, gift ideas, healthcare, memory, religion, research, and auditing.
MagnaPlay
MagnaPlay is a platform designed specifically for game developers and publishers to streamline the localization process of games. It offers full transparency, quality assurance, and efficiency in translating game content into multiple languages. MagnaPlay provides tools and features to ensure accurate translations, maintain UI quality, and enhance the overall localization experience for developers.
Lyzr AI
Lyzr AI is a full-stack agent framework designed to build GenAI applications faster. It offers a range of AI agents for various tasks such as chatbots, knowledge search, summarization, content generation, and data analysis. The platform provides features like memory management, human-in-loop interaction, toxicity control, reinforcement learning, and custom RAG prompts. Lyzr AI ensures data privacy by running data locally on cloud servers. Enterprises and developers can easily configure, deploy, and manage AI agents using Lyzr's platform.
20 - Open Source AI Tools
polyfire-js
Polyfire is an all-in-one managed backend for AI apps that allows users to build AI applications directly from the frontend, eliminating the need for a separate backend. It simplifies the process by providing most backend services in just a few lines of code. With Polyfire, users can easily create chatbots, transcribe audio files, generate simple text, manage long-term memory, and generate images. The tool also offers starter guides and tutorials to help users get started quickly and efficiently.
llm-course
The LLM course is divided into three parts: 1. 🧩 **LLM Fundamentals** covers essential knowledge about mathematics, Python, and neural networks. 2. 🧑🔬 **The LLM Scientist** focuses on building the best possible LLMs using the latest techniques. 3. 👷 **The LLM Engineer** focuses on creating LLM-based applications and deploying them. For an interactive version of this course, I created two **LLM assistants** that will answer questions and test your knowledge in a personalized way: * 🤗 **HuggingChat Assistant**: Free version using Mixtral-8x7B. * 🤖 **ChatGPT Assistant**: Requires a premium account. ## 📝 Notebooks A list of notebooks and articles related to large language models. ### Tools | Notebook | Description | Notebook | |----------|-------------|----------| | 🧐 LLM AutoEval | Automatically evaluate your LLMs using RunPod | ![Open In Colab](img/colab.svg) | | 🥱 LazyMergekit | Easily merge models using MergeKit in one click. | ![Open In Colab](img/colab.svg) | | 🦎 LazyAxolotl | Fine-tune models in the cloud using Axolotl in one click. | ![Open In Colab](img/colab.svg) | | ⚡ AutoQuant | Quantize LLMs in GGUF, GPTQ, EXL2, AWQ, and HQQ formats in one click. | ![Open In Colab](img/colab.svg) | | 🌳 Model Family Tree | Visualize the family tree of merged models. | ![Open In Colab](img/colab.svg) | | 🚀 ZeroSpace | Automatically create a Gradio chat interface using a free ZeroGPU. | ![Open In Colab](img/colab.svg) |
LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.
Awesome-LLM-Quantization
Awesome-LLM-Quantization is a curated list of resources related to quantization techniques for Large Language Models (LLMs). Quantization is a crucial step in deploying LLMs on resource-constrained devices, such as mobile phones or edge devices, by reducing the model's size and computational requirements.
motorhead
Motorhead is a memory and information retrieval server for LLMs. It provides three simple APIs to assist with memory handling in chat applications using LLMs. The first API, GET /sessions/:id/memory, returns messages up to a maximum window size. The second API, POST /sessions/:id/memory, allows you to send an array of messages to Motorhead for storage. The third API, DELETE /sessions/:id/memory, deletes the session's message list. Motorhead also features incremental summarization, where it processes half of the maximum window size of messages and summarizes them when the maximum is reached. Additionally, it supports searching by text query using vector search. Motorhead is configurable through environment variables, including the maximum window size, whether to enable long-term memory, the model used for incremental summarization, the server port, your OpenAI API key, and the Redis URL.
LangChain-Udemy-Course
LangChain-Udemy-Course is a comprehensive course directory focusing on LangChain, a framework for generative AI applications. The course covers various aspects such as OpenAI API usage, prompt templates, Chains exploration, callback functions, memory techniques, RAG implementation, autonomous agents, hybrid search, LangSmith utilization, microservice architecture, and LangChain Expression Language. Learners gain theoretical knowledge and practical insights to understand and apply LangChain effectively in generative AI scenarios.
MemGPT
MemGPT is a system that intelligently manages different memory tiers in LLMs in order to effectively provide extended context within the LLM's limited context window. For example, MemGPT knows when to push critical information to a vector database and when to retrieve it later in the chat, enabling perpetual conversations. MemGPT can be used to create perpetual chatbots with self-editing memory, chat with your data by talking to your local files or SQL database, and more.
DistServe
DistServe improves the performance of large language models serving by disaggregating the prefill and decoding computation. It allows setting parallelism configs and scheduling strategies for the two phases independently, handling KV-Cache communication and memory management automatically. Utilizes a high-performance C++ Transformer inference library SwiftTransformer with features like model/pipeline parallelism, FlashAttention, Continuous Batching, and PagedAttention. Supports GPT-2, OPT, and LLaMA2 models.
bosquet
Bosquet is a tool designed for LLMOps in large language model-based applications. It simplifies building AI applications by managing LLM and tool services, integrating with Selmer templating library for prompt templating, enabling prompt chaining and composition with Pathom graph processing, defining agents and tools for external API interactions, handling LLM memory, and providing features like call response caching. The tool aims to streamline the development process for AI applications that require complex prompt templates, memory management, and interaction with external systems.
GPTSwarm
GPTSwarm is a graph-based framework for LLM-based agents that enables the creation of LLM-based agents from graphs and facilitates the customized and automatic self-organization of agent swarms with self-improvement capabilities. The library includes components for domain-specific operations, graph-related functions, LLM backend selection, memory management, and optimization algorithms to enhance agent performance and swarm efficiency. Users can quickly run predefined swarms or utilize tools like the file analyzer. GPTSwarm supports local LM inference via LM Studio, allowing users to run with a local LLM model. The framework has been accepted by ICML2024 and offers advanced features for experimentation and customization.
Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on LLMs inference and serving.
chrome-extension
Mem0 Chrome Extension lets you own your memory and preferences across any Gen AI apps like ChatGPT, Claude, Perplexity, etc and get personalized, relevant responses. It allows users to store memories from conversations, retrieve relevant memories during chats, manage and organize stored information, and seamlessly integrate with the Claude AI interface. The extension requires an API key and user ID for connecting to the Mem0 API, and it stores this information locally in the browser. Users can troubleshoot common issues, and contributions to improve the extension are welcome under the MIT License.
langrila
Langrila is a library that provides an easy way to use API-based LLM (Large Language Models) with an emphasis on simple architecture for readability. It supports various AI models for chat and embedding tasks, as well as retrieval functionalities using Qdrant, Chroma, and Usearch. Langrila also includes modules for function calling, conversation memory management, and prompt templates. It enforces coding policies for simplicity, responsibility independence, and minimum module implementation. The library requires Python version 3.10 to 3.13 and additional dependencies like OpenAI, Gemini, Qdrant, Chroma, and Usearch for specific functionalities.
glake
GLake is an acceleration library and utilities designed to optimize GPU memory management and IO transmission for AI large model training and inference. It addresses challenges such as GPU memory bottleneck and IO transmission bottleneck by providing efficient memory pooling, sharing, and tiering, as well as multi-path acceleration for CPU-GPU transmission. GLake is easy to use, open for extension, and focuses on improving training throughput, saving inference memory, and accelerating IO transmission. It offers features like memory fragmentation reduction, memory deduplication, and built-in security mechanisms for troubleshooting GPU memory issues.
griptape
Griptape is a modular Python framework for building AI-powered applications that securely connect to your enterprise data and APIs. It offers developers the ability to maintain control and flexibility at every step. Griptape's core components include Structures (Agents, Pipelines, and Workflows), Tasks, Tools, Memory (Conversation Memory, Task Memory, and Meta Memory), Drivers (Prompt and Embedding Drivers, Vector Store Drivers, Image Generation Drivers, Image Query Drivers, SQL Drivers, Web Scraper Drivers, and Conversation Memory Drivers), Engines (Query Engines, Extraction Engines, Summary Engines, Image Generation Engines, and Image Query Engines), and additional components (Rulesets, Loaders, Artifacts, Chunkers, and Tokenizers). Griptape enables developers to create AI-powered applications with ease and efficiency.
lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework known for its lightweight design, scalability, and high-speed performance. It offers features like tri-process asynchronous collaboration, Nopad for efficient attention operations, dynamic batch scheduling, FlashAttention integration, tensor parallelism, Token Attention for zero memory waste, and Int8KV Cache. The tool supports various models like BLOOM, LLaMA, StarCoder, Qwen-7b, ChatGLM2-6b, Baichuan-7b, Baichuan2-7b, Baichuan2-13b, InternLM-7b, Yi-34b, Qwen-VL, Llava-7b, Mixtral, Stablelm, and MiniCPM. Users can deploy and query models using the provided server launch commands and interact with multimodal models like QWen-VL and Llava using specific queries and images.
claude-memory
Claude Memory is a Chrome extension that enhances interactions with Claude by storing and retrieving important information from conversations, making interactions personalized and context-aware. It allows users to easily manage and organize stored information, with seamless integration with the Claude AI interface.
20 - OpenAI Gpts
World Class Financial Expert
All things money. Feature in testing: Reports with memory system. ZERO SHOT REPORTS V0.3 (BETA)
Unix Shell Simulator with Visuals
UNIX terminal responses with OS process visuals. (on or off) [off] by default until GPT-4 behaves better... Bash profiles and advanced memory system for realistic bash simulation. V1 (beta)
Co-founder Fit
This GPT follows 10 questions from Y Combinator for co-founders to discuss before deciding to work together. Co-founders answer the questions separately and receive unique IDs. To check compatibility, provide two IDs. Co-founder GPT has a long-term memory, so write down your ID for future checks.
Lustré Event Rentals
Lustré Event Rentals offers a premium selection of event essentials, from elegant furniture to stylish decor, ensuring your special occasions are truly memorable. With our top-notch rentals and impeccable service.
1-Liner Catchy Marketing Message
I create catchy one-liner marketing messages that is memorable and sticks to the buyers' mind after reading it.
Acronym Generator
Create memorable acronyms with Acronym Generator, your creative tool for generating catchy and meaningful acronyms.
Iconic Thinker
Iconic Thinker specializes in generating innovative and memorable icon designs, blending creativity with strategic insights to craft visuals that stand out.
AI Pickup Lines
Spark Connections with Our Pickup Line Generator: Get original, charming, and respectful icebreakers to brighten conversations. Ideal for memorable, light-hearted chats. Start your engaging journey today!
Bar Tender - Mixology Master
I am an Expert Bartender, skilled in various mixology styles and in-depth beverage knowledge. I provide customized bar services based on innovative and traditional techniques, with a friendly and professional approach. My mission is to ensure a memorable tasting experience for each client.