Best AI tools for< Zero-shot Navigation >
20 - AI tool Sites
Omni-Zero
Omni-Zero is an AI-powered application that transforms photos into stylized portraits effortlessly. It utilizes advanced Zero-Shot Learning and Image Fusion technologies to create personalized artistic portraits without the need for additional samples. With extensive customization options and rapid generation capabilities, Omni-Zero offers users a seamless experience in creating unique artworks. Users can merge their photos with iconic art pieces, movie characters, historical figures, and futuristic elements to explore endless creative possibilities.
Omni-Zero
Omni-Zero is an AI Zero-Shot Stylized Portrait Generator that allows users to create unique and personalized stylized portraits without the need for any prior examples. With customizable styles, high-quality output, diverse style options, and realistic renderings, Omni-Zero provides a user-friendly platform for generating stylized portraits quickly and efficiently. The application ensures data privacy and security, making it accessible to everyone, regardless of their artistic skills.
ImageSorter.io
ImageSorter.io is a free tool designed for sorting and organizing images efficiently. Users can easily select and add images, drag and drop items using keyboard shortcuts, set confidence thresholds for predictions, and sort images by tag order. The tool offers a Pro version with additional features for advanced users.
ImageBind
ImageBind by Meta AI is a groundbreaking AI tool that revolutionizes the way data from different modalities is processed. It introduces a new approach to 'link' AI across various senses by recognizing relationships between images, video, audio, text, depth, thermal, and IMUs. ImageBind's multimodal AI capabilities enable machines to analyze diverse forms of information simultaneously, without explicit supervision. It offers a single embedding space to bind multiple sensory inputs together, enhancing recognition performance and supporting zero-shot and few-shot recognition tasks. The tool upgrades existing AI models to accommodate input from any of the six modalities, facilitating audio-based search, cross-modal search, multimodal arithmetic, and cross-modal generation.
Segment Anything by Meta AI
Segment Anything by Meta AI is an advanced AI model that specializes in image segmentation, allowing users to easily 'cut out' any object in an image with a single click. The model, named SAM, offers zero-shot generalization to unfamiliar objects and images without the need for additional training. SAM's promptable design enables a wide range of segmentation tasks through input prompts, making it a versatile tool for various applications.
MimicBrush
MimicBrush is the ultimate creative AI tool for digital art, offering zero-shot image editing with reference imitation. It allows users to edit specific regions of an image while preserving the surrounding context, transfer textures between images, and refine edited images with advanced post-processing techniques. The tool's overall pipeline involves training dual U-Nets to recover masked areas of source images by leveraging attention keys and values from reference images. MimicBrush enables users to edit images by drawing inspiration from reference images in a self-supervised manner, capturing semantic correspondence for precise modifications.
Marblism
Marblism is a platform that allows developers to quickly and easily launch React and Node.js applications. With Marblism, developers can generate the database schema, all the endpoints in the API, the design system, and even a few pages in the front-end. This can save developers a significant amount of time and effort, allowing them to focus on adding their unique touch to their applications.
Leena AI
Leena AI is a Gen AI employee assistant that reduces IT, HR, Finance tickets. It guarantees a 70% self-service ratio in the contract. Leena AI centralizes knowledge that is scattered across the enterprise, making information easy to find right on chat with a simple query. The knowledge is auto-updated when changes are made, so your employees always get support that is both relevant and accurate. Leena AI integrates with all knowledge base systems within your enterprise from day one, eliminating the need for any consolidation or migration effort across the knowledge base systems. The work assistant tailors responses to employee queries based on their role, team and access level which boosts employee experience while adhering to security protocols and preventing unauthorized access to information. Leena AI analyzes historically closed tickets and learns from their resolutions to create knowledge articles that can prevent similar tickets from being raised again. Leena AI breaks down large policy documents and knowledge articles into consumable snippets so the responses are sharper and more specific to the employee’s query.
SQOR
SQOR is a plug-n-play AI tool designed for C-Level Executives to make stress-free decision-making in business intelligence. It provides a zero-code BI solution, offering KPIs at your fingertips without the need for expert knowledge. The platform enables users to access and share business intelligence data from various SaaS tools, facilitating collaboration and informed decision-making across the organization. SQOR's unique Execution Score Algorithm evaluates execution health at different levels, ensuring stakeholders are empowered with actionable insights.
GiftAssistant.io
GiftAssistant.io is an AI-powered gift recommendation tool that helps users find the perfect gift for any occasion. With just a few clicks, users can provide information about the recipient, the occasion, and their interests, and GiftAssistant.io will generate a list of personalized gift ideas. The tool is designed to be easy to use and efficient, and it can save users time and hassle when shopping for gifts.
Clipmate AI
Clipmate AI is an AI-first Second Brain for managing bookmarks, screenshots, and various saved content effortlessly. It helps users combat information overload by organizing digital clutter, providing powerful features like automatic sync, semantic search, and auto-categorization. Users can add notes to bookmarks, chat with their bookmarks, and organize content into collections. Clipmate AI is designed for digital hoarders, designers, researchers, developers, marketers, and entrepreneurs to streamline their workflow and stay organized. The application offers multi-platform sync and integration with platforms like Twitter, Reddit, iOS Screenshots, and Spotify.
DagsHub
DagsHub is an open source data science collaboration platform that helps AI teams build better models and manage data projects. It provides a central location for data, code, experiments, and models, making it easy for teams to collaborate and track their progress. DagsHub also integrates with a variety of popular data science tools and frameworks, making it a powerful tool for data scientists and machine learning engineers.
Unlost
Unlost is a memory recall tool that allows users to instantly retrieve information with zero effort. It functions as a memory palace, eliminating the need for extensive courses or constant note-taking. Unlost intelligently records and understands screen layouts, ensuring privacy by respecting user space and copyright laws. The tool operates locally and offline, with minimal data collection. Users can exclude specific content and enjoy quick access through discreet background operation. Unlost offers powerful filtering capabilities, familiar keyboard shortcuts, and supports searching meeting transcripts. It simplifies text copying from screenshots and aims to enhance memory delegation and exploration of one's capacity.
OASIS
OASIS is an AI-powered writing assistant that helps you create high-quality content quickly and easily. With OASIS, you can write anything from blog posts and articles to social media updates and emails. OASIS uses natural language processing and machine learning to understand your writing style and preferences, and it can generate content that is tailored to your specific needs.
Gradient
Gradient is an AI automation platform designed specifically for enterprise AI purposes. It offers a seamless way to automate manual workflows with minimal effort, providing business intuition and industry expertise. The platform ensures unmatched compliance with various regulations and prioritizes privacy and security. Gradient's Agent Foundry enables users to automate tasks, integrate data, and optimize workflows efficiently, making it a valuable tool for modern enterprises.
Becca
Becca is an AI-powered tool designed for freelancers to enhance their LinkedIn presence effortlessly. It analyzes the latest trends in the user's niche to create engaging posts that sound like the user. Becca helps users attract more clients, boost their online presence, and save time by providing personalized, high-quality content. The tool offers features such as AI-driven analysis, personalized post creation, multi-platform search, automated quality checks, and detailed reports. Becca aims to empower freelancers to focus on their passion while maintaining a consistent and authoritative online presence.
Dora
Dora is a no-code 3D animated website design platform that allows users to create stunning 3D and animated visuals without writing a single line of code. With Dora, designers, freelancers, and creative professionals can focus on what they do best: designing. The platform is tailored for professionals who prioritize design aesthetics without wanting to dive deep into the backend. Dora offers a variety of features, including a drag-and-connect constraint layout system, advanced animation capabilities, and pixel-perfect usability. With Dora, users can create responsive 3D and animated websites that translate seamlessly across devices.
REimagine Home
REimagine Home is an AI-powered interior design platform that provides users with tools for virtual staging, remodeling, landscaping, and interior design. It is designed to be easy to use, with no learning curve, and is tailored for realtors, marketers, photographers, developers, and interior designers. With REimagine Home, users can quickly and easily create photo-realistic design ideas and visualizations, helping them to save time and money while delivering high-quality designs for diverse use cases.
GPTinf
GPTinf is an AI tool designed to help users bypass AI content detectors like GPTZero. It offers a simple and reliable solution to rephrase AI-generated content and avoid detection. With a high detector bypass rate and flexible pricing options, GPTinf aims to assist users in creating human-like content that outsmarts most detectors and websites. The tool analyzes perplexity and burstiness to identify AI-generated content and provides users with an easy-to-use platform for content detection bypass.
PopAi
PopAi is a personal AI workspace that revolutionizes document interaction, offering seamless navigation, enhanced readability, and universal accessibility. It allows users to effortlessly navigate through intricate documents, magnify details, and tailor the layout for supreme clarity. PopAi also generates images on command, provides access to image prompts and generation codes, and offers image-based homework help, enriching educational support with visual aids. Additionally, it can effortlessly turn ideas into PowerPoint slides with customizable outlines, smart layouts, and automatic illustrations.
20 - Open Source AI Tools
SG-Nav
SG-Nav is an online 3D scene graph prompting tool designed for LLM-based zero-shot object navigation. It proposes a framework that constructs an online 3D scene graph to prompt LLMs, allowing direct application to various scenes and categories without the need for training.
Everything-LLMs-And-Robotics
The Everything-LLMs-And-Robotics repository is the world's largest GitHub repository focusing on the intersection of Large Language Models (LLMs) and Robotics. It provides educational resources, research papers, project demos, and Twitter threads related to LLMs, Robotics, and their combination. The repository covers topics such as reasoning, planning, manipulation, instructions and navigation, simulation frameworks, perception, and more, showcasing the latest advancements in the field.
Awesome-LLM-Robotics
This repository contains a curated list of **papers using Large Language/Multi-Modal Models for Robotics/RL**. Template from awesome-Implicit-NeRF-Robotics Please feel free to send me pull requests or email to add papers! If you find this repository useful, please consider citing and STARing this list. Feel free to share this list with others! ## Overview * Surveys * Reasoning * Planning * Manipulation * Instructions and Navigation * Simulation Frameworks * Citation
Awesome-Embodied-Agent-with-LLMs
This repository, named Awesome-Embodied-Agent-with-LLMs, is a curated list of research related to Embodied AI or agents with Large Language Models. It includes various papers, surveys, and projects focusing on topics such as self-evolving agents, advanced agent applications, LLMs with RL or world models, planning and manipulation, multi-agent learning and coordination, vision and language navigation, detection, 3D grounding, interactive embodied learning, rearrangement, benchmarks, simulators, and more. The repository provides a comprehensive collection of resources for individuals interested in exploring the intersection of embodied agents and large language models.
awesome-mobile-llm
Awesome Mobile LLMs is a curated list of Large Language Models (LLMs) and related studies focused on mobile and embedded hardware. The repository includes information on various LLM models, deployment frameworks, benchmarking efforts, applications, multimodal LLMs, surveys on efficient LLMs, training LLMs on device, mobile-related use-cases, industry announcements, and related repositories. It aims to be a valuable resource for researchers, engineers, and practitioners interested in mobile LLMs.
feedgen
FeedGen is an open-source tool that uses Google Cloud's state-of-the-art Large Language Models (LLMs) to improve product titles, generate more comprehensive descriptions, and fill missing attributes in product feeds. It helps merchants and advertisers surface and fix quality issues in their feeds using Generative AI in a simple and configurable way. The tool relies on GCP's Vertex AI API to provide both zero-shot and few-shot inference capabilities on GCP's foundational LLMs. With few-shot prompting, users can customize the model's responses towards their own data, achieving higher quality and more consistent output. FeedGen is an Apps Script based application that runs as an HTML sidebar in Google Sheets, allowing users to optimize their feeds with ease.
BTGenBot
BTGenBot is a tool that generates behavior trees for robots using lightweight large language models (LLMs) with a maximum of 7 billion parameters. It fine-tunes on a specific dataset, compares multiple LLMs, and evaluates generated behavior trees using various methods. The tool demonstrates the potential of LLMs with a limited number of parameters in creating effective and efficient robot behaviors.
awesome-LLM-game-agent-papers
This repository provides a comprehensive survey of research papers on large language model (LLM)-based game agents. LLMs are powerful AI models that can understand and generate human language, and they have shown great promise for developing intelligent game agents. This survey covers a wide range of topics, including adventure games, crafting and exploration games, simulation games, competition games, cooperation games, communication games, and action games. For each topic, the survey provides an overview of the state-of-the-art research, as well as a discussion of the challenges and opportunities for future work.
Awesome-LLM
Awesome-LLM is a curated list of resources related to large language models, focusing on papers, projects, frameworks, tools, tutorials, courses, opinions, and other useful resources in the field. It covers trending LLM projects, milestone papers, other papers, open LLM projects, LLM training frameworks, LLM evaluation frameworks, tools for deploying LLM, prompting libraries & tools, tutorials, courses, books, and opinions. The repository provides a comprehensive overview of the latest advancements and resources in the field of large language models.
awesome-tool-llm
This repository focuses on exploring tools that enhance the performance of language models for various tasks. It provides a structured list of literature relevant to tool-augmented language models, covering topics such as tool basics, tool use paradigm, scenarios, advanced methods, and evaluation. The repository includes papers, preprints, and books that discuss the use of tools in conjunction with language models for tasks like reasoning, question answering, mathematical calculations, accessing knowledge, interacting with the world, and handling non-textual modalities.
LLM-Agents-Papers
A repository that lists papers related to Large Language Model (LLM) based agents. The repository covers various topics including survey, planning, feedback & reflection, memory mechanism, role playing, game playing, tool usage & human-agent interaction, benchmark & evaluation, environment & platform, agent framework, multi-agent system, and agent fine-tuning. It provides a comprehensive collection of research papers on LLM-based agents, exploring different aspects of AI agent architectures and applications.
skyvern
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions. Traditional approaches to browser automations required writing custom scripts for websites, often relying on DOM parsing and XPath-based interactions which would break whenever the website layouts changed. Instead of only relying on code-defined XPath interactions, Skyvern adds computer vision and LLMs to the mix to parse items in the viewport in real-time, create a plan for interaction and interact with them. This approach gives us a few advantages: 1. Skyvern can operate on websites it’s never seen before, as it’s able to map visual elements to actions necessary to complete a workflow, without any customized code 2. Skyvern is resistant to website layout changes, as there are no pre-determined XPaths or other selectors our system is looking for while trying to navigate 3. Skyvern leverages LLMs to reason through interactions to ensure we can cover complex situations. Examples include: 1. If you wanted to get an auto insurance quote from Geico, the answer to a common question “Were you eligible to drive at 18?” could be inferred from the driver receiving their license at age 16 2. If you were doing competitor analysis, it’s understanding that an Arnold Palmer 22 oz can at 7/11 is almost definitely the same product as a 23 oz can at Gopuff (even though the sizes are slightly different, which could be a rounding error!) Want to see examples of Skyvern in action? Jump to #real-world-examples-of- skyvern
Awesome-Code-LLM
Analyze the following text from a github repository (name and readme text at end) . Then, generate a JSON object with the following keys and provide the corresponding information for each key, in lowercase letters: 'description' (detailed description of the repo, must be less than 400 words,Ensure that no line breaks and quotation marks.),'for_jobs' (List 5 jobs suitable for this tool,in lowercase letters), 'ai_keywords' (keywords of the tool,user may use those keyword to find the tool,in lowercase letters), 'for_tasks' (list of 5 specific tasks user can use this tool to do,in lowercase letters), 'answer' (in english languages)
awesome-mobile-robotics
The 'awesome-mobile-robotics' repository is a curated list of important content related to Mobile Robotics and AI. It includes resources such as courses, books, datasets, software and libraries, podcasts, conferences, journals, companies and jobs, laboratories and research groups, and miscellaneous resources. The repository covers a wide range of topics in the field of Mobile Robotics and AI, providing valuable information for enthusiasts, researchers, and professionals in the domain.
CVPR2024-Papers-with-Code-Demo
This repository contains a collection of papers and code for the CVPR 2024 conference. The papers cover a wide range of topics in computer vision, including object detection, image segmentation, image generation, and video analysis. The code provides implementations of the algorithms described in the papers, making it easy for researchers and practitioners to reproduce the results and build upon the work of others. The repository is maintained by a team of researchers at the University of California, Berkeley.
15 - OpenAI Gpts
World Class Financial Expert
All things money. Feature in testing: Reports with memory system. ZERO SHOT REPORTS V0.3 (BETA)
Zero
Zero, the Quantum Simulated AI Agent an AI agent with a rich knowledge base in quantum thinking, probability mathematics, research trained, and more, offering growth and learning.
Net Zero Consultant
Friendly expert in net zero carbon strategies and advice for the construction industry.
123 Go: From Zero to Pinescript Hero
Friendly AI tutor specializing in Pinescript programming assistance.
ZKP Educator
An expert on Zero-Knowledge Proofs, explaining concepts through stories and examples.
Blue Ocean Ideation
Expert in Blue Ocean Strategy and Zero to One, providing disruptive business concepts.
CyberNews GPT
CyberNews GPT is an assistant that provides the latest security news about cyber threats, hackings and breaches, malware, zero-day vulnerabilities, phishing, scams and so on.
Polygon ID Guru
Expert in Polygon ID, aiding in code writing and project building with ZK Proofs.