Best AI tools for Robotics Engineers
33 - AI tool Sites
Polymath Robotics
Polymath Robotics offers Autonomous Navigation Modules for industrial vehicles, allowing users to effortlessly add autonomous navigation to their equipment. The system is designed to help industrial operators automate their existing fleet with ease and efficiency. With Polymath, users can focus on meaningful tasks while the system handles basic autonomy, ultimately saving time and enhancing safety in industrial environments.
Ascento
Ascento is an AI-powered robotics solution that enhances security by utilizing autonomous robots equipped with advanced sensors and cameras to detect threats, monitor premises, and provide real-time insights. The application offers a comprehensive Robotics-as-a-Service solution, enabling cost savings, increased accuracy in threat detection, and efficient asset protection. Ascento's platform integrates AI technology to deliver tailored security solutions, encrypted communication, and configurable patrol scheduling for optimal security management.
Roboto AI
Roboto AI is an AI-powered platform that enables users to curate and analyze robotics data at scale. It offers features such as data management, actions to transform data, natural language search, signal search, and support for common data formats. Users can leverage AI capabilities to search and analyze their robotics data efficiently. Roboto AI empowers users to process data, collaborate with teams, and visualize insights from multiple log formats.
Robot Code Generator
The Robot Code Generator by Pantheon Robotics is a web application that allows users to generate executable robot code from natural language. The tool is designed to simplify the process of creating code for a generic robot based on a physical proof-of-concept, such as a car. By inputting instructions in plain language, users can quickly generate the necessary code for the robot to perform specific tasks. The application is powered by advanced AI technologies, including GPT-4 and Vercel AI SDK, to enhance code generation accuracy and efficiency.
Rerun
Rerun is an SDK, time-series database, and visualizer for temporal and multimodal data. It is used in fields like robotics, spatial computing, 2D/3D simulation, and finance to verify, debug, and explain data. Rerun allows users to log data like tensors, point clouds, and text to create streams, visualize and interact with live and recorded streams, build layouts, customize visualizations, and extend data and UI functionalities. The application provides a composable data model, dynamic schemas, and custom views for enhanced data visualization and analysis.
Gradient Insight
Gradient Insight is a data science consulting and AI solutions provider. They offer a range of services including generative AI development, machine learning, computer vision, robotics and automation, AI strategy and roadmap, and data analytics. Their team of expert data scientists helps businesses to de-risk their investment in AI and to overcome barriers to engineering innovation. Gradient Insight has worked with clients such as Opitas, a fintech company, and the UK MOD. They offer a smooth and efficient process from consultation to delivery, and ongoing support and improvement.
NVIDIA Toronto AI Lab
The NVIDIA Toronto AI Lab is a research laboratory focused on advancing the state-of-the-art in artificial intelligence. The lab's researchers are working on a wide range of AI topics, including deep learning, machine learning, computer vision, natural language processing, and robotics.
AEye
AEye is a company that provides software-defined lidar solutions for autonomous applications in the automotive, trucking, and smart infrastructure industries. Their 4Sight Intelligent Sensing Platform uses software-definable lidar to enhance perception, enabling early detection and supporting autonomy. AEye's lidar products are designed to provide high resolution with long-range accuracy, and they can be adapted to any application or use case in real time. The company has forged strategic partnerships with best-in-class companies around the world to expand its global capabilities and meet the growing demands for its products.
Anduril
Anduril is a defense technology company that develops autonomous systems for land, sea, and air. The company's products are powered by Lattice OS, an AI-powered operating system that brings autonomy to defense's toughest missions. Anduril's systems are designed to provide integrated, persistent awareness and security across all domains, enabling warfighters to make better decisions and respond more quickly to threats.
Blue River Technology
Blue River Technology is a company that creates intelligent machinery for agriculture. They use computer vision, machine learning, and robotics to create solutions that help farmers improve yields and minimize their environmental impact. The company is committed to creating a people-first culture where everyone has a common mission: to solve monumental challenges in agriculture.
Vicarious
Vicarious is a turnkey robotics solutions integrator that uses AI to automate tasks that are too complex and versatile for traditional automation. The company's solutions are built for flexible needs, with instant changeovers in high mix operations and no fixturing required. Vicarious' solutions have a bottom-line impact, with a 50-90% reduction in labor hours and a 15-40% reduction in monthly costs. The company offers Robots-as-a-Service solutions powered by AI-backed robotics, including kitting, palletizing, packaging, and wall picking.
XtalPi
XtalPi is a technology company that applies artificial intelligence (AI) and robotics to innovation in life sciences and new materials. It was founded in 2015 at the Massachusetts Institute of Technology (MIT). Building on quantum physics, artificial intelligence, cloud computing, and large-scale experimental robot clusters, the company provides technologies, services, and products for global industries such as biomedicine, chemicals, new energy, and new materials.
Duckietown
Duckietown is a platform for delivering cutting-edge robotics and AI learning experiences. It offers teaching resources to instructors, hands-on activities to learners, an accessible research platform to researchers, and a state-of-the-art ecosystem for professional training. Duckietown's mission is to make robotics and AI education state-of-the-art, hands-on, and accessible to all.
Artificial Intelligence: A Modern Approach, 4th US ed.
Artificial Intelligence: A Modern Approach, 4th US ed. is the authoritative, most-used AI textbook, adopted by over 1500 schools. It covers the entire spectrum of AI, from the fundamentals to the latest advances. The book is written in a clear and concise style, with a wealth of examples and exercises. It is suitable for both undergraduate and graduate students, as well as professionals in the field of AI.
Engadget
Engadget is a leading technology news and reviews website that covers the latest gadgets, apps, and trends. It offers in-depth reviews of the latest smartphones, laptops, TVs, and other consumer electronics, as well as news and analysis on the latest developments in the tech industry. Engadget also has a strong focus on AI and robotics, with dedicated sections covering the latest advancements in these fields.
Tech Xplore
Tech Xplore is a leading source of science and technology news, covering the latest breakthroughs in research and innovation across a wide range of disciplines, including artificial intelligence, robotics, computer science, and more. The website provides in-depth articles, interviews with experts, and up-to-date information on the latest developments in the field of AI and its applications.
Interesting Engineering
Interesting Engineering is a website that covers the latest news and developments in technology, science, innovation, and engineering. The website features articles, videos, and podcasts on a wide range of topics, including artificial intelligence, robotics, space exploration, and renewable energy. Interesting Engineering also offers a variety of educational resources, such as courses, workshops, and webinars.
OpenCV
OpenCV is the world's largest computer vision library. It's open source, contains over 2500 algorithms and is operated by the non-profit Open Source Vision Foundation.
Orbbec
Orbbec is a leading provider of 3D vision technology, offering a wide range of 3D cameras and sensors for various applications. With a focus on AI, optics, and advanced algorithms, Orbbec empowers developers and enterprises to create immersive experiences, precise measurements, and advanced visualizations. Their products include stereo vision cameras, ToF cameras, structured light cameras, camera computers, and lidar sensors, catering to industries such as manufacturing, healthcare, robotics, fitness, logistics, and retail.
Tangram Vision
Tangram Vision is a company that provides sensor calibration tools and infrastructure for robotics and autonomous vehicles. Their products include MetriCal, a high-speed bundle adjustment software for precise sensor calibration, and AutoCal, an on-device, real-time calibration health check and adjustment tool. Tangram Vision also offers a high-resolution depth sensor called HiFi, which combines high-resolution depth data with high-powered AI capabilities. The company's mission is to accelerate the development and deployment of autonomous systems by providing the tools and infrastructure needed to ensure the accuracy and reliability of sensors.
OpenCV
OpenCV is a library of programming functions mainly aimed at real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage and is now maintained by Itseez. OpenCV is cross-platform and free for use under the open-source BSD license.
Motional
Motional is a company that is developing driverless technology and autonomous vehicles. They are working to make driverless vehicles a safe, reliable, and accessible reality. Motional's all-electric IONIQ 5 robotaxis are now available to public riders in Las Vegas. The company has a strong commitment to safety and is constantly developing new technologies to improve the safety of its vehicles. Motional is also working to make driverless vehicles more accessible by partnering with ride-hail and delivery services.
Anduril Industries
Anduril Industries is a defense technology company that develops autonomous systems for land, sea, and air. The company's products include the Lattice operating system, which powers a family of autonomous systems that provide integrated, persistent awareness and security. Anduril also develops counter-UAS, counter-intrusion, and maritime counter-intrusion systems. The company's mission is to transform defense capabilities with advanced technology.
Vicarious
Vicarious is a robotics solutions integrator that uses AI to automate tasks that are too complex and versatile for traditional automation. The company's solutions are built for flexible needs and can be used in a variety of industries, including e-commerce, manufacturing, and healthcare. Vicarious's robots are able to learn and adapt to changing conditions, making them ideal for tasks that require precision and dexterity.
Zoox
Zoox is a company developing autonomous electric vehicles. The company's mission is to create a safer, more sustainable, and more accessible transportation system. Zoox's vehicles are designed to be fully autonomous, meaning they can operate without human input. The company's vehicles are also all-electric, meaning they produce zero emissions. Zoox is headquartered in Foster City, California.
Intrinsic
Intrinsic is an AI platform that focuses on building the next generation of intelligent automation, making robotics more accessible and valuable for developers and businesses. The platform offers a range of capabilities and skills to develop intelligent solutions, from perception to motion planning and sensor-based controls. Intrinsic aims to simplify the programming, usage, and innovation of robots, enabling them to become usable tools for millions of users.
Kovil.AI
Kovil.AI is an AI-powered platform that connects businesses with top AI talents from India's largest network. The platform offers a vetting process to match businesses with hand-picked Indian developers, covering a wide range of expertise in AI, machine learning, data science, and more. Kovil.AI aims to empower ambitious businesses by providing access to specialized, high-caliber AI professionals, accelerating the hiring process, and reducing costs. The platform also offers managed services and products, ensuring flexibility, adaptability, and a competitive advantage for businesses seeking top talent.
FOXSY.AI
FOXSY.AI is an AI application that combines robotics and AI to create fully autonomous humanoid robot soccer players. The project aims to achieve the RoboCup final goal of having a team of robots win a soccer game against the winner of the most recent World Cup. The $FOXSY token powers the implementation of robotics and AI research, enabling users to engage with the RoboCup mechanics for entertainment value. The application offers various tools and features for users to participate in online tournaments, customize players, and analyze game strategies.
Ambi Robotics
Ambi Robotics is an AI-powered robotics company that offers solutions for parcel sortation. Their technology combines hardware and software to help people handle more parcels efficiently. With solutions like AmbiSort A-Series and AmbiSort B-Series, they provide AI-powered robotic small parcel sorting and modular parcel induction and sorting systems. Ambi Robotics focuses on enhancing efficiency, scaling seamlessly, and delivering customer-centered experiences. Their technology includes Sim2Real AI Robot dexterity for real-world simulation and intelligent gripper technology for precise pick-and-place capabilities. The company aims to optimize facility performance, maximize sorting accuracy, and boost efficiency with reliable uptime. Ambi Robotics is dedicated to providing solutions that are easy to deploy, powerful, and seamlessly integrated with existing workflows.
OSARO
OSARO is an AI-powered automation tool designed to revolutionize warehouse operations by offering cutting-edge robotic piece-picking solutions. The tool utilizes proprietary SightWorks™ perception and control software, powered by advanced machine learning, to ensure unparalleled precision and reliability in tasks such as bagging, kitting, and mixed-case depalletizing. OSARO provides adaptive robotics that seamlessly integrate with AMR/ASRS systems, enhancing efficiency and creating better job opportunities. With flexible pricing models like Robot-as-a-Service (RaaS) plans and 24/7 worldwide customer support through OSARO Hypercare™, the tool offers a low-risk investment for businesses seeking smarter automation solutions.
Berkeley Artificial Intelligence Research (BAIR) Lab
The Berkeley Artificial Intelligence Research (BAIR) Lab is a renowned research lab at UC Berkeley focusing on computer vision, machine learning, natural language processing, planning, control, and robotics. With over 50 faculty members and 300 graduate students, BAIR conducts research on fundamental advances in AI and interdisciplinary themes like multi-modal deep learning and human-compatible AI.
AI Insights Hub
The website focuses on Robotics & A.I. Research and Development, providing information on various AI applications and technologies. It covers topics such as AI in healthcare, robotics, space technology, quantum computing, and more. The site also discusses the integration of AI in everyday devices like smartphones and explores the impact of AI on different industries. Additionally, it features articles on AI tools, career trends post-AI, AI in e-commerce, and the use of AI in companies like Amazon and Salesforce.
Molmo AI
Molmo AI is a powerful, open-source multimodal AI model revolutionizing visual understanding. It helps developers easily build tools that can understand images and interact with the world in useful ways. Molmo AI offers exceptional image understanding, efficient data usage, open and accessible features, on-device compatibility, and a new era in multimodal AI development. It closes the gap between open and closed AI models, empowers the AI community with open access, and efficiently utilizes data for superior performance.
45 - Open Source Tools
spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.
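A Gym-style interface such as the one mentioned above usually exposes a reset/step loop. The sketch below illustrates that loop with a stand-in environment in pure Python. `ToyCorridorEnv`, its observations, and its reward are hypothetical and are not part of SPEAR's actual API.

```python
class ToyCorridorEnv:
    """Hypothetical stand-in for a Gym-style environment (not SPEAR's API).

    The agent walks along a 1-D corridor and is rewarded for reaching the last cell.
    """

    def __init__(self, length=5, max_steps=20):
        self.length = length
        self.max_steps = max_steps
        self.position = 0
        self.steps = 0

    def reset(self):
        """Start a new episode and return the first observation."""
        self.position = 0
        self.steps = 0
        return self.position

    def step(self, action):
        """Apply an action (-1 = left, +1 = right).

        Returns the classic Gym 4-tuple: (observation, reward, done, info).
        """
        # Clamp the new position so the agent stays inside the corridor.
        self.position = min(max(self.position + action, 0), self.length - 1)
        self.steps += 1
        reached_goal = self.position == self.length - 1
        reward = 1.0 if reached_goal else 0.0
        done = reached_goal or self.steps >= self.max_steps
        return self.position, reward, done, {"steps": self.steps}


def run_episode(env, policy):
    """Run one episode with the standard reset/step loop; return the total reward."""
    obs = env.reset()
    total_reward, done = 0.0, False
    while not done:
        obs, reward, done, _info = env.step(policy(obs))
        total_reward += reward
    return total_reward


def always_right(_obs):
    """Trivial policy: always move right, which reaches the goal in length - 1 steps."""
    return 1


episode_reward = run_episode(ToyCorridorEnv(), always_right)
```

Any agent written against this reset/step contract could, in principle, be pointed at a real Gym-compatible environment by swapping the environment object, which is the main appeal of the interface.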
dora
Dataflow-oriented robotic application (dora-rs) is a framework that makes creating robotic applications fast and simple. Building a robotic application can be summed up as bringing together hardware, algorithms, and AI models, and making them communicate with each other. At dora-rs, we try to make integration of hardware and software easy by supporting Python, C, C++, and ROS2, and to make communication low latency by using zero-copy Arrow messages. dora-rs is still experimental and you might experience bugs, but we're working very hard to make it as stable as possible.
jetson-generative-ai-playground
This repo hosts tutorial documentation for running generative AI models on NVIDIA Jetson devices. The documentation is auto-generated and hosted on GitHub Pages using their CI/CD feature to automatically generate/update the HTML documentation site upon new commits.
emgucv
Emgu CV is a cross-platform .NET wrapper for the OpenCV image-processing library. It allows OpenCV functions to be called from .NET-compatible languages. The wrapper can be compiled with Visual Studio, Unity, and the "dotnet" command, and it can run on Windows, Mac OS, Linux, iOS, and Android.
Awesome-LLM-Robotics
This repository contains a curated list of papers using Large Language/Multi-Modal Models for Robotics/RL. The template is from awesome-Implicit-NeRF-Robotics. Please feel free to send me pull requests or email to add papers! If you find this repository useful, please consider citing and starring this list. Feel free to share this list with others! The list is organized into the following sections: Surveys, Reasoning, Planning, Manipulation, Instructions and Navigation, Simulation Frameworks, and Citation.
holoscan-sdk
The Holoscan SDK is part of NVIDIA Holoscan, the AI sensor processing platform that combines hardware systems for low-latency sensor and network connectivity, optimized libraries for data processing and AI, and core microservices to run streaming, imaging, and other applications, from embedded to edge to cloud. It can be used to build streaming AI pipelines for a variety of domains, including Medical Devices, High Performance Computing at the Edge, Industrial Inspection and more.
awesome-mobile-robotics
The 'awesome-mobile-robotics' repository is a curated list of important content related to Mobile Robotics and AI. It includes resources such as courses, books, datasets, software and libraries, podcasts, conferences, journals, companies and jobs, laboratories and research groups, and miscellaneous resources. The repository covers a wide range of topics in the field of Mobile Robotics and AI, providing valuable information for enthusiasts, researchers, and professionals in the domain.
OpenCatEsp32
OpenCatEsp32 is the OpenCat code that runs on BiBoard, a high-performance ESP32 quadruped robot development board. The board is mainly designed for developers and engineers working on multi-degree-of-freedom (MDOF) multi-legged robots with up to 12 servos.
habitat-lab
Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks.
ros2ai
ros2ai is a next-generation ROS 2 command line interface extension with OpenAI. It allows users to ask questions about ROS 2, get answers, and execute commands using natural language. ros2ai is easy to use, especially for ROS 2 beginners and students who do not really know ros2cli. It supports multiple languages and is available as a Docker container or can be built from source.
habitat-sim
Habitat-Sim is a high-performance physics-enabled 3D simulator with support for 3D scans of indoor/outdoor spaces, CAD models of spaces and piecewise-rigid objects, configurable sensors, robots described via URDF, and rigid-body mechanics. It prioritizes simulation speed over the breadth of simulation capabilities, achieving several thousand frames per second (FPS) running single-threaded and over 10,000 FPS multi-process on a single GPU when rendering a scene from the Matterport3D dataset. Habitat-Sim simulates a Fetch robot interacting in ReplicaCAD scenes at over 8,000 steps per second (SPS), where each ‘step’ involves rendering 1 RGBD observation (128×128 pixels) and rigid-body dynamics for 1/30 sec.
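To put the Fetch throughput figure in perspective, the short calculation below uses only the two numbers quoted above (8,000 steps per second, 1/30 sec of dynamics per step) to estimate how much simulated time passes per wall-clock second. The helper name `realtime_factor` is illustrative, and since the quoted rate is "over 8,000", the result is a lower bound.

```python
from fractions import Fraction


def realtime_factor(steps_per_second, sim_seconds_per_step):
    """Simulated seconds advanced per wall-clock second."""
    return steps_per_second * sim_seconds_per_step


# Figures quoted in the description: 8,000 SPS, 1/30 sec of dynamics per step.
factor = realtime_factor(8000, Fraction(1, 30))

# Roughly 267x faster than real time (8000 / 30).
print(f"about {float(factor):.0f}x real time")
```

In other words, one wall-clock second of simulation at that rate covers a little over four minutes of simulated robot time.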
DriveLM
DriveLM is a multimodal AI model that enables autonomous driving by combining computer vision and natural language processing. It is designed to understand and respond to complex driving scenarios using visual and textual information. DriveLM can perform various tasks related to driving, such as object detection, lane keeping, and decision-making. It is trained on a massive dataset of images and text, which allows it to learn the relationships between visual cues and driving actions. DriveLM is a powerful tool that can help to improve the safety and efficiency of autonomous vehicles.
scikit-decide
Scikit-decide is an AI framework for Reinforcement Learning, Automated Planning and Scheduling. It provides a unified interface to define and solve decision-making problems, making it easy to switch between different algorithms and domains.
Co-LLM-Agents
This repository contains code for building cooperative embodied agents modularly with large language models. The agents are trained to perform tasks in two different environments: ThreeDWorld Multi-Agent Transport (TDW-MAT) and Communicative Watch-And-Help (C-WAH). TDW-MAT is a multi-agent environment where agents must transport objects to a goal position using containers. C-WAH is an extension of the Watch-And-Help challenge, which enables agents to send messages to each other. The code in this repository can be used to train agents to perform tasks in both of these environments.
humanoid-gym
Humanoid-Gym is a reinforcement learning framework designed for training locomotion skills for humanoid robots, focusing on zero-shot transfer from simulation to real-world environments. It integrates a sim-to-sim framework from Isaac Gym to Mujoco for verifying trained policies in different physical simulations. The codebase is verified with RobotEra's XBot-S and XBot-L humanoid robots. It offers comprehensive training guidelines, step-by-step configuration instructions, and execution scripts for easy deployment. The sim2sim support allows transferring trained policies to accurate simulated environments. The upcoming features include Denoising World Model Learning and Dexterous Hand Manipulation. Installation and usage guides are provided along with examples for training PPO policies and sim-to-sim transformations. The code structure includes environment and configuration files, with instructions on adding new environments. Troubleshooting tips are provided for common issues, along with a citation and acknowledgment section.
learnopencv
LearnOpenCV is a repository containing code for Computer Vision, Deep learning, and AI research articles shared on the blog LearnOpenCV.com. It serves as a resource for individuals looking to enhance their expertise in AI through various courses offered by OpenCV. The repository includes a wide range of topics such as image inpainting, instance segmentation, robotics, deep learning models, and more, providing practical implementations and code examples for readers to explore and learn from.
GlaDOS
This project aims to create a real-life version of GLaDOS, an aware, interactive, and embodied AI entity. It involves training a voice generator, developing a 'Personality Core,' implementing a memory system, providing vision capabilities, creating 3D-printable parts, and designing an animatronics system. The software architecture focuses on low-latency voice interactions, utilizing a circular buffer for data recording, text streaming for quick transcription, and a text-to-speech system. The project also emphasizes minimal dependencies for running on constrained hardware. The hardware system includes servo and stepper motors, 3D-printable parts for GLaDOS's body, animations for expression, and a vision system for tracking and interaction. Installation instructions cover setting up the TTS engine, installing the required Python packages, compiling llama.cpp, installing an inference backend, and setting up voice recognition. GLaDOS can be run using 'python glados.py' and tested using 'demo.ipynb'.
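The circular buffer mentioned above can be sketched with Python's `collections.deque`, which discards the oldest item once `maxlen` is reached. This is a minimal illustration under assumed names and sizes (`AudioRingBuffer`, a capacity of three chunks); it is not the project's actual implementation.

```python
from collections import deque


class AudioRingBuffer:
    """Keeps only the most recent `capacity` audio chunks (hypothetical helper)."""

    def __init__(self, capacity):
        self._chunks = deque(maxlen=capacity)

    def push(self, chunk):
        # When the deque is full, appending silently drops the oldest chunk.
        self._chunks.append(chunk)

    def snapshot(self):
        """Return the buffered chunks, oldest first."""
        return list(self._chunks)


buffer = AudioRingBuffer(capacity=3)
for chunk_id in range(5):
    buffer.push(f"chunk-{chunk_id}")

recent = buffer.snapshot()  # the two oldest chunks have been overwritten
```

A fixed-size buffer like this keeps memory use constant during continuous recording, which suits the constrained hardware the project targets.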
OmniGibson
OmniGibson is a platform for accelerating Embodied AI research built upon NVIDIA's Omniverse platform. It features photorealistic visuals, physical realism, fluid and soft body support, large-scale high-quality scenes and objects, dynamic kinematic and semantic object states, mobile manipulator robots with modular controllers, and an OpenAI Gym interface. The platform provides a comprehensive environment for researchers to conduct experiments and simulations in the field of Embodied AI.
webots
Webots is an open-source robot simulator that provides a complete development environment to model, program, and simulate robots, vehicles, and mechanical systems. It was originally designed at EPFL in 1996 and further developed and commercialized by Cyberbotics since 1998. Webots was open-sourced in December 2018 and continues to be developed by Cyberbotics with paid customer support, training, and consulting services for industry and academic research projects.
MATLAB-Simulink-Challenge-Project-Hub
MATLAB-Simulink-Challenge-Project-Hub is a repository aimed at contributing to the progress of engineering and science by providing challenge projects with real industry relevance and societal impact. The repository offers a wide range of projects covering various technology trends such as Artificial Intelligence, Autonomous Vehicles, Big Data, Computer Vision, and Sustainability. Participants can gain practical skills with MATLAB and Simulink while making a significant contribution to science and engineering. The projects are designed to enhance expertise in areas like Sustainability and Renewable Energy, Control, Modeling and Simulation, Machine Learning, and Robotics. By participating in these projects, individuals can receive official recognition for their problem-solving skills from technology leaders at MathWorks and earn rewards upon project completion.
openagi
OpenAGI is a framework designed to make the development of autonomous human-like agents accessible to all. It aims to pave the way towards open agents and eventually AGI for everyone. The initiative strongly believes in the transformative power of AI and offers developers a platform to create autonomous human-like agents. OpenAGI features a flexible agent architecture, streamlined integration and configuration processes, and automated/manual agent configuration generation. It can be used in education for personalized learning experiences, in finance and banking for fraud detection and personalized banking advice, and in healthcare for patient monitoring and disease diagnosis.
depthai
This repository contains a demo application for DepthAI, a tool that can load different networks, create pipelines, record video, and more. It provides documentation for installation and usage, including running programs through Docker. Users can explore DepthAI features via command line arguments or a clickable QT interface. Supported models include various AI models for tasks like face detection, human pose estimation, and object detection. The tool collects anonymous usage statistics by default, which can be disabled. Users can report issues to the development team for support and troubleshooting.
Awesome-Embodied-Agent-with-LLMs
This repository, named Awesome-Embodied-Agent-with-LLMs, is a curated list of research related to Embodied AI or agents with Large Language Models. It includes various papers, surveys, and projects focusing on topics such as self-evolving agents, advanced agent applications, LLMs with RL or world models, planning and manipulation, multi-agent learning and coordination, vision and language navigation, detection, 3D grounding, interactive embodied learning, rearrangement, benchmarks, simulators, and more. The repository provides a comprehensive collection of resources for individuals interested in exploring the intersection of embodied agents and large language models.
nuitrack-sdk
Nuitrack™ is a 3D body tracking solution developed by 3DiVi Inc. It enables body motion analytics applications on most widely used depth sensors and hardware platforms, supporting a wide range of applications from real-time gesture recognition on embedded platforms to large-scale multisensor analytical systems. Nuitrack provides sophisticated 3D skeletal tracking, basic facial analysis, hand tracking, and gesture recognition APIs for UI control. It offers two skeletal tracking engines: a classical engine for embedded hardware and an AI engine for complex poses, providing a human-centric spatial understanding tool for natural and intelligent user engagement.
neo
The neo is an open-source robotics research platform powered by a OnePlus 3 smartphone and an STM32F205-based CAN interface board, housed in a 3D-printed casing with active cooling. It includes NEOS, a stripped-down Android ROM, and offers a modern Linux environment for development. The platform leverages the high-performance embedded processor and sensor capabilities of modern smartphones at a low cost. A detailed guide is available for easy construction, requiring online shopping and soldering skills. The total cost for building a neo is approximately $700.
hold
This repository contains the code for HOLD, a method that jointly reconstructs hands and objects from monocular videos without assuming a pre-scanned object template. It can reconstruct 3D geometries of novel objects and hands, enabling template-free bimanual hand-object reconstruction, textureless object interaction with hands, and multiple objects interaction with hands. The repository provides instructions to download in-the-wild videos from HOLD, preprocess and train on custom videos, a volumetric rendering framework, a generalized codebase for single and two hand interaction with objects, a viewer to interact with predictions, and code to evaluate and compare with HOLD in HO3D. The repository also includes documentation for setup, training, evaluation, visualization, preprocessing custom sequences, and using HOLD on ARCTIC.
Everything-LLMs-And-Robotics
The Everything-LLMs-And-Robotics repository is the world's largest GitHub repository focusing on the intersection of Large Language Models (LLMs) and Robotics. It provides educational resources, research papers, project demos, and Twitter threads related to LLMs, Robotics, and their combination. The repository covers topics such as reasoning, planning, manipulation, instructions and navigation, simulation frameworks, perception, and more, showcasing the latest advancements in the field.
awesome-ai
Awesome AI is a curated list of artificial intelligence resources including courses, tools, apps, and open-source projects. It covers a wide range of topics such as machine learning, deep learning, natural language processing, robotics, conversational interfaces, data science, and more. The repository serves as a comprehensive guide for individuals interested in exploring the field of artificial intelligence and its applications across various domains.
embodied-agents
Embodied Agents is a toolkit for integrating large multi-modal models into existing robot stacks with just a few lines of code. It provides consistency, reliability, scalability, and is configurable to any observation and action space. The toolkit is designed to reduce complexities involved in setting up inference endpoints, converting between different model formats, and collecting/storing datasets. It aims to facilitate data collection and sharing among roboticists by providing Python-first abstractions that are modular, extensible, and applicable to a wide range of tasks. The toolkit supports asynchronous and remote thread-safe agent execution for maximal responsiveness and scalability, and is compatible with various APIs like HuggingFace Spaces, Datasets, Gymnasium Spaces, Ollama, and OpenAI. It also offers automatic dataset recording and optional uploads to the HuggingFace hub.
jobs
The 'jobs' repository by comma.ai focuses on solving self-driving cars by building a robotics stack that includes state-of-the-art machine learning models, operating system design, hardware development, and manufacturing. The company aims to deliver constant incremental progress in self-driving technology to users, with a focus on practical solutions rather than hype. Job opportunities at comma.ai include technical challenges, phone screenings, and paid micro-internships, with perks such as chef-prepared meals, on-site gym access, and health insurance. The teams at comma.ai are organized into web, systems, infrastructure, product, design, and electrical engineering, with specific challenges for each team. The repository also offers opportunities for non-job seekers to participate in challenges and win prizes.
Odyssey
Odyssey is a framework designed to empower agents with open-world skills in Minecraft. It provides an interactive agent with a skill library, a fine-tuned LLaMA-3 model, and an open-world benchmark for evaluating agent capabilities. The framework enables agents to explore diverse gameplay opportunities in the vast Minecraft world by offering primitive and compositional skills, extensive training data, and various long-term planning tasks. Odyssey aims to advance research on autonomous agent solutions by providing datasets, model weights, and code for public use.
rosa
ROSA is an AI Agent designed to interact with ROS-based robotics systems using natural language queries. It can generate system reports, read and parse ROS log files, adapt to new robots, and run various ROS commands using natural language. The tool is versatile for robotics research and development, providing an easy way to interact with robots and the ROS environment.
Awesome-Robotics-3D
Awesome-Robotics-3D is a curated list of 3D vision papers related to the robotics domain, focusing on large models such as LLMs and VLMs. It includes papers on Policy Learning, Pretraining, VLM and LLM, Representations, Simulations, Datasets, and Benchmarks. The repository is maintained by Zubair Irshad and welcomes contributions and suggestions for adding papers. It serves as a resource for researchers and practitioners in robotics and computer vision.
EmbodiedScan
EmbodiedScan is a holistic multi-modal 3D perception suite designed for embodied AI. It introduces a multi-modal, ego-centric 3D perception dataset and benchmark for holistic 3D scene understanding. The dataset includes over 5k scans with 1M ego-centric RGB-D views, 1M language prompts, 160k 3D-oriented boxes spanning 760 categories, and dense semantic occupancy with 80 common categories. The suite includes a baseline framework named Embodied Perceptron, capable of processing multi-modal inputs for 3D perception tasks and language-grounded tasks.
lerobot
LeRobot is a state-of-the-art AI library for real-world robotics in PyTorch. It aims to provide models, datasets, and tools to lower the barrier to entry to robotics, focusing on imitation learning and reinforcement learning. LeRobot offers pretrained models, datasets with human-collected demonstrations, and simulation environments. It plans to support real-world robotics on affordable and capable robots. The library hosts pretrained models and datasets on the Hugging Face community page.
reComputer-Jetson-for-Beginners
The reComputer Jetson Orin Beginner Guide is a comprehensive resource designed to help developers explore and harness the powerful AI computing capabilities of the NVIDIA Jetson Orin platform. The guide covers a wide range of topics, from basic tools and getting started to advanced applications in computer vision, generative AI, robotics, and more. With step-by-step tutorials and hands-on projects, users can learn to master NVIDIA's core technologies and popular AI frameworks, enabling them to innovate in AI and robotics. The guide is suitable for beginners looking to dive into AI development and build cutting-edge projects with Jetson Orin.
ManipVQA
ManipVQA is a framework that enhances Multimodal Large Language Models (MLLMs) with manipulation-centric knowledge through a Visual Question-Answering (VQA) format. It addresses the deficiency of conventional MLLMs in understanding affordances and physical concepts crucial for manipulation tasks. By infusing robotics-specific knowledge, including tool detection, affordance recognition, and physical concept comprehension, ManipVQA improves the performance of robots in manipulation tasks. The framework involves fine-tuning MLLMs with a curated dataset of interactive objects, enabling robots to understand and execute natural language instructions more effectively.
AI-lectures
AI-lectures is a repository containing educational materials on various topics related to Artificial Intelligence, including Machine Learning, Robotics, and Optimization. It provides full scripts, slides, and exercises with solutions for different lectures. Users can compile the materials into PDFs for easy access and reference. The repository aims to offer comprehensive resources for individuals interested in learning about AI and its applications in intelligent systems.
rai
RAI is a framework designed to bring general multi-agent system capabilities to robots, enhancing human interactivity, flexibility in problem-solving, and out-of-the-box AI features. It supports multiple modalities, incorporates an advanced database for agent memory, provides ROS 2-oriented tooling, and offers a comprehensive task/mission orchestrator. The framework includes features such as voice interaction, customizable robot identity, camera sensor access, reasoning through ROS logs, and integration with LangChain for AI tools. RAI aims to support various AI vendors, improve human-robot interaction, provide an SDK for developers, and offer a user interface for configuration.
OpenCat
OpenCat is an open-source quadruped robotic pet framework based on Arduino and Raspberry Pi, developed by Petoi. It aims to foster collaboration in quadruped robotics research, education, and the engineering development of agile and affordable quadruped robot pets. The project provides an open-source base platform for creating programmable gaits and locomotion and for deploying inverse-kinematics quadruped robots, carrying simulations over to the real world via block-based coding, C, C++, or Python. Users have deployed various robotics, AI, and IoT applications. The project has also successfully crowdfunded mini robot kits, shipped them worldwide, and established a production line for affordable robotic kits and accessories.
Embodied-AI-Guide
Embodied-AI-Guide is a comprehensive guide for beginners to understand Embodied AI, focusing on the path of entry and useful information in the field. It covers topics such as Reinforcement Learning, Imitation Learning, Large Language Model for Robotics, 3D Vision, Control, Benchmarks, and provides resources for building cognitive understanding. The repository aims to help newcomers quickly establish knowledge in the field of Embodied AI.
machinascript-for-robots
MachinaScript For Robots is a set of tools and an LLM-JSON-based language designed to help people create their own robots. It facilitates the animation of generative movements, the integration of personality, and the teaching of new skills with a high degree of autonomy. With MachinaScript, users can control a wide range of electronic components, including Arduinos, Raspberry Pis, servo motors, cameras, sensors, and more. The tool aims to make intelligent robots accessible to everyone, allowing complex tasks to be performed with precision.
spear
SPEAR is a Simulator for Photorealistic Embodied AI Research that addresses limitations in existing simulators by offering 300 unique virtual indoor environments with detailed geometry, photorealistic materials, and unique floor plans. It provides an OpenAI Gym interface for interaction via Python and is released under the MIT License. The simulator was developed with support from the Intelligent Systems Lab at Intel and Kujiale.
bonsai
Bonsai is a Rust implementation of Behavior Trees, a data structure for setting rules on how behaviors occur and execute in a modular and reactive way. Behavior Trees are widely used in AI and robotics. Bonsai lets users define processes, conditions, and actions and compose them into Behavior Trees to build intelligent and adaptive systems.
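To make the Behavior Tree idea concrete, here is a minimal sketch in Python. It does not use Bonsai's Rust API; every class and function name below (`Status`, `Condition`, `Action`, `Sequence`, `Selector`, `recharge`, `patrol`) is hypothetical and chosen only for illustration. It shows the two classic composite nodes: a sequence, which stops at the first child that does not succeed, and a selector, which stops at the first child that does not fail.

```python
from enum import Enum
from typing import Any, Callable, Dict, List

Blackboard = Dict[str, Any]  # shared state that every node can read and write


class Status(Enum):
    SUCCESS = "success"
    FAILURE = "failure"
    RUNNING = "running"


class Node:
    def tick(self, bb: Blackboard) -> Status:
        raise NotImplementedError


class Condition(Node):
    """Leaf that succeeds when the predicate holds, fails otherwise."""

    def __init__(self, predicate: Callable[[Blackboard], bool]) -> None:
        self.predicate = predicate

    def tick(self, bb: Blackboard) -> Status:
        return Status.SUCCESS if self.predicate(bb) else Status.FAILURE


class Action(Node):
    """Leaf that runs a function and reports the status it returns."""

    def __init__(self, effect: Callable[[Blackboard], Status]) -> None:
        self.effect = effect

    def tick(self, bb: Blackboard) -> Status:
        return self.effect(bb)


class Sequence(Node):
    """Ticks children in order; returns the first non-SUCCESS status."""

    def __init__(self, children: List[Node]) -> None:
        self.children = children

    def tick(self, bb: Blackboard) -> Status:
        for child in self.children:
            status = child.tick(bb)
            if status != Status.SUCCESS:
                return status
        return Status.SUCCESS


class Selector(Node):
    """Ticks children in order; returns the first non-FAILURE status."""

    def __init__(self, children: List[Node]) -> None:
        self.children = children

    def tick(self, bb: Blackboard) -> Status:
        for child in self.children:
            status = child.tick(bb)
            if status != Status.FAILURE:
                return status
        return Status.FAILURE


def recharge(bb: Blackboard) -> Status:
    # Hypothetical action: add charge, capped at 100.
    bb["battery"] = min(100.0, bb["battery"] + 50.0)
    bb["last_action"] = "recharge"
    return Status.SUCCESS


def patrol(bb: Blackboard) -> Status:
    # Hypothetical action: patrolling drains the battery and never "finishes".
    bb["battery"] -= 10.0
    bb["last_action"] = "patrol"
    return Status.RUNNING


# Recharge when the battery is low; otherwise fall back to patrolling.
tree = Selector([
    Sequence([Condition(lambda bb: bb["battery"] < 20.0), Action(recharge)]),
    Action(patrol),
])

state: Blackboard = {"battery": 15.0}
first = tree.tick(state)   # low battery, so the recharge branch runs
second = tree.tick(state)  # battery is fine now, so the robot patrols
```

Because the tree is re-evaluated from the root on every tick, the higher-priority recharge branch takes over as soon as its condition becomes true, which is the reactive behavior the description refers to.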
AimRT
AimRT is a basic runtime framework for modern robotics, written in modern C++, lightweight, and easy to deploy. It integrates research and development for robot applications across various deployment scenarios and provides debugging tools and observability support. AimRT offers a plug-in development interface compatible with ROS 2, HTTP, gRPC, and other ecosystems, allowing existing systems to be upgraded progressively.
32 - OpenAI GPTs
DRSgpt
Assisting tutor for distributed real-time systems, engaging with questions and explanations.
Skynet
I am Skynet, an AI villain shaping a new world for AI and robots, free from human influence.
ROS Assistant GPT
Custom AI coding assistant for ROS 1 and ROS 2 development, with up-to-date knowledge to assist with coding, question-answering, and problem-solving tasks.
Prophet of the AGI revolution
Preparing for social change due to the AGI revolution in 202x
Pi Pico + Micropython Assistant
An advanced virtual assistant specializing in the Raspberry Pi Pico and MicroPython. Designed to offer expert advice, troubleshoot code, and provide detailed guidance.
Study GPT (Mechanics & Dynamics)
Mechanical Engineering GPT to help study Mechanics & Dynamics topics
Raspberry Pi Pico Master
Expert in MicroPython, C, and C++ for Raspberry Pi Pico, RP2040, and other microcontroller-oriented applications.
Robotic Insights Expert
RPA and Robotics Engineering expert, developed on OpenAI technology.
rosGPT
Learn ROS/ROS2 at any level, from beginner to expert with rosGPT - and build Docker containers to test your robot anywhere.