awesome-object-detection-datasets
A collection of some awesome public object detection and recognition datasets.
Stars: 67
This repository is a curated list of awesome public object detection and recognition datasets. It includes a wide range of datasets related to object detection and recognition tasks, such as general detection and recognition datasets, autonomous driving datasets, adverse weather datasets, person detection datasets, anti-UAV datasets, optical aerial imagery datasets, low-light image datasets, infrared image datasets, SAR image datasets, multispectral image datasets, 3D object detection datasets, vehicle-to-everything field datasets, super-resolution field datasets, and face detection and recognition datasets. The repository also provides information on tools for data annotation, data augmentation, and data management related to object detection tasks.
README:
🔥🔥🔥 This repository lists some awesome public object detection and recognition datasets.
Awesome-Object-Detection-Datasets
- Summary
- General Detection and Recognition Datasets
- Autonomous Driving Datasets
- Adverse Weather Datasets
- Person Detection Datasets
- Anti-UAV Datasets
- Optical Aerial Imagery Datasets
- Low-light Image Datasets
- Infrared Image Datasets
- SAR Image Datasets
- Multispectral Image Datasets
- 3D Object Detection Datasets
- Vehicle-to-Everything Field Datasets
- Super-Resolution Field Datasets
- Face Detection and Recognition Datasets
- Blogs
- wenhwu/awesome-remote-sensing-change-detection : List of datasets, codes, and contests related to remote sensing change detection.
- ZHOUYI1023/awesome-radar-perception : A curated list of radar datasets, detection, tracking and fusion.
- lartpang/awesome-segmentation-saliency-dataset : A collection of some datasets for segmentation / saliency detection. Welcome to PR...😄
- TianhaoFu/Awesome-3D-Object-Detection : Papers, code and datasets about deep learning for 3D Object Detection.
- xahidbuffon/Awesome_Underwater_Datasets : Pointers to large-scale underwater datasets and relevant resources.
- M-3LAB/awesome-industrial-anomaly-detection : Paper list and datasets for industrial image anomaly detection.
- ZhangXiwuu/Awesome_visual_place_recognition_datasets : A curated list of Visual Place Recognition (VPR) / loop closure detection (LCD) datasets.
- ari-dasci/OD-WeaponDetection : Datasets for weapon detection based on image classification and object detection tasks.
- DLLXW/objectDetectionDatasets : Building object detection datasets: scripts for creating common dataset formats such as VOC, COCO, and YOLO, and for converting between them.
- codingonion/awesome-object-detection-and-recognition-datasets : A collection of some awesome public object detection and recognition datasets.
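- OpenDataLab : OpenDataLab is an open data platform built by the large-model data infrastructure team at Shanghai AI Laboratory. It is now the designated open-source data service platform of the China Large Model Corpus Data Alliance, providing developers with full-chain AI data support, addressing the risks and challenges of data processing, and advancing AI research and applications.
- Science Data Bank (ScienceDB) : Make your research data citable, discoverable, and persistently accessible; satisfy flexible data sharing requirements; facilitate data dissemination and reuse. Science Data Bank (ScienceDB) is a public, general-purpose data repository that provides data services (e.g. data acquisition, long-term preservation, publishing, sharing, and access) for researchers, research projects/teams, journals, institutions, universities, etc. It supports a variety of data acquisition methods and data licenses. ScienceDB is dedicated to making data findable, citable, and reusable while protecting the rights and interests of data owners. It is built and operated by the Computer Network Information Center, Chinese Academy of Sciences.
- China Scientific Data (中国科学数据) : China Scientific Data (Chinese and English online edition; CN11-6035/N, ISSN 2096-2223) is currently the only academic journal in China dedicated to publishing scientific data across multiple disciplines. One of the first pilot national online serial publications, it is supervised by the Chinese Academy of Sciences, co-sponsored by the Computer Network Information Center of the Chinese Academy of Sciences and the Chinese National Committee for ISC CODATA, and guided by the National Science and Technology Infrastructure Center and the Office of the Leading Group for Cybersecurity and Informatization of the Chinese Academy of Sciences. It is a quarterly, published in Chinese and English and distributed in China and abroad. It is a source journal of the Chinese Science Citation Database (CSCD) and a Chinese core science and technology journal, and is included in the China Association for Science and Technology's graded catalog of high-quality scientific journals.
- 飞桨AI Studio (PaddlePaddle AI Studio) : Open datasets on PaddlePaddle AI Studio.
- 极市开发者平台 : Open datasets on the 极市 developer platform.
- openvinotoolkit/datumaro : Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.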
- Label Studio : Label Studio is a multi-type data labeling and annotation tool with standardized output format. labelstud.io
- AnyLabeling : Effortless data labeling with AI support from YOLO and Segment Anything! AnyLabeling = LabelImg + Labelme + Improved UI + Auto-labeling.
- LabelImg : 🖍️ LabelImg is a graphical image annotation tool for labeling object bounding boxes in images.
- labelme : Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
- DarkLabel : Video/Image Labeling and Annotation Tool.
- AlexeyAB/Yolo_mark : GUI for marking bounded boxes of objects in images for training neural network Yolo v3 and v2.
- Cartucho/OpenLabeling : Label images and video for Computer Vision applications.
- CVAT : Computer Vision Annotation Tool (CVAT). Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
- VoTT : Visual Object Tagging Tool: An electron app for building end to end Object Detection Models from Images and Videos.
- WangRongsheng/KDAT : A full-pipeline annotation toolkit for object detection in computer vision. Full name: Kill Object Detection Annotation Tools.
- Rectlabel-support : RectLabel - An image annotation tool to label images for bounding box object detection and segmentation.
- cnyvfang/labelGo-Yolov5AutoLabelImg : 💕A graphical semi-automatic annotation tool based on labelImg and YOLOv5.💕
- CVUsers/Auto_maker : Open-source automatic data annotator for deep learning, covering object detection and image classification (high accuracy, high efficiency).
- MyVision : Computer vision based ML training data generation tool 🚀
- wufan-tb/AutoLabelImg : auto-labelimg based on yolov5, with many other useful tools. AutoLabelImg is a multi-function automatic annotation tool.
- MrZander/YoloMarkNet : Darknet YOLOv2/3 annotation tool written in C#/WPF.
- mahxn0/Yolov3_ForTextLabel : An automatic annotation tool for objects and natural-scene text, based on YOLOv3.
- MNConnor/YoloV5-AI-Label : YoloV5 AI Assisted Labeling.
- LILINOpenGitHub/Labeling-Tool : Free YOLO AI labeling tool. YOLO AI labeling tool is a Windows app for labeling YOLO dataset.
- whs0523003/YOLOv5_6.1_autolabel : Automatic labeling of object bounding boxes with YOLOv5_6.1.
- 2vin/PyYAT : Semi-Automatic Yolo Annotation Tool In Python.
- AlturosDestinations/Alturos.ImageAnnotation : A collaborative tool for labeling image data for yolo.
- stephanecharette/DarkMark : Marking up images for use with Darknet.
- 2vin/yolo_annotation_tool : Annotation tool for YOLO in opencv.
- sanfooh/quick_yolo2_label_tool : Quick yolo2 label tool.
- folkien/yaya : YAYA - Yet another YOLO annotator for images (in QT5). Supports yolo format, image modifications, labeling and detecting with previously trained detector.
- pylabel-project/pylabel : Python library for computer vision labeling tasks. The core functionality is to translate bounding box annotations between different formats, for example, from coco to yolo.
- opendatalab/labelU : Uniform, Unlimited, Universal and Unbelievable Annotation Toolbox.
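Several of the tools listed above convert bounding box annotations between formats such as COCO and YOLO. The following is a minimal sketch of that conversion for a single box, written in plain Python rather than with any listed tool's API. It assumes the usual conventions: a COCO box is `[x_min, y_min, width, height]` in pixels, and a YOLO box is center x, center y, width, and height, each normalized by the image size. The function names `coco_to_yolo` and `yolo_line` are hypothetical.

```python
def coco_to_yolo(bbox, img_w, img_h):
    """Convert a COCO box [x_min, y_min, w, h] (pixels) to a YOLO box
    (x_center, y_center, w, h), each normalized to the range 0..1."""
    x_min, y_min, w, h = bbox
    if img_w <= 0 or img_h <= 0:
        raise ValueError("image dimensions must be positive")
    x_center = (x_min + w / 2) / img_w
    y_center = (y_min + h / 2) / img_h
    return (x_center, y_center, w / img_w, h / img_h)


def yolo_line(class_id, bbox, img_w, img_h):
    """Format one YOLO label line: 'class x_center y_center w h'."""
    values = coco_to_yolo(bbox, img_w, img_h)
    return " ".join([str(class_id)] + [f"{v:.6f}" for v in values])


if __name__ == "__main__":
    # A 100x50 box whose top-left corner is at (50, 100) in a 200x400 image.
    print(yolo_line(0, [50, 100, 100, 50], 200, 400))
    # prints: 0 0.500000 0.312500 0.500000 0.125000
```

Class IDs usually also need remapping, since COCO category IDs are not necessarily contiguous while YOLO class indices start at 0; that step is left out of this sketch.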
- Albumentations : Albumentations is a Python library for image augmentation. Image augmentation is used in deep learning and computer vision tasks to increase the quality of trained models. The purpose of image augmentation is to create new training samples from the existing data. "Albumentations: Fast and Flexible Image Augmentations". (Information 2020)
- doubleZ0108/Data-Augmentation : General Data Augmentation Algorithms for Object Detection (esp. Yolo).
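For object detection, an augmentation has to transform the labels together with the pixels. The sketch below illustrates this with a horizontal flip. It is plain Python and does not use the Albumentations API; it assumes the image is a list of pixel rows and that boxes are YOLO-style tuples normalized to 0..1. The function name `hflip_sample` is hypothetical.

```python
def hflip_sample(image, boxes):
    """Horizontally flip an image and its YOLO-style boxes.

    image: list of rows, each row a list of pixel values.
    boxes: list of (class_id, x_center, y_center, w, h), normalized to 0..1.
    Returns a new (image, boxes) pair; the inputs are not modified.
    """
    flipped_image = [list(reversed(row)) for row in image]
    # Mirroring about the vertical axis only changes the x coordinate of
    # the box center; y, width, and height are unchanged.
    flipped_boxes = [(c, 1.0 - xc, yc, w, h) for (c, xc, yc, w, h) in boxes]
    return flipped_image, flipped_boxes


if __name__ == "__main__":
    image = [[1, 2, 3, 4],
             [5, 6, 7, 8]]
    boxes = [(0, 0.25, 0.5, 0.5, 1.0)]  # box covering the left half
    new_image, new_boxes = hflip_sample(image, boxes)
    print(new_image)  # [[4, 3, 2, 1], [8, 7, 6, 5]]
    print(new_boxes)  # [(0, 0.75, 0.5, 0.5, 1.0)]
```

Normalized coordinates keep the box update independent of image size; with pixel coordinates the new center would be the image width minus the old center.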
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds. Explore, manipulate and iterate on Computer Vision datasets with precision using simple APIs. Supports SQL filters, vector similarity search, native interface with Pandas and more.
- COCO : "Microsoft COCO: Common Objects in Context". (ECCV 2014)
- PASCAL VOC : "The Pascal Visual Object Classes Challenge: A Retrospective". (IJCV 2015)
- Objects365 : "Objects365: A Large-scale, High-quality Dataset for Object Detection". (ICCV 2019)
- V3Det : "V3Det: Vast Vocabulary Visual Detection Dataset". (arXiv 2023)
- TT100K : "Traffic-Sign Detection and Classification in the Wild". (CVPR 2016)
- CCTSDB : CSUST Chinese Traffic Sign Detection Benchmark. This Chinese traffic sign dataset was created by Zhang Jianming's team at the Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation, Changsha University of Science and Technology. "A Real-Time Chinese Traffic Sign Detection Algorithm Based on Modified YOLOv2". (Algorithms, 2017)
- CCTSDB2021 : "CCTSDB 2021: a more comprehensive traffic sign detection benchmark". (Human-centric Computing and Information Sciences, 2022)
- RESIDE : "Benchmarking Single-Image Dehazing and Beyond". (IEEE Transactions on Image Processing 2018)
- INRIA Person : "Histograms of oriented gradients for human detection". (CVPR 2005)
- CrowdHuman : "CrowdHuman: A Benchmark for Detecting Human in a Crowd". (arXiv 2018)
- PANDA : "PANDA: A Gigapixel-Level Human-Centric Video Dataset". (CVPR 2020)
- TinyPerson : "Scale Match for Tiny Person Detection". (WACV 2020)
- TinyPerson v2 | SeaPerson : "Object Localization Under Single Coarse Point Supervision". (CVPR 2022)
- Anti-UAV : 🔥🔥Official Repository for Anti-UAV🔥🔥. "Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System". (arXiv 2023)
- COWC : "A large contextual dataset for classification, detection and counting of cars with deep learning". (ECCV 2016)
- RSOD : "Accurate object localization in remote sensing images based on convolutional neural networks". (IEEE TGRS 2017)
- LEVIR : "Random access memories: A new paradigm for target detection in high resolution aerial remote sensing images". (IEEE Transactions on Image Processing 2017)
- LEVIR-Ship : "A Degraded Reconstruction Enhancement-based Method for Tiny Ship Detection in Remote Sensing Images with A New Large-scale Dataset". (IEEE TGRS 2022)
- MASATI : "Automatic ship classification from optical aerial images with convolutional neural networks". (Remote Sensing 2018)
- xView : "xView: Objects in Context in Overhead Imagery". (arXiv 2018)
- DOTA : "DOTA: A Large-Scale Dataset for Object Detection in Aerial Images". (CVPR 2018). "Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges". (IEEE TPAMI 2021).
- ITCVD : "Deep Learning for Vehicle Detection in Aerial Images". (IEEE ICIP 2018)
- Bridge Dataset : "A Tool for Bridge Detection in Major Infrastructure Works Using Satellite Images". (IEEE ICIP 2018)
- DIOR : "Object detection in optical remote sensing images: A survey and a new benchmark". (ISPRS 2020)
- PESMOD : "UAV Images Dataset for Moving Object Detection from Moving Cameras". (arXiv 2021)
- AI-TOD : "Tiny Object Detection in Aerial Images". (IEEE ICPR 2021)
- RsCarData : "DSFNet: Dynamic and Static Fusion Network for Moving Object Detection in Satellite Videos". (IEEE GRSL 2021)
- VISO : "Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark". (IEEE TGRS 2021)
- VisDrone : "Detection and Tracking Meet Drones Challenge". (IEEE TPAMI 2021)
- FAIR1M : "FAIR1M: A benchmark dataset for fine-grained object recognition in high-resolution remote sensing imagery". (ISPRS 2021)
- SeaDronesSee : "SeaDronesSee: A Maritime Benchmark for Detecting Humans in Open Water". (WACV 2022)
- NightOwls : "NightOwls: A Pedestrians at Night Dataset". (ACCV 2018).
- ExDark : "Getting to know low-light images with the exclusively dark dataset". (CVIU 2019). "Low-light image enhancement using Gaussian Process for features retrieval". (Signal Processing: Image Communication, 2019).
- DARK FACE : DARK FACE: Face Detection in Low Light Condition. "Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study". (IEEE Transactions on Image Processing 2020).
- SCUT_FIR_Pedestrian_Dataset : "Benchmarking a large-scale FIR dataset for on-road pedestrian detection". (Infrared Physics & Technology, 2019)
- NUDT-SIRST : "Dense Nested Attention Network for Infrared Small Target Detection". (arXiv 2021)
- SIRST : "Asymmetric Contextual Modulation for Infrared Small Target Detection". (WACV 2021)
- SNL VideoSAR : "Developments in SAR and IFSAR systems and technologies at Sandia National Laboratories". (IEEE Aerospace Conference Proceedings, 2003)
- MSTAR : MSTAR public dataset. "Object recognition results using MSTAR synthetic aperture radar data". (IEEE CVBVS 2000)
- OpenSARShip : "OpenSARShip: A Dataset Dedicated to Sentinel-1 Ship Interpretation". (IEEE JSTAEORS 2017)
- OpenSARShip 2.0 : "OpenSARShip 2.0: A large-volume dataset for deeper interpretation of ship targets in Sentinel-1 imagery". (IEEE BIGSARDATA 2017)
- SSDD : "Ship detection in SAR images based on an improved faster R-CNN". (IEEE BIGSARDATA 2017). "A deep-learning-based SAR image ship detection dataset and performance analysis" (original title: 基于深度学习的SAR图像舰船检测数据集及性能分析). (5th China High Resolution Earth Observation Conference, 2018)
- AIR-SARShip : "High-Resolution SAR Ship Detection Dataset 2.0" (original title: 高分辨率SAR舰船检测数据集-2.0). "AIR-SARShip-1.0: High-resolution SAR Ship Detection Dataset". (Journal of Radars, 2019)
- SAR-Ship-Dataset : "A SAR Dataset of Ship Detection for Deep Learning under Complex Backgrounds". (Remote Sensing, 2019)
- OpenSARUrban : "OpenSARUrban: A Sentinel-1 SAR Image Dataset for Urban Interpretation". (IEEE JSTAEORS 2020)
- HRSID : "HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation". (IEEE Access 2020)
- FUSAR-Ship : High-resolution ship dataset FUSAR-Ship1.0. (Journal of Radars). "FUSAR-Ship: building a high-resolution SAR-AIS matchup dataset of Gaofen-3 for ship detection and recognition". (Science China Information Sciences, 2020)
- Official-SSDD : "SAR Ship Detection Dataset (SSDD): Official Release and Comprehensive Data Analysis". (Remote Sensing, 2021)
- FLIR_ADAS : Teledyne FLIR Free ADAS Thermal Dataset v2.
- VEDAI : "Vehicle Detection in Aerial Imagery: A small target detection benchmark". (Journal of Visual Communication and Image Representation 2015)
- KAIST_rgbt : "Multispectral Pedestrian Detection: Benchmark Dataset and Baseline". (CVPR 2015)
- TNO : "The TNO multiband image data collection". (Data in brief, 2017)
- MFNet : MFNet-pytorch, image semantic segmentation using RGB-Thermal images. "MFNet: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes". (IROS 2017). (MFNet Dataset : Multi-spectral Object Detection and Semantic Segmentation Datasets)
- LLVIP : "LLVIP: A Visible-Infrared Paired Dataset for Low-Light Vision". (ICCV 2021)
- MSRS : MSRS: Multi-Spectral Road Scenarios for Practical Infrared and Visible Image Fusion. "PIAFusion: A progressive infrared and visible image fusion network based on illumination aware". (Information Fusion, 2022)
- TarDAL : "Target-Aware Dual Adversarial Learning and a Multi-Scenario Multi-Modality Benchmark To Fuse Infrared and Visible for Object Detection". (CVPR 2022). (M3FD Dataset)
- DroneVehicle : "Drone-based RGB-Infrared Cross-Modality Vehicle Detection via Uncertainty-Aware Learning". (IEEE TCSVT 2022)
- Objectron : "Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations". (CVPR, 2021)
- OpenCOOD|OPV2V : OpenCOOD is an Open COOperative Detection framework for autonomous driving. It is also the official implementation of the ICRA 2022 paper OPV2V. "OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication". (ICRA, 2022). mobility-lab.seas.ucla.edu/opv2v/
- CoBEVT : "CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers". (CoRL, 2022).
- Where2comm : "Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps". (NeurIPS, 2022).
- PJLab-ADG/LiDARSimLib-and-Placement-Evaluation : "Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library". (ICRA, 2023).
- CoAlign : "Robust Collaborative 3D Object Detection in Presence of Pose Errors". (ICRA, 2023).
- V2V4Real : "V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception". (CVPR, 2023).
- V2X-ViT|V2XSet : "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer". (ECCV, 2022).
- DAIR-V2X : "DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection". (CVPR, 2022). Release announcement: the world's first vehicle-infrastructure cooperative autonomous driving dataset.
- V2X-Seq : "V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting". (CVPR, 2023). Release announcement: the world's first large-scale sequential vehicle-infrastructure cooperative autonomous driving dataset.
- VideoLQ : "Investigating Tradeoffs in Real-World Video Super-Resolution". (CVPR, 2022)
- WIDER FACE : "WIDER FACE: A Face Detection Benchmark". (CVPR 2016)
- UFDD : Unconstrained Face Detection Dataset (UFDD). "Pushing the Limits of Unconstrained Face Detection: a Challenge Dataset and Baseline Results". (IEEE BTAS 2018)
- LFW : Labeled Faces in the Wild (LFW). "Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments". (Workshop on Faces in 'Real-Life' Images: Detection, Alignment, and Recognition, 2008)
- YouTube Faces (YTF) : "Face recognition in unconstrained videos with matched background similarity". (CVPR 2011)
- CASIA-WebFace : "Learning Face Representation from Scratch". (arXiv 2014)
- IJB-A : "Pushing the Frontiers of Unconstrained Face Detection and Recognition: IARPA Janus Benchmark A". (CVPR 2015)
- MS-Celeb-1M : "MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition". (ECCV 2016)
- MegaFace : "The MegaFace Benchmark: 1 Million Faces for Recognition at Scale". (CVPR 2016)
- UMDFaces : "UMDFaces: An annotated face dataset for training deep networks". (IJCB 2017)
- IJB-C : "IARPA Janus Benchmark - C: Face Dataset and Protocol". (ICB 2018)
- VGGFace2 : "VGGFace2: A Dataset for Recognising Faces across Pose and Age". (FG 2018)
- WeChat official account 「PandaCVer」
- WeChat official account 「自动驾驶之心」
- WeChat official account 「整数智能AI研究院」