Skip to main content

Advanced Auto Labeling Solution with Added Features

Project description

Auto-Training
Auto-Labeling
Detect Anything
Segment Anything
Promptable Concept Grounding
VQA
Chatbot
Image Classifier
OCR

🥳 What's New

  • 2026-04-26: Added client-side (local ONNX) support for SAM 3 text-grounded segmentation.
  • 2026-04-25: Added support for the GeCo2 object counting model.
  • 2026-04-19: Added annotation check status workflow for reviewing labeled images.
  • 2026-04-19: Added support for TensorRT backend inference for YOLO models.
  • 2026-04-18: Added support exporting annotation visualizations as images or videos.
  • 2026-04-18: Added support for PaddleOCR document parsing and intelligent text recognition annotation panel.
  • 2026-04-01: Added support for Japanese and Korean UI languages (ja_JP, ko_KR).
  • 2026-03-22: Added support for built-in Settings in the GUI for directly adjusting common options.
  • 2026-03-10: Added support for 3D Cuboid shape annotation from rectangle.
  • 2026-03-01: Completed the PyQt5-to-PyQt6 upgrade refactor (Beta release), along with additional feature fixes and optimizations.
  • For more details, please refer to the CHANGELOG

X-AnyLabeling

X-AnyLabeling is a powerful annotation tool that integrates an AI engine for fast and automatic labeling. It's designed for multi-modal data engineers, offering industrial-grade solutions for complex tasks.

Also, we highly recommend trying out X-AnyLabeling-Server, a simple, lightweight, and extensible framework that enables remote inference capabilities for X-AnyLabeling.

Features

  • Supports remote inference service.
  • Processes both images and videos.
  • Supports flexible backends including ONNX Runtime, TensorRT, and OpenCV DNN.
  • Supports UI localization in English, Chinese, Japanese, and Korean.
  • Allows custom models and secondary development.
  • Supports one-click inference for all images in the current task.
  • Supports import/export for formats like COCO, VOC, YOLO, DOTA, MOT, MASK, PPOCR, MMGD, VLM-R1, ShareGPT.
  • Handles tasks like classification, detection, segmentation, caption, rotation, tracking, estimation, ocr, vqa, grounding, document parsing and so on.
  • Supports diverse annotation styles: polygons, rectangles, cuboids, rotated boxes, quadrilaterals, circles, lines, line strips, points, and annotations for text detection, recognition, and KIE.

Model library

Task Category Supported Models
🖼️ Image Classification YOLOv5-Cls, YOLOv8-Cls, YOLO11-Cls, InternImage, PULC
🎯 Object Detection YOLOv5/6/7/8/9/10, YOLO11/12/26, YOLOX, YOLO-NAS, D-FINE, DAMO-YOLO, Gold_YOLO, RT-DETR, RF-DETR, DEIMv2
🖌️ Instance Segmentation YOLOv5-Seg, YOLOv8-Seg, YOLO11-Seg, YOLO26-Seg, Hyper-YOLO-Seg, RF-DETR-Seg
🏃 Pose Estimation YOLOv8-Pose, YOLO11-Pose, YOLO26-Pose, DWPose, RTMO
👣 Tracking Bot-SORT, ByteTrack, SAM2/3-Video
🔄 Rotated Object Detection YOLOv5-Obb, YOLOv8-Obb, YOLO11-Obb, YOLO26-Obb
📏 Depth Estimation Depth Anything
🧩 Segment Anything SAM 1/2/3, SAM-HQ, SAM-Med2D, EdgeSAM, EfficientViT-SAM, MobileSAM
✂️ Image Matting RMBG 1.4/2.0
💡 Proposal UPN
🏷️ Tagging RAM, RAM++
📄 OCR PP-OCRv4, PP-OCRv5
🧾 Layout Analysis PP-DocLayoutV3
📑 Document Parsing PaddleOCR-VL, PaddleOCR-VL-1.5
🗣️ Vision Foundation Models Rex-Omni, Florence2
👁️ Vision Language Models Qwen3-VL, Gemini, ChatGPT, GLM
🛣️ Lane Detection CLRNet
🔢 Object Counting CountGD, GeCO, GeCo2
📍 Grounding Grounding DINO, YOLO-World, YOLOE
📚 Other 👉 model_zoo 👈

Docs

  1. Remote Inference Service
  2. Installation & Quickstart
  3. Usage
  4. Command Line Interface
  5. Customize a model
  6. Chatbot
  7. VQA
  8. Multi-class Image Classifier
  9. Document Parsing and Intelligent Text Recognition

Examples

Contribute

We believe in open collaboration! X‑AnyLabeling continues to grow with the support of the community. Whether you're fixing bugs, improving documentation, or adding new features, your contributions make a real impact.

To get started, please read our Contributing Guide and make sure to agree to the Contributor License Agreement (CLA) before submitting a pull request.

If you find this project helpful, please consider giving it a ⭐️ star! Have questions or suggestions? Open an issue or email us at cv_hub@163.com.

A huge thank you 🙏 to everyone helping to make X‑AnyLabeling better.

License

This project is licensed under the GPL-3.0 license and is completely open source and free. The original intention is to enable more developers, researchers, and enterprises to conveniently use this AI application platform, promoting the development of the entire industry. We encourage everyone to use it freely (including commercial use), and you can also add features based on this project and commercialize it, but you must retain the brand identity and indicate the source project address.

Additionally, to understand the ecosystem and usage of X-AnyLabeling, if you use this project for academic, research, teaching, or enterprise purposes, please fill out the registration form. This registration is only for statistical purposes and will not incur any fees. We will strictly keep all information confidential.

X-AnyLabeling is independently developed and maintained by an individual. If this project has been helpful to you, we welcome your support through the donation links below to help sustain the project's continued development. Your support is the greatest encouragement! If you have any questions about the project or would like to collaborate, please feel free to contact via WeChat: ww10874 or email provided above.

Sponsors

Acknowledgement

I extend my heartfelt thanks to the developers and contributors of AnyLabeling, LabelMe, LabelImg, roLabelImg, PPOCRLabel and CVAT, whose work has been crucial to the success of this project.

Citing

If you use this software in your research, please cite it as below:

@misc{X-AnyLabeling,
  year = {2023},
  author = {Wei Wang},
  publisher = {Github},
  organization = {CVHub},
  journal = {Github repository},
  title = {Advanced Auto Labeling Solution with Added Features},
  howpublished = {\url{https://github.com/CVHub520/X-AnyLabeling}}
}

Star History Chart

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

x_anylabeling_cvhub-4.0.0b5.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

x_anylabeling_cvhub-4.0.0b5-py3-none-any.whl (1.8 MB view details)

Uploaded Python 3

File details

Details for the file x_anylabeling_cvhub-4.0.0b5.tar.gz.

File metadata

  • Download URL: x_anylabeling_cvhub-4.0.0b5.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for x_anylabeling_cvhub-4.0.0b5.tar.gz
Algorithm Hash digest
SHA256 61260415e6379776ae06577cc8002d52d97d8da8a95909a4d12383922cf291bd
MD5 c84be1510a3b05df08d9726649024bea
BLAKE2b-256 63c8560df9725954056c4c3258672d775f5e201b347763338cf30a0258356c0b

See more details on using hashes here.

File details

Details for the file x_anylabeling_cvhub-4.0.0b5-py3-none-any.whl.

File metadata

File hashes

Hashes for x_anylabeling_cvhub-4.0.0b5-py3-none-any.whl
Algorithm Hash digest
SHA256 038a9bc613a74f5cde394b3f7e83e44fc60a47c1c7350b79113098ea12a15085
MD5 7cb3b3f0ad875d2af7fa8bc980640d7d
BLAKE2b-256 3651f2bfe78846c8744cdaea00d7daed788328dfc6aad4e8a4fa2768e44b47c9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page