Skip to main content

Advanced Auto Labeling Solution with Added Features

Project description

Auto-Training
Auto-Labeling
Detect Anything
Segment Anything
Promptable Concept Grounding
VQA
Chatbot
Image Classifier
OCR

🥳 What's New

  • 2026-04-26: Added client-side (local ONNX) support for SAM 3 text-grounded segmentation.
  • 2026-04-25: Added support for the GeCo2 object counting model.
  • 2026-04-19: Added annotation check status workflow for reviewing labeled images.
  • 2026-04-19: Added support for TensorRT backend inference for YOLO models.
  • 2026-04-18: Added support exporting annotation visualizations as images or videos.
  • 2026-04-18: Added support for PaddleOCR document parsing and intelligent text recognition annotation panel.
  • 2026-04-01: Added support for Japanese and Korean UI languages (ja_JP, ko_KR).
  • 2026-03-22: Added support for built-in Settings in the GUI for directly adjusting common options.
  • 2026-03-10: Added support for 3D Cuboid shape annotation from rectangle.
  • 2026-03-01: Completed the PyQt5-to-PyQt6 upgrade refactor (Beta release), along with additional feature fixes and optimizations.
  • For more details, please refer to the CHANGELOG

X-AnyLabeling

X-AnyLabeling is a powerful annotation tool that integrates an AI engine for fast and automatic labeling. It's designed for multi-modal data engineers, offering industrial-grade solutions for complex tasks.

Also, we highly recommend trying out X-AnyLabeling-Server, a simple, lightweight, and extensible framework that enables remote inference capabilities for X-AnyLabeling.

Features

  • Supports remote inference service.
  • Processes both images and videos.
  • Supports flexible backends including ONNX Runtime, TensorRT, and OpenCV DNN.
  • Supports UI localization in English, Chinese, Japanese, and Korean.
  • Allows custom models and secondary development.
  • Supports one-click inference for all images in the current task.
  • Supports import/export for formats like COCO, VOC, YOLO, DOTA, MOT, MASK, PPOCR, MMGD, VLM-R1, ShareGPT.
  • Handles tasks like classification, detection, segmentation, caption, rotation, tracking, estimation, ocr, vqa, grounding, document parsing and so on.
  • Supports diverse annotation styles: polygons, rectangles, cuboids, rotated boxes, quadrilaterals, circles, lines, line strips, points, and annotations for text detection, recognition, and KIE.

Model library

Task Category Supported Models
🖼️ Image Classification YOLOv5-Cls, YOLOv8-Cls, YOLO11-Cls, InternImage, PULC
🎯 Object Detection YOLOv5/6/7/8/9/10, YOLO11/12/26, YOLOX, YOLO-NAS, D-FINE, DAMO-YOLO, Gold_YOLO, RT-DETR, RF-DETR, DEIMv2
🖌️ Instance Segmentation YOLOv5-Seg, YOLOv8-Seg, YOLO11-Seg, YOLO26-Seg, Hyper-YOLO-Seg, RF-DETR-Seg
🏃 Pose Estimation YOLOv8-Pose, YOLO11-Pose, YOLO26-Pose, DWPose, RTMO
👣 Tracking Bot-SORT, ByteTrack, SAM2/3-Video
🔄 Rotated Object Detection YOLOv5-Obb, YOLOv8-Obb, YOLO11-Obb, YOLO26-Obb
📏 Depth Estimation Depth Anything
🧩 Segment Anything SAM 1/2/3, SAM-HQ, SAM-Med2D, EdgeSAM, EfficientViT-SAM, MobileSAM
✂️ Image Matting RMBG 1.4/2.0
💡 Proposal UPN
🏷️ Tagging RAM, RAM++
📄 OCR PP-OCRv4, PP-OCRv5
🧾 Layout Analysis PP-DocLayoutV3
📑 Document Parsing PaddleOCR-VL, PaddleOCR-VL-1.5
🗣️ Vision Foundation Models Rex-Omni, Florence2
👁️ Vision Language Models Qwen3-VL, Gemini, ChatGPT, GLM
🛣️ Lane Detection CLRNet
🔢 Object Counting CountGD, GeCO, GeCo2
📍 Grounding Grounding DINO, YOLO-World, YOLOE
📚 Other 👉 model_zoo 👈

Docs

  1. Remote Inference Service
  2. Installation & Quickstart
  3. Usage
  4. Command Line Interface
  5. Customize a model
  6. Chatbot
  7. VQA
  8. Multi-class Image Classifier
  9. Document Parsing and Intelligent Text Recognition

Examples

Contribute

We believe in open collaboration! X‑AnyLabeling continues to grow with the support of the community. Whether you're fixing bugs, improving documentation, or adding new features, your contributions make a real impact.

To get started, please read our Contributing Guide and make sure to agree to the Contributor License Agreement (CLA) before submitting a pull request.

If you find this project helpful, please consider giving it a ⭐️ star! Have questions or suggestions? Open an issue or email us at cv_hub@163.com.

A huge thank you 🙏 to everyone helping to make X‑AnyLabeling better.

License

This project is licensed under the GPL-3.0 license and is completely open source and free. The original intention is to enable more developers, researchers, and enterprises to conveniently use this AI application platform, promoting the development of the entire industry. We encourage everyone to use it freely (including commercial use), and you can also add features based on this project and commercialize it, but you must retain the brand identity and indicate the source project address.

Additionally, to understand the ecosystem and usage of X-AnyLabeling, if you use this project for academic, research, teaching, or enterprise purposes, please fill out the registration form. This registration is only for statistical purposes and will not incur any fees. We will strictly keep all information confidential.

X-AnyLabeling is independently developed and maintained by an individual. If this project has been helpful to you, we welcome your support through the donation links below to help sustain the project's continued development. Your support is the greatest encouragement! If you have any questions about the project or would like to collaborate, please feel free to contact via WeChat: ww10874 or email provided above.

Sponsors

Acknowledgement

I extend my heartfelt thanks to the developers and contributors of AnyLabeling, LabelMe, LabelImg, roLabelImg, PPOCRLabel and CVAT, whose work has been crucial to the success of this project.

Citing

If you use this software in your research, please cite it as below:

@misc{X-AnyLabeling,
  year = {2023},
  author = {Wei Wang},
  publisher = {Github},
  organization = {CVHub},
  journal = {Github repository},
  title = {Advanced Auto Labeling Solution with Added Features},
  howpublished = {\url{https://github.com/CVHub520/X-AnyLabeling}}
}

Star History Chart

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

x_anylabeling_cvhub-4.0.0b6.tar.gz (1.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

x_anylabeling_cvhub-4.0.0b6-py3-none-any.whl (1.8 MB view details)

Uploaded Python 3

File details

Details for the file x_anylabeling_cvhub-4.0.0b6.tar.gz.

File metadata

  • Download URL: x_anylabeling_cvhub-4.0.0b6.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for x_anylabeling_cvhub-4.0.0b6.tar.gz
Algorithm Hash digest
SHA256 71b7f65a19e9a3623c7b4513a2e002b2e3986da5736b5d7331ae99c7be6621f3
MD5 620d5d71dbe614074b67fd4dc8733345
BLAKE2b-256 fc09d0870cd3e37f94b4b5bfb31fe9e40e815ef3c78cadd78db6d5156695927f

See more details on using hashes here.

File details

Details for the file x_anylabeling_cvhub-4.0.0b6-py3-none-any.whl.

File metadata

File hashes

Hashes for x_anylabeling_cvhub-4.0.0b6-py3-none-any.whl
Algorithm Hash digest
SHA256 e2b89a974037175619e0acca08e3c613595b4cf14e9efd09d254d6934d364e95
MD5 93f24605dbe9709f93aba4f7f5a05f77
BLAKE2b-256 ffb56a1c598053e3eeb5c80ddade9401c97739a5984cbfa0b463908491da5b9f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page