Skip to main content

Advanced Auto Labeling Solution with Added Features

Project description

Auto-Training
Auto-Labeling
Detect Anything
Segment Anything
Promptable Concept Grounding
VQA
Chatbot
Image Classifier

🥳 What's New

  • Added PP-DocLayoutV3, supporting multi-point localization (quadrilaterals/polygons) and logical reading order prediction
  • Added PaddleOCR-VL-1.5, supporting OCR, table recognition, formula recognition, chart recognition, text spotting, and seal recognition
  • Added YOLO26 series models for object detection, instance segmentation, pose estimation, and rotated object detection
  • Added Compare View feature for split-screen image comparison (ideal for infrared/visible fusion, mask preview, and super-resolution) [docs]
  • Added multimodal large language model Rex-Omni with support for grounding, keypoints, referring pointing, OCR, and visual prompting tasks [docs]
  • Added powerful file search feature upporting text search, regular expression search, and attribute-based filtering [docs]
  • Added semi-transparent mask rendering for polygon, rectangle, rotation, and circle shapes with toggle support (Ctrl+M)
  • Added one-click text and visual prompt video detection and segmentation tracking based on Segment Anything 3 [docs]
  • For more details, please refer to the CHANGELOG

X-AnyLabeling

X-AnyLabeling is a powerful annotation tool that integrates an AI engine for fast and automatic labeling. It's designed for multi-modal data engineers, offering industrial-grade solutions for complex tasks.

Also, we highly recommend trying out X-AnyLabeling-Server, a simple, lightweight, and extensible framework that enables remote inference capabilities for X-AnyLabeling.

Features

  • Supports remote inference service.
  • Processes both images and videos.
  • Accelerates inference with GPU support.
  • Allows custom models and secondary development.
  • Supports one-click inference for all images in the current task.
  • Supports import/export for formats like COCO, VOC, YOLO, DOTA, MOT, MASK, PPOCR, MMGD, VLM-R1.
  • Handles tasks like classification, detection, segmentation, caption, rotation, tracking, estimation, ocr, vqa, grounding and so on.
  • Supports diverse annotation styles: polygons, rectangles, rotated boxes, circles, lines, points, and annotations for text detection, recognition, and KIE.

Model library

Task Category Supported Models
🖼️ Image Classification YOLOv5-Cls, YOLOv8-Cls, YOLO11-Cls, InternImage, PULC
🎯 Object Detection YOLOv5/6/7/8/9/10, YOLO11/12/26, YOLOX, YOLO-NAS, D-FINE, DAMO-YOLO, Gold_YOLO, RT-DETR, RF-DETR, DEIMv2
🖌️ Instance Segmentation YOLOv5-Seg, YOLOv8-Seg, YOLO11-Seg, YOLO26-Seg, Hyper-YOLO-Seg, RF-DETR-Seg
🏃 Pose Estimation YOLOv8-Pose, YOLO11-Pose, YOLO26-Pose, DWPose, RTMO
👣 Tracking Bot-SORT, ByteTrack, SAM2/3-Video
🔄 Rotated Object Detection YOLOv5-Obb, YOLOv8-Obb, YOLO11-Obb, YOLO26-Obb
📏 Depth Estimation Depth Anything
🧩 Segment Anything SAM 1/2/3, SAM-HQ, SAM-Med2D, EdgeSAM, EfficientViT-SAM, MobileSAM
✂️ Image Matting RMBG 1.4/2.0
💡 Proposal UPN
🏷️ Tagging RAM, RAM++
📄 OCR PP-OCRv4, PP-OCRv5, PP-DocLayoutV3, PaddleOCR-VL-1.5
🗣️ Vision Foundation Models Rex-Omni, Florence2
👁️ Vision Language Models Qwen3-VL, Gemini, ChatGPT
🛣️ Land Detection CLRNet
📍 Grounding CountGD, GeCO, Grounding DINO, YOLO-World, YOLOE
📚 Other 👉 model_zoo 👈

Docs

  1. Remote Inference Service
  2. Installation & Quickstart
  3. Usage
  4. Command Line Interface
  5. Customize a model
  6. Chatbot
  7. VQA
  8. Multi-class Image Classifier

Examples

Contribute

We believe in open collaboration! X‑AnyLabeling continues to grow with the support of the community. Whether you're fixing bugs, improving documentation, or adding new features, your contributions make a real impact.

To get started, please read our Contributing Guide and make sure to agree to the Contributor License Agreement (CLA) before submitting a pull request.

If you find this project helpful, please consider giving it a ⭐️ star! Have questions or suggestions? Open an issue or email us at cv_hub@163.com.

A huge thank you 🙏 to everyone helping to make X‑AnyLabeling better.

License

This project is licensed under the GPL-3.0 license and is completely open source and free. The original intention is to enable more developers, researchers, and enterprises to conveniently use this AI application platform, promoting the development of the entire industry. We encourage everyone to use it freely (including commercial use), and you can also add features based on this project and commercialize it, but you must retain the brand identity and indicate the source project address.

Additionally, to understand the ecosystem and usage of X-AnyLabeling, if you use this project for academic, research, teaching, or enterprise purposes, please fill out the registration form. This registration is only for statistical purposes and will not incur any fees. We will strictly keep all information confidential.

X-AnyLabeling is independently developed and maintained by an individual. If this project has been helpful to you, we welcome your support through the donation links below to help sustain the project's continued development. Your support is the greatest encouragement! If you have any questions about the project or would like to collaborate, please feel free to contact via WeChat: ww10874 or email provided above.

Sponsors

Acknowledgement

I extend my heartfelt thanks to the developers and contributors of AnyLabeling, LabelMe, LabelImg, roLabelImg, PPOCRLabel and CVAT, whose work has been crucial to the success of this project.

Citing

If you use this software in your research, please cite it as below:

@misc{X-AnyLabeling,
  year = {2023},
  author = {Wei Wang},
  publisher = {Github},
  organization = {CVHub},
  journal = {Github repository},
  title = {Advanced Auto Labeling Solution with Added Features},
  howpublished = {\url{https://github.com/CVHub520/X-AnyLabeling}}
}

Star History Chart

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

x_anylabeling_cvhub-3.3.9.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

x_anylabeling_cvhub-3.3.9-py3-none-any.whl (1.5 MB view details)

Uploaded Python 3

File details

Details for the file x_anylabeling_cvhub-3.3.9.tar.gz.

File metadata

  • Download URL: x_anylabeling_cvhub-3.3.9.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for x_anylabeling_cvhub-3.3.9.tar.gz
Algorithm Hash digest
SHA256 a04f12a41b547ffd6fef0a10bc0f5f68eee8addc790a6812aaa19ef7eb56d331
MD5 43129036369820ecbad860e8e176c14a
BLAKE2b-256 944ae204070b2f0a2a5dfac3eab702313b6d232ed339e41317152cce4fede2ff

See more details on using hashes here.

File details

Details for the file x_anylabeling_cvhub-3.3.9-py3-none-any.whl.

File metadata

File hashes

Hashes for x_anylabeling_cvhub-3.3.9-py3-none-any.whl
Algorithm Hash digest
SHA256 f8179c2bcbe22a72cad000aa286f820becffa9a15d8b726b0437e61f11ad2a7a
MD5 3e2689849e08745093c520208908b714
BLAKE2b-256 1c0e3f362658f1af0435708f2d5b5c6fcd00dd72d0040a04a2e6999db954ccf0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page