Skip to main content

A package for parsing Vietnamese address

Project description

Vietnamese address standardizer - Bộ chuẩn hóa địa chỉ Việt Nam

A package for parsing Vietnamese address

Tính năng

  1. Xử lỹ những tên viết tắt thông dụng
  2. Sửa chính tả
  3. Sửa lỗi thứ tự tên đơn vị hành chính
  4. Thêm prefix (xã, huyện, tỉnh, ...)

Cài đặt qua PyPi

pip3 install vnaddress

Thử nghiệm

from vnaddress import VNAddressStandardizer

address = VNAddressStandardizer(raw_address = "Dicjh Vongj Haaju", comma_handle = True)
address.execute()

# output
# phường Dịch Vọng Hậu, quận Cầu Giấy, thành phố Hà Nội


address = VNAddressStandardizer(raw_address = "Dicjh Vongj Haaju, ", comma_handle = True, detail=True)
address.execute()

# output
# phường Dịch Vọng Hậu, quận Cầu Giấy, thành phố Hà Nội

更新pypi仓库

cd python
python setup.py sdist bdist_wheel
python -m twine upload dist/*

c++

clone到本地直接编译即可

cd text-clean
bazel build //:test

具体用法请参考test.cc

效果如下:

转换前:繁 體 字是smasd ❶ ❷ ❸ ❹  彩呗我 幹什麼 □ ■ ◇ ◆ − + ⑪ ⑫ ⑬  ⒍ ⒎ ⒏ ⒐ W,X  asd鬼东 西錯 鍼٩(๑ᵒ̴̶͈᷄ᗨᵒ̴̶͈᷅)و q🕓🕛,
转换后:繁体字是smasd1234彩呗我干什么+1112136789wxasd鬼东西错针q

转换前:神谕23 速度来各种老手+——++++++来个大法师+++
转换后:神谕23速度来各种老手+++来个大法师+++

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vnaddress-0.9.6.tar.gz (481.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vnaddress-0.9.6-py3-none-any.whl (527.0 kB view details)

Uploaded Python 3

File details

Details for the file vnaddress-0.9.6.tar.gz.

File metadata

  • Download URL: vnaddress-0.9.6.tar.gz
  • Upload date:
  • Size: 481.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.6.9

File hashes

Hashes for vnaddress-0.9.6.tar.gz
Algorithm Hash digest
SHA256 378523a301da911a4ee4370e1aa273577c4bc110a373ea5dec28c5275db9a77d
MD5 8b9e16dc26a2c106cadfb7225368e3b5
BLAKE2b-256 a12760e78af913465a465cbdad549a53bffaae8c9210ce2192bc8f8dfdc36705

See more details on using hashes here.

File details

Details for the file vnaddress-0.9.6-py3-none-any.whl.

File metadata

  • Download URL: vnaddress-0.9.6-py3-none-any.whl
  • Upload date:
  • Size: 527.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.6.9

File hashes

Hashes for vnaddress-0.9.6-py3-none-any.whl
Algorithm Hash digest
SHA256 b7dfb93f1377cad369ae822fde809ea9d2637be5bb0976ca7dec4e26a0421ec5
MD5 32776544ae26eec2bb5785cba1e2abb9
BLAKE2b-256 5065ca58940dc63467df6ccb9e8335482539efb4e80c54ab8fd92064f1b81c77

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page