Skip to main content

From unstructured data to structured data.

Project description

Roseta

Pypi MIT License stars

🧪 从「非结构化数据」到「结构化数据」!

如果需求多,就开始做,否则就慢慢推进~

  1. 欢迎「提需求」🎉;
  2. 欢迎「供数据」🎉。

1 功能

  • 转化「长度」描述;
    • 身高
    • 胸围
    • 腰围
    • 臀围
    • 肩宽
  • 转化「重量」描述;
  • 转化「城市」描述;
  • 转化「年龄」描述;
  • 转化「罩杯」描述;
  • 转化「日期」描述;
  • 转化「地点」描述。

2 安装

pip install roseta -U

# or
git clone https://github.com/Ailln/roseta.git
cd roseta && python setup.py install

3 使用

from roseta import trans

## 转化「长度」
num, unit = trans("一米八")
# (180, 'cm')
num, unit = trans("1.8米", cls="length", unit="cm")
# (180.0, 'cm')
num, unit = trans("180厘米", unit="m")
# (1.8, 'm')
num, unit = trans("两尺")
# (66.67, 'cm')
num, unit = trans("2.1尺", cls="length")
# (70.0, 'cm')
num, unit = trans("2尺", cls="length", unit="m")
# (0.6667, 'm')

## 转化「重量」
num, unit = trans("一斤八")
# (0.9, 'kg')
num, unit = trans("1.8公斤", cls="weight", unit="kg")
# (1.8, 'kg')
num, unit = trans("180kg", unit="g")
# (180000, 'g')

## 转化「城市」
text, unit = trans("杭州市")
# ('杭州市', '市')
text, unit = trans("杭州", cls="city")
# ('杭州市', '市')
text, unit = trans("杭州市", unit="省")
# ('浙江省杭州市', '省')

## 转化「年龄」
num, unit = trans("一岁")
# (1, 'year')
num, unit = trans("二个月", cls="age")
# (0.2, 'year')
num, unit = trans("二个月", unit="year")
# (0.2, 'year')

## 转化「罩杯」
# 类别不可以省略
num, unit = trans("c36", cls="cup")
# ([96, 80], 'cm'),96 为上胸围,80 为下胸围
num, unit = trans("九五F", cls="cup", unit="m")
# ([1.17, 0.95], 'm')

4 许可

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

roseta-0.1.9.tar.gz (16.6 kB view hashes)

Uploaded Source

Built Distribution

roseta-0.1.9-py3-none-any.whl (23.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page