Data and code to support name-based gender-classification in scientific research
Project description
nom quam gender
A simple package containing data and a few functions to support name-based gender-classification in scientific research.
Installation
pip install nomquamgender
Usage
import nomquamgender as nqg
nqg.annotate('clara')
given | used | sources | counts | p(f) | |
---|---|---|---|---|---|
0 | clara | clara | 31 | 492337 | 0.992 |
nqg.annotate(['András','Jean','Mitsuko'])
given | used | sources | counts | p(f) | |
---|---|---|---|---|---|
0 | András | andras | 24 | 13010 | 0.001 |
1 | Jean | jean | 31 | 2525377 | 0.477 |
2 | Mitsuko | mitsuko | 14 | 925 | 0.981 |
import pandas as pd
name_data = nqg.dump()
df = pd.DataFrame([(n,c,p) for n,(s,c,p) in name_data.items()],
columns = ['name','counts','p(f)']).set_index('name')
df.sort_values(by='counts',ascending=False).head(10)
name | counts | p(f) |
---|---|---|
john | 5.73712e+06 | 0.001 |
robert | 5.71833e+06 | 0 |
james | 5.71246e+06 | 0.001 |
michael | 5.04746e+06 | 0.001 |
david | 4.88524e+06 | 0.001 |
william | 4.6944e+06 | 0 |
mary | 4.5431e+06 | 0.98 |
joseph | 3.39841e+06 | 0.002 |
daniel | 3.2188e+06 | 0.016 |
thomas | 3.17053e+06 | 0.001 |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
nomquamgender-0.0.3.tar.gz
(5.5 MB
view hashes)
Built Distribution
Close
Hashes for nomquamgender-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 77f2b421f409932d5db613bff60b16903d036a290f8c151bc928f375df829fbf |
|
MD5 | b2c2e31371de728f1548b6ee7c144445 |
|
BLAKE2b-256 | d3675f5b8ecccf98b48a954e3a39e935b13ffd565f4221783d05714aff04acc0 |