Data and code to support name-based gender-classification in scientific research
Project description
nom quam gender
A simple package containing data and a few functions to support name-based gender-classification in scientific research.
Installation
pip install nomquamgender
Usage
import nomquamgender as nqg
nqg.annotate('clara')
given | used | sources | counts | p(f) | |
---|---|---|---|---|---|
0 | clara | clara | 31 | 492337 | 0.992 |
nqg.annotate(['András','Jean','Mitsuko'])
given | used | sources | counts | p(f) | |
---|---|---|---|---|---|
0 | András | andras | 24 | 13010 | 0.001 |
1 | Jean | jean | 31 | 2525377 | 0.477 |
2 | Mitsuko | mitsuko | 14 | 925 | 0.981 |
import pandas as pd
name_data = nqg.dump()
df = pd.DataFrame([(n,c,p) for n,(s,c,p) in name_data.items()],
columns = ['name','counts','p(f)']).set_index('name')
df.sort_values(by='counts',ascending=False).head(10)
name | counts | p(f) |
---|---|---|
john | 5.73712e+06 | 0.001 |
robert | 5.71833e+06 | 0 |
james | 5.71246e+06 | 0.001 |
michael | 5.04746e+06 | 0.001 |
david | 4.88524e+06 | 0.001 |
william | 4.6944e+06 | 0 |
mary | 4.5431e+06 | 0.98 |
joseph | 3.39841e+06 | 0.002 |
daniel | 3.2188e+06 | 0.016 |
thomas | 3.17053e+06 | 0.001 |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
nomquamgender-0.0.4.tar.gz
(5.5 MB
view hashes)
Built Distributions
Close
Hashes for nomquamgender-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e279211d43740d77393c39f59eb3133c38dfd3d8612d24e455847f08aa672f17 |
|
MD5 | ae1f70ed0f0e7a6f15f2d035b19816fe |
|
BLAKE2b-256 | 602089e717a3ceb87b322c26f49e7c7350b29057abb84b28bce5b19d86a2b40a |
Close
Hashes for nomquamgender-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c7b6947d640451dbc58b9ea0a7f21a1c4500229202314404d066f2fa15dbe48 |
|
MD5 | 90571579d4420bb20000740933693807 |
|
BLAKE2b-256 | 706c0648f3ed279ab291bd3ff88cd8e21c934d891230b9421af1b7c04b027e91 |