Israeli baby names dataset (1949-2024) from CBS
Project description
babynamesil
Israeli baby names dataset (1949-2024) from the Central Bureau of Statistics (CBS).
This package provides easy access to comprehensive baby name statistics from Israel, including names given to at least 5 babies per year, categorized by demographic sector, sex, and year.
Installation
pip install babynamesil
Quick Start
import babynamesil
# Load the main dataset
df = babynamesil.load_data()
print(df.head())
Output:
sector year sex name n prop
0 Jewish 1949 F רחל 1362 0.038065
1 Jewish 1949 F אסתר 1344 0.037562
2 Jewish 1949 F שרה 1190 0.033258
3 Jewish 1949 F מרים 964 0.026942
4 Jewish 1949 F חנה 895 0.025013
Usage Examples
Find the most popular names in 2024
import babynamesil
df = babynamesil.load_data()
# Top 10 Jewish names in 2024
top_2024 = (
df[(df['year'] == 2024) & (df['sector'] == 'Jewish')]
.groupby('sex')
.apply(lambda x: x.nlargest(10, 'n')[['name', 'n']])
)
print(top_2024)
Track a name over time
import babynamesil
import matplotlib.pyplot as plt
df = babynamesil.load_data()
# Track the name "נועם" (Noam) over time
noam = df[(df['name'] == 'נועם') & (df['sector'] == 'Jewish')]
noam_pivot = noam.pivot(index='year', columns='sex', values='prop')
noam_pivot.plot(title='נועם - Popularity Over Time')
plt.ylabel('Proportion')
plt.show()
Get all-time totals
import babynamesil
totals = babynamesil.load_totals()
# Most popular names of all time (Jewish sector)
jewish_totals = totals[totals['sector'] == 'Jewish']
print(jewish_totals.groupby('sex').apply(lambda x: x.nlargest(5, 'total')))
Available Datasets
| Function | Description | Years | Rows |
|---|---|---|---|
load_data() |
Main baby names by year/sector/sex | 1949-2024 | ~160K |
load_totals() |
Aggregated totals by name | 1949-2024 | ~8K |
load_1948() |
Legacy 1948 data | 1948 | ~500 |
load_other() |
Archived "Other" sector | 1985-2021 | ~5K |
Data Structure
Main Dataset (load_data())
| Column | Type | Description |
|---|---|---|
sector |
str | "Jewish", "Muslim", "Christian-Arab", or "Druze" |
year |
int | Birth year (1949-2024) |
sex |
str | "M" (male) or "F" (female) |
name |
str | Baby name in Hebrew |
n |
int | Count of babies with this name |
prop |
float | Proportion within year/sector/sex (0-1) |
Totals Dataset (load_totals())
| Column | Type | Description |
|---|---|---|
sector |
str | Demographic sector |
sex |
str | "M" or "F" |
name |
str | Baby name in Hebrew |
total |
int | Total count across all years |
Data Source
All data is sourced from CBS (Israel Central Bureau of Statistics) Release 391/2025:
- Press release: השמות הפרטיים שניתנו לילידי 2024
- Data file: 11_25_391t1.xlsx
Related Projects
- R package: babynamesIL on CRAN
- Web app: babynames.lifshitz.xyz
License
CC0 1.0 Universal - This work is dedicated to the public domain.
Contributing
Contributions are welcome! Please open an issue or submit a pull request on GitHub.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file babynamesil-0.2.1.tar.gz.
File metadata
- Download URL: babynamesil-0.2.1.tar.gz
- Upload date:
- Size: 975.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1d5f48fa94059e74af944b3e37fd2d8d25f8e17816eb725544e49252c0ff4b3c
|
|
| MD5 |
109bfb348dc55a9115d17bf209abae30
|
|
| BLAKE2b-256 |
36f2f91bb900fd1332fe1e7df695bc72f3d35ee1a11b38424749501887b0a6f0
|
File details
Details for the file babynamesil-0.2.1-py3-none-any.whl.
File metadata
- Download URL: babynamesil-0.2.1-py3-none-any.whl
- Upload date:
- Size: 974.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1f09516d95a4ed0eaad00900181f3cac3f573f3bde0b84a93ab66f2d90472afa
|
|
| MD5 |
29e3b88cf08d107876cdff8f6942dd88
|
|
| BLAKE2b-256 |
ae63604f9cb8aa147cdd33c68381e0c4835b34e7ee05a73fb1211a29bfa229a9
|