Skip to main content

Israeli baby names dataset (1949-2024) from CBS

Project description

babynamesil

PyPI version Python 3.9+ License: CC0-1.0

Israeli baby names dataset (1949-2024) from the Central Bureau of Statistics (CBS).

This package provides easy access to comprehensive baby name statistics from Israel, including names given to at least 5 babies per year, categorized by demographic sector, sex, and year.

Installation

pip install babynamesil

Quick Start

import babynamesil

# Load the main dataset
df = babynamesil.load_data()
print(df.head())

Output:

   sector  year sex  name     n      prop
0  Jewish  1949   F  רחל  1362  0.038065
1  Jewish  1949   F  אסתר  1344  0.037562
2  Jewish  1949   F  שרה  1190  0.033258
3  Jewish  1949   F  מרים   964  0.026942
4  Jewish  1949   F  חנה   895  0.025013

Usage Examples

Find the most popular names in 2024

import babynamesil

df = babynamesil.load_data()

# Top 10 Jewish names in 2024
top_2024 = (
    df[(df['year'] == 2024) & (df['sector'] == 'Jewish')]
    .groupby('sex')
    .apply(lambda x: x.nlargest(10, 'n')[['name', 'n']])
)
print(top_2024)

Track a name over time

import babynamesil
import matplotlib.pyplot as plt

df = babynamesil.load_data()

# Track the name "נועם" (Noam) over time
noam = df[(df['name'] == 'נועם') & (df['sector'] == 'Jewish')]
noam_pivot = noam.pivot(index='year', columns='sex', values='prop')

noam_pivot.plot(title='נועם - Popularity Over Time')
plt.ylabel('Proportion')
plt.show()

Get all-time totals

import babynamesil

totals = babynamesil.load_totals()

# Most popular names of all time (Jewish sector)
jewish_totals = totals[totals['sector'] == 'Jewish']
print(jewish_totals.groupby('sex').apply(lambda x: x.nlargest(5, 'total')))

Available Datasets

Function Description Years Rows
load_data() Main baby names by year/sector/sex 1949-2024 ~160K
load_totals() Aggregated totals by name 1949-2024 ~8K
load_1948() Legacy 1948 data 1948 ~500
load_other() Archived "Other" sector 1985-2021 ~5K

Data Structure

Main Dataset (load_data())

Column Type Description
sector str "Jewish", "Muslim", "Christian-Arab", or "Druze"
year int Birth year (1949-2024)
sex str "M" (male) or "F" (female)
name str Baby name in Hebrew
n int Count of babies with this name
prop float Proportion within year/sector/sex (0-1)

Totals Dataset (load_totals())

Column Type Description
sector str Demographic sector
sex str "M" or "F"
name str Baby name in Hebrew
total int Total count across all years

Data Source

All data is sourced from CBS (Israel Central Bureau of Statistics) Release 391/2025:

Related Projects

License

CC0 1.0 Universal - This work is dedicated to the public domain.

Contributing

Contributions are welcome! Please open an issue or submit a pull request on GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

babynamesil-0.2.1.tar.gz (975.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

babynamesil-0.2.1-py3-none-any.whl (974.1 kB view details)

Uploaded Python 3

File details

Details for the file babynamesil-0.2.1.tar.gz.

File metadata

  • Download URL: babynamesil-0.2.1.tar.gz
  • Upload date:
  • Size: 975.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for babynamesil-0.2.1.tar.gz
Algorithm Hash digest
SHA256 1d5f48fa94059e74af944b3e37fd2d8d25f8e17816eb725544e49252c0ff4b3c
MD5 109bfb348dc55a9115d17bf209abae30
BLAKE2b-256 36f2f91bb900fd1332fe1e7df695bc72f3d35ee1a11b38424749501887b0a6f0

See more details on using hashes here.

File details

Details for the file babynamesil-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: babynamesil-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 974.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for babynamesil-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1f09516d95a4ed0eaad00900181f3cac3f573f3bde0b84a93ab66f2d90472afa
MD5 29e3b88cf08d107876cdff8f6942dd88
BLAKE2b-256 ae63604f9cb8aa147cdd33c68381e0c4835b34e7ee05a73fb1211a29bfa229a9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page