Skip to main content

A Python module for reading and extracting data from GEDCOM files.

Project description

Simple GEDCOM Parser

A simplified Python library for extracting genealogy data from GEDCOM files:

  1. Extract basic person data - Get a list of people with vital information
  2. Extract sources - Get lists of documentary sources
  3. Extract pedigree - Get direct ancestors

Features

  • Parse GEDCOM 5.5 files
  • Extract person data: names, birth/death dates and places, parents
  • Extract source citations linked to individuals
  • Extract pedigree
  • Simple, clean API designed for data analysis and writing to csv files

Quick Start

# Read a GEDCOM file and write lists to CSV files
from simple_gedcom import load_gedcom

gedcom = load_gedcom('data/tree.ged')

gedcom.save_person_list_to_csv()

gedcom.save_pedigree_to_csv()

gedcom.save_source_list_to_csv()

gedcom.save_person_source_list_to_csv()

# pedigree analytics
pedigree = gedcom.get_pedigree()

gedcom.show_generation_counts(pedigree) 

gedcom.show_pedigree_duplicates(pedigree) 

# Use pandas to display lists
import pandas as pd

person_list = gedcom.get_person_list()
df_person_list = pd.DataFrame(person_list)
print(df_person_list.head())

sources = gedcom.get_source_list()
df_sources = pd.DataFrame(sources)
print(df_sources.head()) 

person_sources = gedcom.get_person_source_list()
df_person_sources = pd.DataFrame(person_sources)
print(df_person_sources.head())

pedigree = gedcom.get_pedigree()
df_pedigree = pd.DataFrame(pedigree)
print(df_pedigree.head())

# Search by name
found = gedcom.find_persons_by_name(first_name="Theodore")
df_found = pd.DataFrame(found)
print(df_found.head())

# Show the pedigree for a specific person (by ID)
pedigree = gedcom.get_pedigree("@I162694122750@")
df_pedigree = pd.DataFrame(pedigree)
print(df_pedigree.head())

Requirements

  • Python 3.6+
  • No external dependencies for core functionality
  • pandas (optional, for DataFrame examples)

License

This project is licensed under the GNU General Public License v2.0 - see the LICENSE file for details.

Attribution

This project is derived from python-gedcom by Nicklas Reincke and contributors. The original project provided the foundation for GEDCOM parsing, which has been simplified and focused for specific genealogy data extraction use cases.

Original Copyright (C) 2018-2019 Nicklas Reincke and contributors
Simplified version Copyright (C) 2025 [mcobtechnology]

Contributing

This is a simplified, focused library. If you need additional GEDCOM functionality, consider using the full-featured python-gedcom library.

For bug fixes and improvements to the core functionality, feel free to open issues or submit pull requests.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simple_gedcom-1.0.6.tar.gz (17.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

simple_gedcom-1.0.6-py3-none-any.whl (19.4 kB view details)

Uploaded Python 3

File details

Details for the file simple_gedcom-1.0.6.tar.gz.

File metadata

  • Download URL: simple_gedcom-1.0.6.tar.gz
  • Upload date:
  • Size: 17.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.10.0 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/1.0.0 urllib3/1.26.20 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.9

File hashes

Hashes for simple_gedcom-1.0.6.tar.gz
Algorithm Hash digest
SHA256 3a51ade31c1dd9deeac1105371097dbb228748e9deeeaf7809ad5d2ecabb02bd
MD5 908e10ad0c70d33bd6eacbda49d83793
BLAKE2b-256 eeadbb8a2cbd75a0c562e2bcd465a0f07bb7b2cf2476e5a1eeef4cd211f138c3

See more details on using hashes here.

File details

Details for the file simple_gedcom-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: simple_gedcom-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 19.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.10.0 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/1.0.0 urllib3/1.26.20 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.9

File hashes

Hashes for simple_gedcom-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 c0ed57c93b3ef78eb85852acfd1cf72ba9baf9853ab67ecf5d72e91535074dfa
MD5 0938e297b6356e127645f842d9e28ad9
BLAKE2b-256 2a0adde907e92b60dd3cd9d906e31205084f717b1070e0b498d6e6fdcb568287

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page