Skip to main content

Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

Project description

SEC SGML

A python library to parse Securities and Exchange Commission Standardized Generalized Markup Language. Used to power the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Will be expanded to also parse SGML Tables.

All Variations

secsgml also attempts to standardize the metadata between formats. e.g. 'CENTRAL INDEX KEY' will be mapped to 'cik'.

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
metadata,documents = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt',output_dir='results')

# from content
parse_sgml_submission(content=sgml_content,output_dir='results')

Note

Will be giving parse_sgml_submission_into_memory more love, will have to refactor parse_sgml_submission afterwards.

Future

  • SGML Table parsing
  • Optimization + refactor in Cython/ C bindings.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secsgml-0.1.3.tar.gz (177.6 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

secsgml-0.1.3-cp313-cp313-win_amd64.whl (263.6 kB view details)

Uploaded CPython 3.13Windows x86-64

secsgml-0.1.3-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (762.1 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

secsgml-0.1.3-cp313-cp313-macosx_10_13_universal2.whl (356.2 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.1.3-cp312-cp312-win_amd64.whl (264.2 kB view details)

Uploaded CPython 3.12Windows x86-64

secsgml-0.1.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (772.1 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

secsgml-0.1.3-cp312-cp312-macosx_10_13_universal2.whl (359.2 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.1.3-cp311-cp311-win_amd64.whl (264.6 kB view details)

Uploaded CPython 3.11Windows x86-64

secsgml-0.1.3-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (773.3 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

secsgml-0.1.3-cp311-cp311-macosx_10_9_universal2.whl (356.6 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file secsgml-0.1.3.tar.gz.

File metadata

  • Download URL: secsgml-0.1.3.tar.gz
  • Upload date:
  • Size: 177.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.3.tar.gz
Algorithm Hash digest
SHA256 0272141c1c93d3c2400a1664a4219dff29ddf3012d6229e53fb0ef1acb56d07c
MD5 7b80df28b3172f1cae8690c873503f48
BLAKE2b-256 558484ee6130d3feb07b20e07d1d6cfcdd41e06887e02da1b0df545d0e4275e9

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.3-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 263.6 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.3-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 b97e894ac5919f6b0c11f2ae157074fb28a72ca529329f6ffb717626e376ebef
MD5 c6872ed62803a45c16e3939de9a22c57
BLAKE2b-256 62a121aba41b3b54876551101e20a0b7ba503b78c4a9e1e5795529da993c7f10

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.3-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 29714761c26228ffeb1086da2d072296c9f0f56808c6cb3967b43c2531fd4225
MD5 bba39625fe3ff4faab8b4976e97c67d0
BLAKE2b-256 7e12ea3bec92cb325b2ee0ce6f86fb0c864e7766ad9324c3a98449a5168b0335

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.3-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 48599c2dbc27df29aa7aa24159ad69cb85dff20bcf61fff311034aee5799bb8f
MD5 e1f48274f13d741d71928a1ea5855c52
BLAKE2b-256 dd1a56e6be7f8422e0af6b8924479dc1b2de55806782662840c60367e9cb2cbf

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.3-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 264.2 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.3-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 823ac0c3777669afa70ef61ed6f6cb9ffa9acf62f0a5f2694c4feebf1db157c4
MD5 7492bb165d64304b1e28b0380e124963
BLAKE2b-256 2c02a2f3951d1cd7fd171fca203acb03007479f43b906408788a7b32ee248e61

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.3-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0d6e8b30b603f6ce80300aab3494445a2b6ab34a3c5946a9a5d0501f4f158d43
MD5 37351c847e1b493fc27b27231730ddf4
BLAKE2b-256 5c5b357ad04c3285c2c5a6fc85d341f469a146b754af2b6fe724bb87d9cff15d

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.3-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 3ab790af6e33342cf9616e0cbda2c40db9cf8ab37329ca7f7c933b715cd64ba4
MD5 5a8726d3193830a2028e5b4070629897
BLAKE2b-256 4dd5f85ef528e2947acbd1ebf696fa58eff706e2f99d8a202777b588a589c41d

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.3-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 264.6 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.3-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 6af3ff599069fe9b22538f09055b3ed31c292bc2ca28ee5abf536f991b432cf5
MD5 f6bbda52a690557ae7865489008b93fc
BLAKE2b-256 a39f9298c83477409d6e439c62a0dee2fb7b7b9e4685112a32bb6c206c588d9d

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.3-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 6a0ba27bf93773fbd856a922e8e31e721341fcf7635d6932ca6b73f9ab846176
MD5 230c7dd463f4755ba9f8f10d8ee90ba4
BLAKE2b-256 2a885b1fa111ce44b61c4dd8909827239668c499f7a989726b62410f51d28579

See more details on using hashes here.

File details

Details for the file secsgml-0.1.3-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.3-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 932961b46f8b4913f2fd780c380639ef484cca8518144b2913bb8182af9af0b7
MD5 30a36a42adf3bd663803d1316fe47f25
BLAKE2b-256 958bef9f6d035d05cd6b5f1888e47925162aec7698c8c391b4914cc7d5e0db4e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page