Skip to main content

Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

Project description

SEC SGML

A python library to parse Securities and Exchange Commission Standardized Generalized Markup Language. Used to power the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Will be expanded to also parse SGML Tables.

All Variations

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
metadata,results = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt',output_dir='results')

# from content
parse_sgml_submission(content=sgml_content,output_dir='results')

Note

Will be giving parse_sgml_submission_into_memory more love, will have to refactor parse_sgml_submission afterwards.

Future

  • SGML Table parsing
  • Optimization + refactor in Cython/ C bindings.
  • Standardize metadata for different file types. Keys and values vary across variations, e.g. 'CIK' vs 'CENTRAL INDEX KEY' as well as values such as '34' vs '1934'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secsgml-0.0.6.tar.gz (172.5 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

secsgml-0.0.6-cp313-cp313-win_amd64.whl (88.4 kB view details)

Uploaded CPython 3.13Windows x86-64

secsgml-0.0.6-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (587.3 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

secsgml-0.0.6-cp313-cp313-macosx_10_13_universal2.whl (181.4 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.0.6-cp312-cp312-win_amd64.whl (89.0 kB view details)

Uploaded CPython 3.12Windows x86-64

secsgml-0.0.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (597.3 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

secsgml-0.0.6-cp312-cp312-macosx_10_13_universal2.whl (184.4 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.0.6-cp311-cp311-win_amd64.whl (89.4 kB view details)

Uploaded CPython 3.11Windows x86-64

secsgml-0.0.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (598.5 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

secsgml-0.0.6-cp311-cp311-macosx_10_9_universal2.whl (181.8 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file secsgml-0.0.6.tar.gz.

File metadata

  • Download URL: secsgml-0.0.6.tar.gz
  • Upload date:
  • Size: 172.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.6.tar.gz
Algorithm Hash digest
SHA256 f8fd09ae064bb003228e40192c0ba430b2fcb56387b1ebd827f49b209f3efbd6
MD5 39590542d54b6c0f3a9d80caac457bac
BLAKE2b-256 36f88bbc2c380a9e31886324a84ad55107e33caf8bcbd3471ab5771229ccde1f

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.6-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 88.4 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.6-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 59227b8495ae94833c32cbd913218cf14356e38993b3f356f31017c091e06373
MD5 d7b1ea8f50510be747bc16fa65b3fe02
BLAKE2b-256 07332d6f8c577b7e3a9bf51058901af6afdec3f1fb39341dd3c156f529c2b559

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.6-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 e7ccbb6f97a612770963b4c404f9f0dc8f7c91ff1d3ba47a72667db203351784
MD5 8dfc2d4c8678470322fd55830927362b
BLAKE2b-256 b7d3b5b6449eab9de16c931ffafa7bd81878c6d5f65ce6021596b04f190d8ce8

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.6-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 041ef09ef9bd71de3da625401ab028c254edea52706393533d6c66bd0ba1ace0
MD5 5c194625e6397cd6023221e08620d0b2
BLAKE2b-256 47c9904a0f00246cd548821c8e577d4db536dc7d25335bcba6e9b7c52e660db7

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.6-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 89.0 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.6-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 1aefdf1c7ef15be72b506bf5ed5ead4ddf678a1f6f78d5b9be4ad507b57861e6
MD5 19f4245d2c8659c4b7329069e1141157
BLAKE2b-256 619e6644cf4765c3b2f73de33255d59cec30909ae302d5dbedad697a68af04c3

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 e5ff8aaee241f8f4a15c686dc7b5c76c5728d21bfccb9c45980941b1d2cc59ee
MD5 74900cce28b4d3f0bfe06d9b10621cbb
BLAKE2b-256 41b7a2199f21f498f2ac13c11f940e8c903a4fc32a97d586b2635bda05dab79a

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.6-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 9b6ce663704b020dafd3c51f656906bf63dca20cebd02096f417e2ab997ecc89
MD5 e225e41a98f13a3031855802e5c79512
BLAKE2b-256 32e8ed43492073a19e7b83da78defa5ca807d0c8c8d56338e1d6b962a0a786b1

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.6-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 89.4 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.6-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 ed828d1cca2e912b8ea93268cbbea4fd56df61dff5c1243114d94f12b724aa73
MD5 bfa28f5e1ab22a9d2cc390f1cda2a743
BLAKE2b-256 55923399edafcd929c7fdd88f5b0bf9b976d4ea944def3f69b8a4f97d512600f

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 be072330bc787d5950faa875e3c8f385b28bd68734659f36bb5768f4f9655b22
MD5 d44fdb06e814eb5c66474226c9827d01
BLAKE2b-256 51632f7a11c891389e65016e41c305a737d208e75cac81cbcc46699354d49f36

See more details on using hashes here.

File details

Details for the file secsgml-0.0.6-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.6-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 75635f155e2e397611fcb73c7d740611f6e5712b5d97c09b97ea0a51ca9e14b6
MD5 f7d8d6522ef95070e75ea98d5ff59c75
BLAKE2b-256 d7c1f0fa229625cfb5ff636ac8db3a3181d599c99571ebf536d737bef0a1bb33

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page