Skip to main content

Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

Project description

SEC SGML

A python library to parse Securities and Exchange Commission Standardized Generalized Markup Language. Used to power the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Will be expanded to also parse SGML Tables.

All Variations

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
metadata,results = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt',output_dir='results')

# from content
parse_sgml_submission(content=sgml_content,output_dir='results')

Note

Will be giving parse_sgml_submission_into_memory more love, will have to refactor parse_sgml_submission afterwards.

Future

  • SGML Table parsing
  • Optimization + refactor in Cython/ C bindings.
  • Standardize metadata for different file types. Keys and values vary across variations, e.g. 'CIK' vs 'CENTRAL INDEX KEY' as well as values such as '34' vs '1934'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secsgml-0.0.9.tar.gz (176.6 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

secsgml-0.0.9-cp313-cp313-win_amd64.whl (262.3 kB view details)

Uploaded CPython 3.13Windows x86-64

secsgml-0.0.9-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (760.8 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

secsgml-0.0.9-cp313-cp313-macosx_10_13_universal2.whl (354.9 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.0.9-cp312-cp312-win_amd64.whl (262.9 kB view details)

Uploaded CPython 3.12Windows x86-64

secsgml-0.0.9-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (770.9 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

secsgml-0.0.9-cp312-cp312-macosx_10_13_universal2.whl (357.9 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.0.9-cp311-cp311-win_amd64.whl (263.3 kB view details)

Uploaded CPython 3.11Windows x86-64

secsgml-0.0.9-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (772.0 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

secsgml-0.0.9-cp311-cp311-macosx_10_9_universal2.whl (355.3 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file secsgml-0.0.9.tar.gz.

File metadata

  • Download URL: secsgml-0.0.9.tar.gz
  • Upload date:
  • Size: 176.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.9.tar.gz
Algorithm Hash digest
SHA256 516a951867cb300618bafa6710eb5865761894ee37eea853a04e9140a500fe76
MD5 7a22feaf370507abf65e19f8831af328
BLAKE2b-256 a1d0572247a101a79c01c3b6a6909428e6e2693c6e0545e551aae0353bf0b4ce

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.9-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 262.3 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.9-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 b613cd954910842ffed7d167a3ae186eaa36f9e805de345ce3d7c974fe93bac4
MD5 5c1f2bef1bc2136472e7cb65aae98fe5
BLAKE2b-256 b4758f4aa58c2b6c3883c839ae64c8459ce380efe8ff8885ae0e0d70baf620ae

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.9-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 facb97bf326279b98ea0b3494a49edb9d3be19ed2beffc3cbfa83634c8cd4c6b
MD5 ee3488d83d33f51b2dad2d5cde7397c7
BLAKE2b-256 d1ee8541b2a4b300c2ea92919735b25ed7c6e9708af1e58f1fe1dba6e6c32eb3

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.9-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 5dc32c6adfee563f37abfb606922f13eaf6eee5d98a636bcadebc6ed3a2ab267
MD5 400cb7e29747ca0df1d0236c15fcbc3e
BLAKE2b-256 8cd42379cede373efe6425bbf652e5fbc2d181c6902bd093e9db2f9924aabb48

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.9-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 262.9 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.9-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 939fdb2f6cac2b041b9489694a75a183bfe66a3c518e95c4363d3bcbc97ed4f8
MD5 48f780c5342a1a2fcdaf30a2cceb056b
BLAKE2b-256 79d74722ff3327a95a9fdd316b3333183e47008e5782acff8b796e65c18c29e3

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.9-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 4c27976ae6f51fc0bb1c6681a2f15410467258b9334f9ba2c69ae30d8e48e510
MD5 22976e3dc124c1c9f6d20d13636ac4e0
BLAKE2b-256 e3ad41a32dece7fe63b512dd04000ab2836fdc50fc33617177298d93589f9424

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.9-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 ca5155b1f60dc4cf2733a06c0ea0f3d7324e7530f6908ed20a64a70adf1d3d47
MD5 cb159b98a74e0cc9f7de7628055f2b9c
BLAKE2b-256 fa2d9d70083222c972f33f3e7135317626147109d5f9eaee7009b03198dc1887

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.9-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 263.3 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.9-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 721f03bd019b04eafdf7d0f136b291d567441e28f6374b3b277531773160eb61
MD5 42e127693420c093c262f53fd9f3acd1
BLAKE2b-256 0f9bd8f7c567122f5860e24e83448932dccb29e986f4b7764818a6f07a038881

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.9-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 3f3daef2df342cfa17a50733934e610a6797b8d24496eaa0dbf8c111be68fa1b
MD5 c7c61edce467b4c05d689d7d4bc4280d
BLAKE2b-256 0e49ecc3e708eebe14687b820ce0d3b83129a2a7e0859e035d4aad110e6d7090

See more details on using hashes here.

File details

Details for the file secsgml-0.0.9-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.9-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 0eeda9038a95344ea9a724418a4de4023204f3e1b35821d73547c233ec0f4160
MD5 39a68212c7d7032e0ce186221921661d
BLAKE2b-256 78c205a975238946fe58c48db8250c8bc38041f527a9c17154de7de9c27fa203

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page