Skip to main content

Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

Project description

SEC SGML

A python library to parse Securities and Exchange Commission Standardized Generalized Markup Language. Used to power the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Will be expanded to also parse SGML Tables.

All Variations

secsgml also attempts to standardize the metadata between formats. e.g. 'CENTRAL INDEX KEY' will be mapped to 'cik'.

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
metadata,documents = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt',output_dir='results')

# from content
parse_sgml_submission(content=sgml_content,output_dir='results')

Note

Will be giving parse_sgml_submission_into_memory more love, will have to refactor parse_sgml_submission afterwards.

Future

  • SGML Table parsing
  • Optimization + refactor in Cython/ C bindings.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secsgml-0.1.2.tar.gz (177.6 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

secsgml-0.1.2-cp313-cp313-win_amd64.whl (263.6 kB view details)

Uploaded CPython 3.13Windows x86-64

secsgml-0.1.2-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (762.1 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

secsgml-0.1.2-cp313-cp313-macosx_10_13_universal2.whl (356.2 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.1.2-cp312-cp312-win_amd64.whl (264.2 kB view details)

Uploaded CPython 3.12Windows x86-64

secsgml-0.1.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (772.1 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

secsgml-0.1.2-cp312-cp312-macosx_10_13_universal2.whl (359.2 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.1.2-cp311-cp311-win_amd64.whl (264.6 kB view details)

Uploaded CPython 3.11Windows x86-64

secsgml-0.1.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (773.3 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

secsgml-0.1.2-cp311-cp311-macosx_10_9_universal2.whl (356.6 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file secsgml-0.1.2.tar.gz.

File metadata

  • Download URL: secsgml-0.1.2.tar.gz
  • Upload date:
  • Size: 177.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.2.tar.gz
Algorithm Hash digest
SHA256 e1df15c5c8ff5cf5255e73c880c9c8a464ee584ba8dd9cb453545f1386a3701e
MD5 6da5a06404327a3e69fa6875d6bc4ba9
BLAKE2b-256 09de876e4fedddab85ad22f6e32a1db66842ca0754ee781ccd7445c76a383c45

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.2-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 263.6 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.2-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 50d187f9c9bbc2f28088e55f21bbddf2253bf4c6a2d5647667801fc69ade0a75
MD5 e7dfecb2f17d7f5a88bd76bc498b96b7
BLAKE2b-256 3ed5d68dad6e7ccc97737c08c4b9ffd8160ce3e7bdfa10acb0503e4d3dd5a983

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.2-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 3224e79f89324b0344ff5fb5ead26af30521f8df15702841cb3aa808f4018140
MD5 a3ac8b749f745b4d355401251d0ed39b
BLAKE2b-256 fe266931fd2bac074da4d95e2f64c4335158fa6ba01ec1dcd716bf4271205cd6

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.2-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 45335adfeae3186f77d1cf2c1ae1fc1691916e7ddc87a6e6fad633e25d57e048
MD5 bf1552d4d43a079915d54ea72d4177ea
BLAKE2b-256 501cb5b871496ae54246bcdce1e21c69793e2d5a37ef459ffff50a1c2951cf84

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.2-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 264.2 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.2-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 1f0d53bf2d44f7ef1ce315b53ba236b1dc51432364c95190c815091cdedd80a2
MD5 5adc12d31d79671edbf40ca0862d9bae
BLAKE2b-256 4cfddb42f50e691aa935f38af3a07785858a63afe8f129729cc96cc8ac1dc94c

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.2-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 046506e422a2547db57597f4b24b6be487969aa1f8f22f9983fca35994bef26a
MD5 17880411c3ddb3673e4873fe69d3f46f
BLAKE2b-256 dce1478572dcaf7597a0098d5edad83374da9ffc9cd667cb013d6f48eb2e7a01

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.2-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 ef004db8b827915cd8a14084cca59b65fee4b5d54219263544044be493d6208d
MD5 152ce0f9332f502b3a3ab96289013b34
BLAKE2b-256 22206fefcfaa07bbede17d4e4a5f7d1fb760fc879aba519d7da2a4fcbbcb75d3

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.2-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 264.6 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.2-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 6469434bc0d11218795661d5bacf15fc5ae3c41a144d194c92581a6f1fb6900b
MD5 2aa840ae9be53b5cafa80af581395756
BLAKE2b-256 de1eac866ef62579479e93ae6b5c8234973739206bf7fb4dcf19444e31ae294c

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.2-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5df52a0bd6f71365381c1a76a8bd178f02c2bcb87734c8c9266cb064821352dd
MD5 9a80284bc7bb1578dc1b021018318f71
BLAKE2b-256 85121f0871d3efed7133f723be1a3c1b421ebe9abc695242f60cbc4337291c39

See more details on using hashes here.

File details

Details for the file secsgml-0.1.2-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.2-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 d72aaa54af3d53fdbee23afb3b97100e4a600306111dece6c3bfb73c97975077
MD5 1c598b21a9e34c637de3edf82f350a8e
BLAKE2b-256 b8c71ec7604647ba64c2009ab74e585be3c14f8e56dbc71f484bf7cb06ab4a2a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page