Skip to main content

Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

Project description

SEC SGML

A python library to parse Securities and Exchange Commission Standardized Generalized Markup Language. Used to power the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Will be expanded to also parse SGML Tables.

All Variations

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
metadata,results = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt',output_dir='results')

# from content
parse_sgml_submission(content=sgml_content,output_dir='results')

Note

Will be giving parse_sgml_submission_into_memory more love, will have to refactor parse_sgml_submission afterwards.

Future

  • SGML Table parsing
  • Optimization + refactor in Cython/ C bindings.
  • Standardize metadata for different file types. Keys and values vary across variations, e.g. 'CIK' vs 'CENTRAL INDEX KEY' as well as values such as '34' vs '1934'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secsgml-0.0.7.tar.gz (172.5 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

secsgml-0.0.7-cp313-cp313-win_amd64.whl (88.5 kB view details)

Uploaded CPython 3.13Windows x86-64

secsgml-0.0.7-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (587.3 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

secsgml-0.0.7-cp313-cp313-macosx_10_13_universal2.whl (181.4 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.0.7-cp312-cp312-win_amd64.whl (89.1 kB view details)

Uploaded CPython 3.12Windows x86-64

secsgml-0.0.7-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (597.3 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

secsgml-0.0.7-cp312-cp312-macosx_10_13_universal2.whl (184.4 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.0.7-cp311-cp311-win_amd64.whl (89.5 kB view details)

Uploaded CPython 3.11Windows x86-64

secsgml-0.0.7-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (598.5 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

secsgml-0.0.7-cp311-cp311-macosx_10_9_universal2.whl (181.8 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file secsgml-0.0.7.tar.gz.

File metadata

  • Download URL: secsgml-0.0.7.tar.gz
  • Upload date:
  • Size: 172.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.7.tar.gz
Algorithm Hash digest
SHA256 d78f9c25ad11bd2f681fce152a5cba509e5546bf4ed9b2783ef83f969b5a169a
MD5 c35139d21ca04c5b159ea5df60eb14e8
BLAKE2b-256 1d7938e422ca4fcf11f1f0aca21318290ac9c4999c1623e612a679c065104078

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.7-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 88.5 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.7-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 f437d4e29aa445e2f625ed3625b902d780d7034b55c7e9912f779b6b84c7f4ad
MD5 92d074d39f1de7a735783e1bba3e7260
BLAKE2b-256 c491a00e782f96feba15e381fee5cecf43e98022b096147df4a224fb4b456521

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.7-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0466fd1b80eaff8170dc8ce13676b12f8c5b0209759b76b6c67e5bc8b0b03708
MD5 5636245c26b67c8996fd667204471a59
BLAKE2b-256 093ea45581c2f94e8d4645ecc4562ef4035da0701d5ea84bae3f515f72dae94f

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.7-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 000aed26612bc4bfd6acacf00d9b740b09e515945c09b048f52d4f34739bfed7
MD5 f8a46c336569f67649387180a50ba0cd
BLAKE2b-256 a16c693146a4d76895cefdcd390aa14ed79175b58abb0252bf54e7a3eaf6b425

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.7-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 89.1 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.7-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 a0f290bb23c89dcc9049c7fd58d706d138ce486c83e9e06f9f14ae8ed244bdae
MD5 00a86621ed93e369a49310fe38483e9c
BLAKE2b-256 03063871e3a51fdb134f5190a3ec85815c07f2d59e1dd0fad38f3e292536f0b8

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.7-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 237c03cece940c51e5fe4bc6a468649067f3bda9427762ff2747a705a4690d1f
MD5 fadbc1ebdf6e884eb6c387863c26a906
BLAKE2b-256 bc5aacf16b400dd3f0bb1b1351b314e10da3f96e0d5e61acbc27cbeaae24a1f5

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.7-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 07fcbcd9c3908508c672ff456c896ab71082f1781fb33a37d41efd79149e8dda
MD5 c867082ea936c448e8342b32e86cda47
BLAKE2b-256 ba0763e440ade6c243c2022695dd9876d0a6d0c234d3e08a9f2bb0e276a71d32

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.0.7-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 89.5 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.0.7-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 ebec4b14b4b12548bd486225d59c9ccabf5e2364e828ca5a4dbdd64b17632a40
MD5 dd1812a26901bd24f44d9a046159d4a8
BLAKE2b-256 bc8ddec6b8565943a843e2010587b9dca6033a011e2d27080595c9a56c08f6d3

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.0.7-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5db0fd9355da9e3f6468f3298fbbb6968adc6fc38b979fbc1e20c3fbc0d49e50
MD5 829ad55d04fcc46b59f92c978e873098
BLAKE2b-256 5aaddc3bcbd0c8429a106f45334655ed2a80c656caef24c1ae72e29b2eb6a68b

See more details on using hashes here.

File details

Details for the file secsgml-0.0.7-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.0.7-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 d44ebc4990ba2feeb56c215938ec02cae38dda14ddf39a153ddec5d0411d8477
MD5 67fb2cdab0a21afa4edd0fc9f571d87b
BLAKE2b-256 dc07589e65fdab5fd65a832e9494d5fa5c4b88f984156c0f7052c4cb69cf1bbc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page