Skip to main content

Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

Project description

SEC SGML

A python library to parse Securities and Exchange Commission Standardized Generalized Markup Language. Used to power the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Will be expanded to also parse SGML Tables.

All Variations

secsgml also attempts to standardize the metadata between formats. e.g. 'CENTRAL INDEX KEY' will be mapped to 'cik'.

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
metadata,documents = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt',output_dir='results')

# from content
parse_sgml_submission(content=sgml_content,output_dir='results')

Note

Will be giving parse_sgml_submission_into_memory more love, will have to refactor parse_sgml_submission afterwards.

Future

  • SGML Table parsing
  • Optimization + refactor in Cython/ C bindings.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

secsgml-0.1.5.tar.gz (177.6 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

secsgml-0.1.5-cp313-cp313-win_amd64.whl (263.6 kB view details)

Uploaded CPython 3.13Windows x86-64

secsgml-0.1.5-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (762.2 kB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

secsgml-0.1.5-cp313-cp313-macosx_10_13_universal2.whl (356.3 kB view details)

Uploaded CPython 3.13macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.1.5-cp312-cp312-win_amd64.whl (264.2 kB view details)

Uploaded CPython 3.12Windows x86-64

secsgml-0.1.5-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (772.2 kB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

secsgml-0.1.5-cp312-cp312-macosx_10_13_universal2.whl (359.2 kB view details)

Uploaded CPython 3.12macOS 10.13+ universal2 (ARM64, x86-64)

secsgml-0.1.5-cp311-cp311-win_amd64.whl (264.6 kB view details)

Uploaded CPython 3.11Windows x86-64

secsgml-0.1.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (773.4 kB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

secsgml-0.1.5-cp311-cp311-macosx_10_9_universal2.whl (356.7 kB view details)

Uploaded CPython 3.11macOS 10.9+ universal2 (ARM64, x86-64)

File details

Details for the file secsgml-0.1.5.tar.gz.

File metadata

  • Download URL: secsgml-0.1.5.tar.gz
  • Upload date:
  • Size: 177.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.5.tar.gz
Algorithm Hash digest
SHA256 c3ab59450ba91589ac4a6f7b9599235871257cdd1017a851ab30ddf1740e8290
MD5 48b06995c3544742b16e18c7eaa5d172
BLAKE2b-256 70ceeb2f7caae555d3c652f78d030cec9222a936fbef122b552d85d25f43b298

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp313-cp313-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.5-cp313-cp313-win_amd64.whl
  • Upload date:
  • Size: 263.6 kB
  • Tags: CPython 3.13, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.5-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 8e82203942bb7a431a407ec38cf10425f31b62a88a650210ffc9dbfbab7f3925
MD5 d7f6cc7688cf62f011de104524e24444
BLAKE2b-256 c3eb2df127114716bbed194f2d35a2b0a9cb9380b18bc81028a8374276b3b0f6

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.5-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 63b9a2178c22a857ce8d6481e5669b5094658f716f1c77a185d1b528c1a8490e
MD5 7bc0995239754c599fccc70b205a5abe
BLAKE2b-256 b4b4e3e6c03af9ef4fe5102a0c6ae2dcbe08b7977aa3f292d7caf7164ff66f47

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp313-cp313-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.5-cp313-cp313-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 9a8222cf369bd91fd9efe4307f3d0e1c7ff3cb8b3bcb524a9babe7c214cd904b
MD5 0590fd03fab1b44b35bfbb04030f27cb
BLAKE2b-256 1d766a1e83a15a0596ae002c8fe796367ef4a0660ae6e5ee2361119da7b45b4a

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp312-cp312-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.5-cp312-cp312-win_amd64.whl
  • Upload date:
  • Size: 264.2 kB
  • Tags: CPython 3.12, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.5-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 56b0bbca49eebcdefd741af427b66c4d1314515bda3515e2acce0f6581a77369
MD5 a7a7568932fdf5b11046edbc1caad271
BLAKE2b-256 0ddf641f29a90fbde4618d07e73ee49a2ca6d38b1d16e45b51b4ef60f60eec37

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.5-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9106c8ce7d20b3c2817d25c87546b54e87e4822ad594310acdd628b10a0cb0b8
MD5 631dbc988960607c6d7511df90575a08
BLAKE2b-256 1d26a30a13001b8ec7556bf08c8091d2def61e09c726e4970562827385cb1c1e

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp312-cp312-macosx_10_13_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.5-cp312-cp312-macosx_10_13_universal2.whl
Algorithm Hash digest
SHA256 88f5fc104d4e530ebd60624ff72d313ceeb873c85e4c49c83512384d32883296
MD5 16c0e438f8315f0d77fe13bc24359151
BLAKE2b-256 03f4b80f7285fe26cd93dfd16c8c06d44922134c10e8909ac1aaf179a831eee0

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: secsgml-0.1.5-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 264.6 kB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for secsgml-0.1.5-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 e45e72cf3c7ab10aeda6799f8e49ac3ce33f9e4e22845145c7f2755af2994a52
MD5 76536bf4432203743b7f71165f119dc7
BLAKE2b-256 cf971bf9d6eb8eb4635649d682c59ac499931d134b222c1c0ea1870377ba0390

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for secsgml-0.1.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 70600db74eeb1bf179aefd0e8afd85682da07872886aa4ee52e2a87c7504ea58
MD5 4411b04224a891b29ea5b88409acb53d
BLAKE2b-256 91c64971fba3aa80eebe269a9307e68b1d4e1d38a40fffb97c911301cb7b61d7

See more details on using hashes here.

File details

Details for the file secsgml-0.1.5-cp311-cp311-macosx_10_9_universal2.whl.

File metadata

File hashes

Hashes for secsgml-0.1.5-cp311-cp311-macosx_10_9_universal2.whl
Algorithm Hash digest
SHA256 2c30101085d61a5156ce25a84d9562d136ae43db6235c1b0e0618e561dc9cd3b
MD5 938afa6f9d437c0ae28ec9ecde3346e3
BLAKE2b-256 79a854ed5519be7aa5b9e6365d6009d439423c3ed54507aa5c11f5b91bcab636

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page