Parse Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files

SEC SGML

A Python library for parsing Securities and Exchange Commission Standard Generalized Markup Language (SEC SGML) files. It powers the open-source datamule project.

Currently parses two types of files:

  1. Daily Archives
  2. Submissions

Support for parsing SGML tables is planned.
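For readers new to the format, here is a rough illustration of what a submission file contains. It assumes the usual EDGAR layout, in which each filing document sits in a <DOCUMENT> ... </DOCUMENT> block, preceded by unclosed header tags such as <TYPE> and <FILENAME>, with the body inside <TEXT> ... </TEXT>. The function name split_documents is hypothetical and this is not how secsgml is implemented; it is only a sketch of the structure the library has to deal with.

```python
import re

# Illustrative patterns only; real submissions have more header tags and edge cases.
DOCUMENT_RE = re.compile(r"<DOCUMENT>(.*?)</DOCUMENT>", re.DOTALL)
TEXT_RE = re.compile(r"<TEXT>(.*?)</TEXT>", re.DOTALL)
HEADER_RE = re.compile(r"^<(TYPE|SEQUENCE|FILENAME|DESCRIPTION)>(.*)$", re.MULTILINE)


def split_documents(sgml_text):
    """Return a list of (header_fields, body) pairs, one per <DOCUMENT> block."""
    documents = []
    for block in DOCUMENT_RE.findall(sgml_text):
        text_match = TEXT_RE.search(block)
        # Header tags appear before <TEXT>; each value runs to the end of its line.
        header_part = block[:text_match.start()] if text_match else block
        fields = {tag: value.strip() for tag, value in HEADER_RE.findall(header_part)}
        body = text_match.group(1).strip("\n") if text_match else ""
        documents.append((fields, body))
    return documents
```

A regex pass like this is enough to see the shape of a file, but it ignores the submission-level header, encoded binary attachments, and the variations mentioned in this README.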

All Variations

Installation

pip install secsgml

Quickstart

Parse into memory

from secsgml import parse_sgml_submission_into_memory
# returns the submission metadata and the parsed results without writing to disk
metadata, results = parse_sgml_submission_into_memory(filepath="000000443897000001.sgml")

Parse to file

from secsgml import parse_sgml_submission
# from file
parse_sgml_submission(filepath='samples/0000891618-94-000021.txt', output_dir='results')

# from content (sgml_content holds the raw SGML, already loaded by the caller)
parse_sgml_submission(content=sgml_content, output_dir='results')
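To process many files at once, the file-based call can be wrapped in a small loop. The sketch below is not part of secsgml: parse_directory, its pattern default, and the one-subdirectory-per-file layout are all assumptions made for illustration. The parser is passed in as an argument and is called with the filepath and output_dir keywords shown in the quickstart.

```python
from pathlib import Path


def parse_directory(input_dir, output_dir, parser, pattern="*.txt"):
    """Call `parser` once per matching file and return the paths processed.

    `parser` is any callable accepting `filepath` and `output_dir` keywords.
    """
    input_dir, output_dir = Path(input_dir), Path(output_dir)
    processed = []
    for path in sorted(input_dir.glob(pattern)):
        # Assumption: give each submission its own subdirectory to avoid name clashes.
        target = output_dir / path.stem
        target.mkdir(parents=True, exist_ok=True)
        parser(filepath=str(path), output_dir=str(target))
        processed.append(path)
    return processed
```

With the library installed, passing parse_sgml_submission as the parser argument, for example parse_directory('samples', 'results', parse_sgml_submission), should fit the keyword arguments used in the quickstart, though how the library arranges files inside output_dir is not covered here.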

Note

Development is currently focused on parse_sgml_submission_into_memory; parse_sgml_submission will be refactored afterwards.

Future

  • SGML Table parsing
  • Optimization and a refactor using Cython or C bindings.
  • Standardize metadata across file types. Both keys and values differ between variations: for example, the key 'CIK' vs 'CENTRAL INDEX KEY', and the value '34' vs '1934'.
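Until metadata is standardized in the library, a caller could normalize it after parsing. The following is a minimal sketch, assuming the metadata is a (possibly nested) dict with string keys. The alias tables and the function name normalize_metadata are hypothetical, and the 'SEC ACT' key used for the '34' vs '1934' case is a guess at where that value appears.

```python
# Hypothetical alias tables; extend them as new variations turn up.
KEY_ALIASES = {
    "CIK": "CENTRAL INDEX KEY",
}
VALUE_ALIASES = {
    # canonical key -> {variant value: canonical value}
    "SEC ACT": {"34": "1934"},  # assumed key name, for illustration only
}


def normalize_metadata(metadata):
    """Return a copy of `metadata` with canonical upper-case keys and values."""
    normalized = {}
    for key, value in metadata.items():
        cleaned_key = key.strip().upper()
        canonical_key = KEY_ALIASES.get(cleaned_key, cleaned_key)
        if isinstance(value, dict):
            # Recurse into nested sections such as filer or company data.
            value = normalize_metadata(value)
        elif isinstance(value, str):
            cleaned_value = value.strip()
            value = VALUE_ALIASES.get(canonical_key, {}).get(cleaned_value, cleaned_value)
        normalized[canonical_key] = value
    return normalized
```

Keeping the aliases in plain dicts means a new variation can be handled by adding a table entry rather than changing the function.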
