Making it easier to use SEC filings.

Project description

PyPI - Downloads GitHub

datamule

A python package to make using SEC filings easier. Integrated with datamule's APIs and datasets.

features

current:

monitor edgar for new filings
parse textual filings into simplified html, interactive html, or structured json.
download sec filings quickly and easily
download datasets such as every MD&A from 2024 or every 2024 10K converted to structured json

installation

pip install datamule

quickstart:

import datamule as dm
downloader = dm.Downloader()
downloader.download_using_api(form='10-K',ticker='AAPL')

documentation

indexer

indexer = dm.Indexer()

indexer.run() constructs the locations of filings for the downloader and stores it to 'data/submissions_index.csv'. If download is set to True, the indexer downloads the locations using the prebuilt indices from Dropbox. The prebuilt indices are typically updated every few days.

It uses the submissions endpoint, and submissions archive endpoint.

TODO: add indexer run option using EFTS endpoint

indexer.run(download=True)

indexer.watch() returns True when new filings are posted to edgar.

It uses the EFTS endpoint.

watching all companies and forms

print("Monitoring SEC EDGAR for changes...")
changed_bool = indexer.watch(1,silent=False)
if changed_bool:
    print("New filing detected!")

watching specific companies and forms

print("Monitoring SEC EDGAR for changes...")
changed_bool = indexer.watch(1,silent=False,cik=['0001267602','0001318605'],form=['3','S-8 POS'])
if changed_bool:
    print("New filing detected!")

TODO: add args for company and ticker.

downloader

downloader = dm.Downloader()

downloads and downloads_using_api

downloader.download() downloads filings using the indices.

# Example 1: Download all 10-K filings for Tesla using CIK
downloader.download(form='10-K', cik='1318605', output_dir='filings')

# Example 2: Download 10-K filings for Tesla and META using CIK
downloader.download(form='10-K', cik=['1318605','1326801'], output_dir='filings')

# Example 3: Download 10-K filings for Tesla using ticker
downloader.download(form='10-K', ticker='TSLA', output_dir='filings')

# Example 4: Download 10-K filings for Tesla and META using ticker
downloader.download(form='10-K', ticker=['TSLA','META'], output_dir='filings')

# Example 5: Download every form 3 for a specific date
downloader.download(form ='3', date='2024-05-21', output_dir='filings')

# Example 6: Download every 10K for a year
downloader.download(form='10-K', date=('2024-01-01', '2024-12-31'), output_dir='filings')

# Example 7: Download every form 4 for a list of dates
downloader.download(form = '4',date=['2024-01-01', '2024-12-31'], output_dir='filings')

downloader.download_using_api() downloads filings using the datamule API instead. For more information look at SEC Router.

It uses the datamule sec router endpoint.

downloader.download_using_api(form='10-K',ticker='AAPL')

Both functions operate mostly the same. If return_urls is set to True, returns filing primary document urls instead of downloading them. If human_readable = True, it will download human readable versions of the filings. For more information look at the Human Readable Jupyter Notebook

download_datasets

downloader.download_dataset('10K')
downloader.download_dataset('MDA')

Need a better way to store datasets, as I'm running out of storage. Currently stored on Dropbox 2gb free tier.

parsing

Uses endpoint: https://jgfriedman99.pythonanywhere.com/parse_url with params url and return_type. Current endpoint can be slow. If it's too slow for your use-case, please contact me.

simplified html

simplified_html = dm.parse_textual_filing(url='https://www.sec.gov/Archives/edgar/data/1318605/000095017022000796/tsla-20211231.htm',return_type='simplify')

Alt text Download Example

interactive html

interactive_html = dm.parse_textual_filing(url='https://www.sec.gov/Archives/edgar/data/1318605/000095017022000796/tsla-20211231.htm',return_type='interactive')

Alt text Download Example

json

d = dm.parse_textual_filing(url='https://www.sec.gov/Archives/edgar/data/1318605/000095017022000796/tsla-20211231.htm',return_type='json')

Alt text Download Example

TODO

standardize accession number to not include '-'. Currently db does not have '-' but submissions_index.csv does.
add code to convert parsed json to interactive html
add mulebot

Update Log

9/16/24 v0.26

added indexer.watch(interval,cik,form) to monitor when EDGAR updates. v0.25
added human_readable option to download, and download_using_api.

9/15/24

fixed downloading filings overwriting each other due to same name.

9/14/24

added support for parser API

9/13/24

added download_datasets
added option to download indices
added support for jupyter notebooks

9/9/24

added download_using_api(self, output_dir, **kwargs). No indices required.

9/8/24

Added integration with datamule's SEC Router API

9/7/24

Simplified indices approach
Switched from pandas to polar. Loading indices now takes under 500 milliseconds.

Project details

Release history Release notifications | RSS feed

3.6.4

Mar 24, 2026

3.6.3

Mar 19, 2026

3.6.1

Mar 19, 2026

3.6.0

Mar 16, 2026

3.5.2

Mar 11, 2026

3.5.1

Mar 9, 2026

3.5.0

Feb 26, 2026

3.4.3

Feb 26, 2026

3.4.1

Feb 26, 2026

3.4.0

Feb 26, 2026

3.3.0

Feb 2, 2026

3.2.9

Feb 2, 2026

3.2.8

Jan 27, 2026

3.2.7

Jan 22, 2026

3.2.6

Jan 22, 2026

3.2.5

Jan 18, 2026

3.2.4

Jan 17, 2026

3.2.3

Jan 17, 2026

3.2.1

Jan 12, 2026

3.2.0

Jan 12, 2026

3.1.1

Jan 9, 2026

3.1.0

Dec 31, 2025

3.0.6

Dec 28, 2025

3.0.5

Dec 25, 2025

3.0.4

Dec 14, 2025

3.0.3

Dec 13, 2025

3.0.2

Dec 13, 2025

3.0.1

Dec 10, 2025

3.0.0

Dec 9, 2025

2.4.3

Dec 7, 2025

2.4.2

Nov 9, 2025

2.4.1

Oct 10, 2025

2.4.0

Oct 2, 2025

2.3.9

Oct 1, 2025

2.3.8

Sep 30, 2025

2.3.7

Sep 26, 2025

2.3.6

Sep 26, 2025

2.3.5

Sep 19, 2025

2.3.4

Sep 17, 2025

2.3.3

Sep 15, 2025

2.3.2

Sep 15, 2025

2.3.0

Sep 12, 2025

2.2.9

Sep 12, 2025

2.2.8

Sep 11, 2025

2.2.7

Sep 11, 2025

2.2.6

Sep 11, 2025

2.2.5

Aug 31, 2025

2.2.4

Aug 26, 2025

2.2.3

Aug 25, 2025

2.2.2

Aug 25, 2025

2.2.1

Aug 25, 2025

2.2.0

Aug 25, 2025

2.1.6

Aug 20, 2025

2.1.5

Aug 17, 2025

2.1.4

Aug 16, 2025

2.1.3

Aug 16, 2025

2.1.2

Aug 4, 2025

2.1.1

Jul 31, 2025

2.1.0

Jul 30, 2025

2.0.9

Jul 29, 2025

2.0.8

Jul 29, 2025

2.0.7

Jul 28, 2025

2.0.6

Jul 28, 2025

2.0.5

Jul 27, 2025

2.0.4

Jul 26, 2025

2.0.3

Jul 25, 2025

2.0.2

Jul 24, 2025

2.0.1

Jul 24, 2025

2.0.0

Jul 23, 2025

1.9.0

Jul 23, 2025

1.8.6

Jul 18, 2025

1.8.5

Jul 16, 2025

1.8.4

Jul 16, 2025

1.8.3

Jul 14, 2025

1.8.2

Jul 12, 2025

1.8.1

Jul 10, 2025

1.8.0

Jul 10, 2025

1.7.1

Jul 9, 2025

1.7.0

Jul 3, 2025

1.6.9

Jun 30, 2025

1.6.8

Jun 30, 2025

1.6.7

Jun 29, 2025

1.6.6

Jun 29, 2025

1.6.5

Jun 29, 2025

1.6.4

Jun 24, 2025

1.6.3

Jun 24, 2025

1.6.2

Jun 24, 2025

1.6.1

Jun 22, 2025

1.6.0

Jun 22, 2025

1.5.9

Jun 13, 2025

1.5.8

Jun 12, 2025

1.5.6

Jun 12, 2025

1.5.5

Jun 12, 2025

1.5.4

Jun 10, 2025

1.5.3

Jun 2, 2025

1.5.2

May 27, 2025

1.5.1

May 27, 2025

1.5.0

May 27, 2025

1.4.9

May 26, 2025

1.4.6

May 26, 2025

1.4.5

May 25, 2025

1.4.4

May 24, 2025

1.4.3

May 24, 2025

1.4.2

May 22, 2025

1.4.0

May 18, 2025

1.3.1

May 7, 2025

1.3.0

May 7, 2025

1.2.9

May 4, 2025

1.2.8

May 4, 2025

1.2.7

Apr 30, 2025

1.2.6

Apr 28, 2025

1.2.5

Apr 21, 2025

1.2.4

Apr 19, 2025

1.2.3

Apr 18, 2025

1.2.2

Apr 18, 2025

1.2.1

Apr 18, 2025

1.2.0

Apr 9, 2025

1.1.8

Apr 8, 2025

1.1.7

Apr 4, 2025

1.1.6

Mar 29, 2025

1.1.5

Mar 24, 2025

1.1.1

Mar 23, 2025

1.1.0

Mar 23, 2025

1.0.9

Mar 23, 2025

1.0.8

Mar 23, 2025

1.0.7

Mar 23, 2025

1.0.6

Mar 23, 2025

1.0.3

Feb 12, 2025

1.0.2

Feb 6, 2025

1.0.0

Feb 5, 2025

0.430

Jan 7, 2025

0.429

Jan 4, 2025

0.428

Jan 4, 2025

0.427

Jan 2, 2025

0.426

Dec 31, 2024

0.424

Dec 28, 2024

0.423

Dec 26, 2024

0.422

Dec 26, 2024

0.421

Dec 26, 2024

0.420

Dec 26, 2024

0.418

Dec 18, 2024

0.417

Dec 18, 2024

0.416

Dec 18, 2024

0.415

Dec 18, 2024

0.414

Dec 18, 2024

0.413

Dec 18, 2024

0.411

Dec 26, 2024

0.410

Dec 18, 2024

0.408

Dec 18, 2024

0.407

Dec 18, 2024

0.405

Dec 18, 2024

0.401

Dec 18, 2024

0.400

Dec 16, 2024

0.381

Nov 18, 2024

0.380

Nov 18, 2024

0.379

Nov 18, 2024

0.378

Nov 5, 2024

0.377

Nov 2, 2024

0.376

Nov 1, 2024

0.374

Oct 30, 2024

0.373

Oct 29, 2024

0.372

Oct 29, 2024

0.371

Oct 29, 2024

0.369

Oct 29, 2024

0.368

Oct 29, 2024

0.367

Oct 29, 2024

0.366

Oct 28, 2024

0.365

Oct 28, 2024

0.364

Oct 28, 2024

0.363

Oct 25, 2024

0.362

Oct 25, 2024

0.361

Oct 25, 2024

0.360

Oct 25, 2024

0.357

Oct 24, 2024

0.356

Oct 24, 2024

0.355

Oct 24, 2024

0.352

Oct 21, 2024

0.351

Oct 18, 2024

0.350

Oct 17, 2024

0.343

Oct 17, 2024

0.342

Oct 16, 2024

0.341

Oct 16, 2024

0.340

Oct 15, 2024

0.339

Oct 15, 2024

0.338

Oct 15, 2024

0.337

Oct 15, 2024

0.336

Oct 14, 2024

0.335

Oct 14, 2024

0.334

Oct 13, 2024

0.333

Oct 13, 2024

0.332

Oct 6, 2024

0.331

Oct 6, 2024

0.330

Oct 3, 2024

0.323

Sep 27, 2024

0.320

Sep 27, 2024

0.314

Sep 26, 2024

0.312

Sep 21, 2024

0.311

Sep 19, 2024

0.302

Sep 19, 2024

0.301

Sep 18, 2024

0.29

Sep 18, 2024

This version

0.26

Sep 16, 2024

0.25

Sep 16, 2024

0.24

Sep 16, 2024

0.23

Sep 16, 2024

0.22

Sep 14, 2024

0.21

Sep 14, 2024

0.20

Sep 14, 2024

0.17

Sep 10, 2024

0.16

Sep 10, 2024

0.15

Sep 10, 2024

0.14

Sep 7, 2024

0.12

Sep 6, 2024

0.11

Sep 6, 2024

0.5.0

Feb 5, 2025

0.1

Sep 6, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datamule-0.26.tar.gz (13.5 kB view details)

Uploaded Sep 16, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

datamule-0.26-py3-none-any.whl (13.3 kB view details)

Uploaded Sep 16, 2024 Python 3

File details

Details for the file datamule-0.26.tar.gz.

File metadata

Download URL: datamule-0.26.tar.gz
Upload date: Sep 16, 2024
Size: 13.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.11.5

File hashes

Hashes for datamule-0.26.tar.gz
Algorithm	Hash digest
SHA256	`c242230245f497bfd8f177e9fdc13fdf2ff7c5c671dc1f81b8adf74cc25b82f4`
MD5	`806964e8f25d4aff141ef08eb2c389b3`
BLAKE2b-256	`d7de5955b38c6c6c114d463718af73373bee8f00476da0415b793b62e450cbdc`

See more details on using hashes here.

File details

Details for the file datamule-0.26-py3-none-any.whl.

File metadata

Download URL: datamule-0.26-py3-none-any.whl
Upload date: Sep 16, 2024
Size: 13.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.0 CPython/3.11.5

File hashes

Hashes for datamule-0.26-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4c2c7664b3c9286ac7589f6b1949e7a0ffa7df6f4e37b2f1b157a9b6883a6978`
MD5	`78d08fecc7fa88645fb6671659e217e3`
BLAKE2b-256	`7f8ddcddf9288de6909246251b3eb0ee4c0926f4c5821e33f20362131e7187a7`

See more details on using hashes here.

datamule 0.26

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

datamule

features

documentation

indexer

downloader

downloads and downloads_using_api

download_datasets

parsing

simplified html

interactive html

json

TODO

Update Log

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes