Skip to main content

Textual analysis on SEC filings from EDGAR

Project description

edgar-analyzer

edgar-analyzer is a CLI tool to download SEC filings from EDGAR and perform textual analyses.

Installation

pip install edgar-analyzer

Workflow

Setup

Download index files

edgar-analyzer download_index --user_agent "MyCompany name@mycompany.com" --output "./index"

Download filings (to be integrated)

edgar_analyzer download_filings

Build database (to be integrated)

edgar_analyzer build_database

Run specific jobs

These tasks can be executed once the database of filings is built.

Find event date

 edgar-analyzer find_event_date -h
usage: edgar-analyzer [OPTION]... find_event_date [-h] -d data_directory --file_type file_type [-db databsae] [-t threads]

Find event date from filings from header data

options:
  -h, --help            show this help message and exit
  -t threads, --threads threads
                        number of processes to use

required named arguments:
  -d data_directory, --data_dir data_directory
                        directory of filings
  --file_type file_type
                        type of filing
  -db databsae, --database databsae
                        sqlite database to store results

Find reported items

 edgar-analyzer find_reported_items -h
usage: edgar-analyzer [OPTION]... find_reported_items [-h] -d data_directory --file_type file_type [-db databsae] [-t threads]

Find reported items from filings from header data

options:
  -h, --help            show this help message and exit
  -t threads, --threads threads
                        number of processes to use

required named arguments:
  -d data_directory, --data_dir data_directory
                        directory of filings
  --file_type file_type
                        type of filing
  -db databsae, --database databsae
                        sqlite database to store results

more to be integrated

Example

Just a simple example of the job find_event_date. Based on the 1,491,368 8K filings (2004-2022), the table below shows the reporting lags (date of filing minus date of event).

We can find that most filings are filed on the same day as the event reported, and that over 99.99% of filings are filed within 4 calendar days (SEC requires 4 business days).

Filing lag (calendar days) Frequency Percentage Cumulative
0 1470089 98.57% 98.57%
1 20761 1.39% 99.97%
2 285 0.02% 99.98%
3 89 0.01% 99.99%
4 47 0.00% 99.99%
5 26 0.00% 100.00%
6 14 0.00% 100.00%
7 6 0.00% 100.00%
8 4 0.00% 100.00%
9 3 0.00% 100.00%
10 or more 44 0.00% 100.00%

Note

This tool is a work in progress and breaking changes may be expected.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edgar-analyzer-0.0.1rc2.tar.gz (9.5 kB view details)

Uploaded Source

File details

Details for the file edgar-analyzer-0.0.1rc2.tar.gz.

File metadata

  • Download URL: edgar-analyzer-0.0.1rc2.tar.gz
  • Upload date:
  • Size: 9.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for edgar-analyzer-0.0.1rc2.tar.gz
Algorithm Hash digest
SHA256 af79155dc771573ae15c5f766402a53ab9afe19929ba949a1ddf65030f4166f6
MD5 5608fe3acdf250ba970061410b1dd089
BLAKE2b-256 41ca5d6eedc3754a1aee6bdc02733df9ffd50b38bd4cec86591b05fd2498db06

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page