Textual analysis on SEC filings from EDGAR
edgar-analyzer
edgar-analyzer is a CLI tool that downloads SEC filings from EDGAR and performs textual analyses on them.
Installation
pip install edgar-analyzer
Workflow
Setup
Download index files
edgar-analyzer download_index --user_agent "MyCompany name@mycompany.com" --output "./index"
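For orientation, EDGAR publishes quarterly index files under https://www.sec.gov/Archives/edgar/full-index/. The sketch below shows how a downloader might enumerate those URLs and attach the declared User-Agent. It is an illustration under assumptions, not edgar-analyzer's actual implementation: the helper names `index_urls` and `build_request` are hypothetical, and the choice of `master.idx` is assumed.

```python
from urllib.request import Request

# Base location of EDGAR's quarterly full-index files.
BASE = "https://www.sec.gov/Archives/edgar/full-index"

def index_urls(start_year, end_year, name="master.idx"):
    """Return one index URL per quarter from start_year to end_year inclusive.

    Hypothetical helper; the index file name is an assumption.
    """
    return [
        f"{BASE}/{year}/QTR{quarter}/{name}"
        for year in range(start_year, end_year + 1)
        for quarter in range(1, 5)
    ]

def build_request(url, user_agent):
    """Build (but do not send) a request carrying the given User-Agent."""
    return Request(url, headers={"User-Agent": user_agent})

urls = index_urls(2004, 2005)
req = build_request(urls[0], "MyCompany name@mycompany.com")
print(urls[0])
print(req.get_header("User-agent"))  # urllib stores header names capitalized
```

Nothing is sent over the network here; passing `req` to `urllib.request.urlopen` would perform the actual download.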
Download filings (to be integrated)
edgar-analyzer download_filings
Build database (to be integrated)
edgar-analyzer build_database
Run specific jobs
These tasks can be executed once the database of filings is built.
Find event date
❯ edgar-analyzer find_event_date -h
usage: edgar-analyzer [OPTION]... find_event_date [-h] -d data_directory --file_type file_type [-db databsae] [-t threads]

Find event date from filings from header data

options:
  -h, --help            show this help message and exit
  -t threads, --threads threads
                        number of processes to use

required named arguments:
  -d data_directory, --data_dir data_directory
                        directory of filings
  --file_type file_type
                        type of filing
  -db databsae, --database databsae
                        sqlite database to store results
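The help text only says the event date comes from header data. As a rough sketch of what that could involve, the snippet below reads the `CONFORMED PERIOD OF REPORT` and `FILED AS OF DATE` fields that appear in the header of an EDGAR full-submission text file. Treating the period of report as the event date is an assumption about this job, the function name `parse_dates` is hypothetical, and the sample header is made up for illustration.

```python
import re
from datetime import datetime

# Header fields as they appear in EDGAR full-submission text files.
PERIOD_RE = re.compile(r"CONFORMED PERIOD OF REPORT:\s*(\d{8})")
FILED_RE = re.compile(r"FILED AS OF DATE:\s*(\d{8})")

def _to_date(yyyymmdd):
    """Convert a YYYYMMDD string to a date object."""
    return datetime.strptime(yyyymmdd, "%Y%m%d").date()

def parse_dates(header_text):
    """Return (event_date, filing_date), or None if either field is missing."""
    period = PERIOD_RE.search(header_text)
    filed = FILED_RE.search(header_text)
    if not (period and filed):
        return None
    return _to_date(period.group(1)), _to_date(filed.group(1))

# Made-up header fragment, for illustration only.
sample = (
    "CONFORMED SUBMISSION TYPE:\t8-K\n"
    "CONFORMED PERIOD OF REPORT:\t20200128\n"
    "FILED AS OF DATE:\t\t20200129\n"
)
event_date, filing_date = parse_dates(sample)
lag_days = (filing_date - event_date).days  # filing date minus event date
print(event_date, filing_date, lag_days)
```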
Find reported items
❯ edgar-analyzer find_reported_items -h
usage: edgar-analyzer [OPTION]... find_reported_items [-h] -d data_directory --file_type file_type [-db databsae] [-t threads]

Find reported items from filings from header data

options:
  -h, --help            show this help message and exit
  -t threads, --threads threads
                        number of processes to use

required named arguments:
  -d data_directory, --data_dir data_directory
                        directory of filings
  --file_type file_type
                        type of filing
  -db databsae, --database databsae
                        sqlite database to store results
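In the same spirit, 8-K headers in EDGAR full-submission text files list each reported item on an `ITEM INFORMATION:` line. The sketch below collects those lines. Whether find_reported_items works exactly this way is an assumption, `parse_items` is a hypothetical name, and the sample header is made up.

```python
import re

# One "ITEM INFORMATION:" line per reported item in an 8-K header.
ITEM_RE = re.compile(r"^ITEM INFORMATION:[ \t]*(.+?)[ \t]*$", re.MULTILINE)

def parse_items(header_text):
    """Return the reported item descriptions in the order they appear."""
    return ITEM_RE.findall(header_text)

# Made-up header fragment, for illustration only.
sample = (
    "CONFORMED SUBMISSION TYPE:\t8-K\n"
    "ITEM INFORMATION:\t\tResults of Operations and Financial Condition\n"
    "ITEM INFORMATION:\t\tFinancial Statements and Exhibits\n"
    "FILED AS OF DATE:\t\t20200129\n"
)
items = parse_items(sample)
print(items)
```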
More jobs are to be integrated.
Example
A simple example of the find_event_date job: based on 1,491,368 8-K filings (2004-2022), the table below shows the reporting lag, defined as the filing date minus the event date.
Most filings (98.57%) are filed on the same day as the reported event, and over 99.99% are filed within 4 calendar days. For comparison, the SEC deadline for Form 8-K is generally 4 business days after the event.
Filing lag (calendar days) | Frequency | Percentage | Cumulative |
---|---|---|---|
0 | 1470089 | 98.57% | 98.57% |
1 | 20761 | 1.39% | 99.97% |
2 | 285 | 0.02% | 99.98% |
3 | 89 | 0.01% | 99.99% |
4 | 47 | 0.00% | 99.99% |
5 | 26 | 0.00% | 100.00% |
6 | 14 | 0.00% | 100.00% |
7 | 6 | 0.00% | 100.00% |
8 | 4 | 0.00% | 100.00% |
9 | 3 | 0.00% | 100.00% |
10 or more | 44 | 0.00% | 100.00% |
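The Percentage and Cumulative columns follow directly from the frequencies. As a quick check, the sketch below recomputes them from the counts in the table; the function name `lag_table` is hypothetical and not part of edgar-analyzer.

```python
def lag_table(counts):
    """Given (label, frequency) pairs in ascending lag order, return rows of
    (label, frequency, percentage, cumulative percentage)."""
    total = sum(freq for _, freq in counts)
    rows, running = [], 0
    for label, freq in counts:
        running += freq
        rows.append((label, freq, 100 * freq / total, 100 * running / total))
    return rows

# Frequencies copied from the table above.
counts = [
    ("0", 1470089), ("1", 20761), ("2", 285), ("3", 89), ("4", 47),
    ("5", 26), ("6", 14), ("7", 6), ("8", 4), ("9", 3), ("10 or more", 44),
]
rows = lag_table(counts)
for label, freq, pct, cum in rows:
    print(f"{label} | {freq} | {pct:.2f}% | {cum:.2f}% |")
```

The frequencies sum to 1,491,368, matching the number of filings stated above.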
Note
This tool is a work in progress and breaking changes may be expected.