An enhanced version of sec-edgar-downloader with caching capabilities and in-memory data retrieval. Designed for efficient SEC EDGAR document processing.
Project description
sec-downloader
Install
pip install sec_downloader
Features
- Instead of being saved to disk, files are directly downloaded into memory.
- Use “glob” pattern to select which files are read to memory.
How to use
Downloading multiple documents:
from sec_edgar_downloader import Downloader
from sec_downloader import DownloadStorage
storage = DownloadStorage()
with storage as path:
dl = Downloader("MyCompanyName", "email@example.com", path)
dl.get("10-K", "GOOG", limit=2)
for path, content in storage.get_file_contents():
print(f"Path: {path}\nContent [len={len(content)}]: {content[:30]}...\n")
Path: sec-edgar-filings/GOOG/10-K/0001652044-22-000019/full-submission.txt
Content [len=15044932]: <SEC-DOCUMENT>0001652044-22-00...
Path: sec-edgar-filings/GOOG/10-K/0001652044-23-000016/full-submission.txt
Content [len=15264470]: <SEC-DOCUMENT>0001652044-23-00...
Let’s demonstrate how to download a single file (latest 10-Q filing details in HTML format) to memory.
ONLY_HTML = "**/*.htm*"
storage = DownloadStorage(filter_pattern=ONLY_HTML)
with storage as path:
dl = Downloader("MyCompanyName", "email@example.com", path)
dl.get("10-Q", "AAPL", limit=1, download_details=True)
content = storage.get_file_contents()[0].content
print(f"{content[:50]}...")
<?xml version="1.0" ?><!--XBRL Document Created wi...
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sec-downloader-0.0.3.tar.gz
(5.2 kB
view hashes)
Built Distribution
Close
Hashes for sec_downloader-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 58ea1ca5f5b694ef932e3ed7eb17c2a3f5e4e9b3b28c37847e0706c1173c06fd |
|
MD5 | 9362a1fe9d032413e70d361ce6405b04 |
|
BLAKE2b-256 | 6bb6ae129a229607e5c5ee156f20e4ad33f1658fae5171fad95e98a28d8eb2a2 |