A robust and efficient parser for SEC filings, designed to extract and analyze financial data with ease.
Project description
Overview
The sec-parser
project simplifies the process of extracting meaningful information from SEC EDGAR HTML documents. It organizes the document's source code into a list or tree of elements that correspond to the visual structure of the document. This includes distinct elements for section titles, paragraphs, and tables, making the data easier to analyze and understand.
This tool is especially beneficial for Artificial Intelligence (AI) and Large Language Models (LLM) applications. It significantly improves the efficiency of data extraction and analysis in these fields.
Installation
You can install sec-parser
using pip:
pip install sec-parser
Usage
import sec_parser as sp
tree = sp.parse_latest("10-K", ticker="AAPL")
# Show the general structure of the tree
print(tree.render())
Console output:
RootSectionElement: PART I — FINANCIAL INFORMATION
├── TitleElement: Item 1. Financial Statements
│ ├── TitleElement: CONDENSED CONSOLIDATED STATEMENTS OF OPERATIONS (U...
│ │ ├── TextElement: (In millions, except number of shares which are re...
│ │ ├── TableElement: ...
│ ...
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for sec_parser-0.8.0.post7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 41d14070b6604c05d1bfc6f28e855e03c785fb6a1566d6955beadd8372a81797 |
|
MD5 | 0816789b65dc99659d860c3198eb0ace |
|
BLAKE2b-256 | 3f193bcaa2e481ad8a06e78135bbd9a581e27ba2e6746bb4aa1a30ea79b476aa |