
Financial data scraper core API - lightweight programmatic access

FinPull Core - Financial Data Scraper API

Requires Python 3.7+. License: MIT.

FinPull Core is a lightweight financial data scraping library that provides programmatic access to 27 financial metrics per ticker. The package contains only the essential API functionality, which makes it suitable for web applications, microservices, and minimal deployments.

Installation

pip install finpull-core

Quick Start

from finpull_core import FinancialDataAPI

# Initialize the API
api = FinancialDataAPI()

# Add a ticker for tracking
result = api.add_ticker("AAPL")
print(result)
# {'success': True, 'message': 'Added AAPL', 'ticker': 'AAPL'}

# Retrieve financial data
data = api.get_data("AAPL")
if data['success']:
    stock_data = data['data']
    print(f"Company: {stock_data['company_name']}")
    print(f"Price: ${stock_data['price']}")
    print(f"P/E Ratio: {stock_data['pe_ratio']}")

# Batch operations
tickers = ["AAPL", "GOOGL", "MSFT", "TSLA"]
results = api.batch_add_tickers(tickers)

# Export data
export_result = api.export_data("json")

Performance

Benchmarked on typical hardware with 10 test runs:

Metric          Value              Details
Package Size    21.9 KB (wheel)    Compressed distribution file
Installed Size  134 KB             Uncompressed on disk
Import Time     0.0002s (cached)   Cold import: 0.9s, cached: 0.0002s
Dependencies    3 packages         requests, beautifulsoup4, yfinance
API Response    100-500ms          Per ticker, varies by data source

Performance Characteristics

  • First Import: ~0.9 seconds (includes dependency loading)
  • Subsequent Imports: ~0.0002 seconds (module cache)
  • Standard Deviation: 0.3s across the 10 runs (driven by the one-off cold import)
  • Consistency: Very stable after initial load
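
The gap between a first and a repeated import can be observed with a small timing helper. This is a minimal sketch, not the benchmark script behind the table above: `time_import` is a hypothetical helper, and the standard-library `json` module stands in for `finpull_core` so that the snippet runs without the package installed. A fully cold measurement would also need a fresh interpreter, since submodules and dependencies stay cached here.

```python
import importlib
import sys
import time


def time_import(module_name: str) -> float:
    """Return the seconds taken by one import of module_name."""
    start = time.perf_counter()
    importlib.import_module(module_name)
    return time.perf_counter() - start


# Stand-in module; substitute "finpull_core" once the package is installed.
name = "json"

# Drop the cached entry so the top-level module is executed again.
sys.modules.pop(name, None)
first = time_import(name)

# The repeated import is answered from the sys.modules cache.
repeated = time_import(name)

print(f"first: {first:.6f}s, repeated: {repeated:.6f}s")
```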

Optimization Features

  • Local Caching: Automatic data caching reduces API calls
  • Rate Limiting: Built-in throttling prevents API blocks (1 req/sec default)
  • Lazy Loading: Components load only when needed
  • Efficient Storage: JSON-based local storage with minimal overhead
  • Web Compatible: Works in browser environments via Pyodide/WASM
  • Batch Operations: Process multiple tickers efficiently
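
How caching and throttling work together can be pictured with a generic sketch. It is not FinPull's internal code: `ThrottledCache`, `min_interval`, and the `fetch` callable are hypothetical names, and the 1-second default only mirrors the documented default rate limit.

```python
import time
from typing import Any, Callable, Dict, Optional


class ThrottledCache:
    """Cache results per key and space out calls to a slow fetch function."""

    def __init__(self, fetch: Callable[[str], Any], min_interval: float = 1.0):
        self._fetch = fetch
        self._min_interval = min_interval
        self._cache: Dict[str, Any] = {}
        self._last_call: Optional[float] = None  # time of the last real fetch

    def get(self, key: str) -> Any:
        # Cached keys are returned immediately without calling fetch.
        if key in self._cache:
            return self._cache[key]

        # Sleep just long enough to keep min_interval between real fetches.
        if self._last_call is not None:
            wait = self._min_interval - (time.monotonic() - self._last_call)
            if wait > 0:
                time.sleep(wait)

        value = self._fetch(key)
        self._last_call = time.monotonic()
        self._cache[key] = value
        return value


calls = []


def fake_fetch(ticker: str) -> dict:
    """Stand-in for a network request; records which tickers were fetched."""
    calls.append(ticker)
    return {"ticker": ticker}


source = ThrottledCache(fake_fetch, min_interval=0.05)
source.get("AAPL")
source.get("AAPL")  # served from the cache, no second fetch
source.get("MSFT")  # waits out the interval, then fetches
print(calls)  # ['AAPL', 'MSFT']
```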

Configuration

Environment Variables

# Custom storage location
export FINPULL_STORAGE_FILE="/path/to/custom/storage.json"

# Rate limiting (seconds between requests)
export FINPULL_RATE_LIMIT="2"

Storage

Data is stored locally in JSON format:

  • Linux/macOS: ~/.finpull/data.json
  • Windows: %USERPROFILE%\.finpull\data.json
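
Taken together, the environment variables and the default location suggest a lookup along these lines. This is a sketch of the documented behaviour rather than the package's own code: `resolve_storage_path` and `resolve_rate_limit` are hypothetical helpers, and treating an unset `FINPULL_RATE_LIMIT` as 1 second is an assumption based on the default quoted under Optimization Features.

```python
import os
from pathlib import Path
from typing import Mapping


def resolve_storage_path(env: Mapping[str, str] = os.environ) -> Path:
    """Use FINPULL_STORAGE_FILE if set, otherwise ~/.finpull/data.json."""
    override = env.get("FINPULL_STORAGE_FILE")
    if override:
        return Path(override).expanduser()
    # Path.home() is the user's home directory on Linux, macOS, and Windows.
    return Path.home() / ".finpull" / "data.json"


def resolve_rate_limit(env: Mapping[str, str] = os.environ) -> float:
    """Seconds between requests; falls back to the assumed 1-second default."""
    return float(env.get("FINPULL_RATE_LIMIT", "1"))


print(resolve_storage_path())
print(resolve_rate_limit())
```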

Data Coverage

FinPull Core provides 27 financial metrics per ticker including price, P/E ratio, market cap, earnings data, profitability ratios, and growth metrics. Data is sourced from Finviz and Yahoo Finance with automatic fallback for high availability.
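
The fallback between sources can be sketched generically. The fetchers below are hypothetical stand-ins (the actual Finviz and Yahoo Finance scrapers are internal to the package), and trying the sources in a fixed order is an assumption.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Fetcher = Callable[[str], Dict[str, object]]


def fetch_with_fallback(ticker: str, sources: Sequence[Tuple[str, Fetcher]]) -> dict:
    """Try each source in order and return the first successful result."""
    errors: List[str] = []
    for name, fetch in sources:
        try:
            return {"success": True, "source": name, "data": fetch(ticker)}
        except Exception as exc:
            # Remember why this source failed, then move on to the next one.
            errors.append(f"{name}: {exc}")
    return {"success": False, "error": "; ".join(errors)}


def broken_source(ticker: str) -> Dict[str, object]:
    """Hypothetical source that always fails."""
    raise ConnectionError("timed out")


def working_source(ticker: str) -> Dict[str, object]:
    """Hypothetical source; the price is a placeholder, not market data."""
    return {"ticker": ticker, "price": 0.0}


result = fetch_with_fallback(
    "AAPL", [("primary", broken_source), ("secondary", working_source)]
)
print(result["source"])  # secondary
```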

Dependencies

Required Packages

  • requests (≥2.25.1): HTTP client for data fetching
  • beautifulsoup4 (≥4.9.3): HTML parsing for web scraping
  • yfinance (≥0.1.63): Yahoo Finance API integration

Optional Dependencies

  • psutil: For memory usage monitoring (development/testing)
  • openpyxl: Excel export (available in full finpull package)

Documentation

Web Integration

FinPull Core runs in browser environments via Pyodide and can be called from Node.js through subprocess calls. See the main repository README for integration examples.
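
For the Node.js route, one plausible shape is a small Python entry point that prints a single JSON document to stdout for the parent process to parse. This is an assumption about how such a bridge could look, not the integration shipped in the repository: `handle_request`, `main`, and the `get` command name are hypothetical, while `get_data` is the call shown in Quick Start.

```python
import json
import sys


def handle_request(api, argv):
    """Run one command against api and return a JSON string for stdout."""
    if len(argv) != 2 or argv[0] != "get":
        return json.dumps({"success": False, "error": "usage: get <TICKER>"})
    try:
        # default=str keeps non-JSON values (e.g. dates) from raising.
        return json.dumps(api.get_data(argv[1]), default=str)
    except Exception as exc:
        return json.dumps({"success": False, "error": str(exc)})


def main():
    # Imported here so handle_request can be exercised without the package.
    from finpull_core import FinancialDataAPI

    print(handle_request(FinancialDataAPI(), sys.argv[1:]))
```

A real script would call `main()` under an `if __name__ == "__main__":` guard; the Node.js side could then spawn it with `child_process` and pass the captured stdout to `JSON.parse`.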

Examples

Basic Usage

from finpull_core import FinancialDataAPI

api = FinancialDataAPI()

# Add and retrieve data
api.add_ticker("AAPL")
data = api.get_data("AAPL")
# Check the success flag before reading the payload
if data['success']:
    print(f"Price: ${data['data']['price']}")

Batch Operations

# Add multiple tickers
tickers = ["AAPL", "GOOGL", "MSFT", "TSLA"]
results = api.batch_add_tickers(tickers)
print(f"Added {results['summary']['added_count']} tickers")

# Get all data
all_data = api.get_data()
for stock in all_data['data']:
    print(f"{stock['ticker']}: ${stock['price']} (P/E: {stock['pe_ratio']})")

Data Export

# Export to different formats
json_result = api.export_data("json", "portfolio.json")
csv_result = api.export_data("csv", "portfolio.csv")
print(f"Exported {json_result['record_count']} records")

Error Handling

try:
    result = api.add_ticker("INVALID_TICKER")
    if not result['success']:
        print(f"Error: {result['error']}")
except Exception as e:
    print(f"Exception: {e}")
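
Since the calls in this README report failures through a `success` flag, the check can be centralised in one place. A minimal sketch, assuming failed responses carry an `error` key as in the snippet above; `FinPullError` and `unwrap` are hypothetical names, not part of the package.

```python
class FinPullError(RuntimeError):
    """Raised when a response dictionary reports success == False."""


def unwrap(result: dict) -> dict:
    """Return result unchanged if it succeeded, otherwise raise FinPullError."""
    if not result.get("success"):
        raise FinPullError(result.get("error", "unknown error"))
    return result


# Response shaped like the add_ticker output shown in Quick Start.
ok = unwrap({"success": True, "message": "Added AAPL", "ticker": "AAPL"})
print(ok["ticker"])  # AAPL

try:
    # Illustrative failure payload; the error text is made up.
    unwrap({"success": False, "error": "ticker not found"})
except FinPullError as exc:
    print(f"Error: {exc}")  # Error: ticker not found
```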

Package Options

For complete functionality including CLI and GUI, use the full package:

pip install finpull

Or switch from core to full package:

pip uninstall finpull-core
pip install finpull

Package Comparison

Feature          finpull-core  finpull
Size             21.9 KB       27.2 KB
API Access       Yes           Yes
CLI Interface    No            Yes
GUI Application  No            Yes
Excel Export     No            Yes
Web Compatible   Yes

All API calls are identical between packages - simply change the import statement when switching.

License

MIT License - see LICENSE file for details.
