Intelligent financial modeling platform for small businesses
Project description
WACCY
Intelligent Financial Modeling Platform for Small Businesses
WACCY is an intelligent financial modeling platform designed to automatically extract, parse, classify, and synthesize business data from diverse sources to generate sophisticated, production-grade financial models and operating analyses. The platform's primary focus is on small businessesโfrom sole proprietorships to growing companiesโthat struggle with messy, incomplete, and poorly-maintained financial records.
Unlike large enterprises with dedicated accounting teams, small businesses often have inconsistent record-keeping, ambiguous account classifications, incomplete data, and limited financial infrastructure. WACCY transforms this raw, often chaotic business data into comprehensive, auditable, and decision-ready financial models that adhere to institutional-quality standards.
๐ Quick Start
Installation
WACCY uses uv, the modern Python package manager. Install the core platform:
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install WACCY core platform
uv pip install waccy
# Install with core extensions (QuickBooks Online and SEC EDGAR)
uv pip install "waccy[quickbooks,edgar]"
# Or install individual extensions
uv pip install waccy-quickbooks
uv pip install waccy-edgar
Basic Usage
from waccy.extraction import ExtractorRegistry
from waccy.modeling import ModelBuilder
# Discover available extractors
registry = ExtractorRegistry()
available_sources = registry.list_extractors()
print(f"Available data sources: {available_sources}")
# Extract data from QuickBooks Online
quickbooks_extractor = registry.get_extractor("quickbooks")
extracted_data = quickbooks_extractor().extract({
"company_id": "your_company_id",
"date_range": ("2023-01-01", "2024-12-31")
})
# Build a 3-statement financial model
builder = ModelBuilder()
model = builder.build_three_statement_model(
extracted_data=extracted_data,
forecast_periods=12
)
# Export to Google Sheets
builder.export_to_sheets(model, output_path="financial_model.xlsx")
๐ Core Features
๐ค AI-Powered Data Extraction & Classification
- Intelligent Account Mapping: LLM-enhanced classification for ambiguous account names and inconsistent categorizations
- Missing Data Inference: Patterns learned from high-quality sources (EDGAR filings) to infer missing information
- Causal Chain Recognition: Understands how financial events cascade through statements
- Terminology Normalization: Maps equivalent concepts across different naming conventions
- Confidence Scoring: Every classification includes a confidence score for quality assessment
๐๏ธ Standardized Financial Ontology
- WACCY Standard Chart of Accounts: Comprehensive, standardized classification ontology
- Universal Mapping: All data sources map to the same standard accounts for consistency
- Industry Templates: Pre-configured extensions for SaaS, manufacturing, retail, real estate, and more
- Quality Quantification: Standardized ontology enables measurement of data quality and model reliability
๐ Institutional-Quality Financial Models
- 3-Statement Integrated Models: Income statement, balance sheet, and cash flow with full balancing
- DCF Valuation Models: Detailed free cash flow construction, terminal value analysis, WACC calculation
- Trading Comparables: Peer company analysis with multiple calculation and benchmarking
- Transaction Comparables: Precedent M&A transaction analysis
- LBO Models: Leveraged buyout analysis with returns calculation and debt schedules
- M&A Models: Accretion/dilution analysis with purchase accounting
- Specialized Models: SaaS cohort models, REIT models, project finance, cap tables, and more
๐ Modular Extension Architecture
- Core Platform: Focused core with QuickBooks Online and SEC EDGAR as primary data sources
- Extensible Design: Community-developed extension packages for additional data sources
- Entry Point Discovery: Automatic discovery of extensions via Python entry points
- Simple Installation: Install only the extensions you need
๐ Advanced Forecasting & Analysis
- Driver-Based Forecasting: Revenue builds from price ร volume, units ร ARPU, cohort models
- Working Capital Modeling: DSO, DIO, DPO analysis with seasonality adjustments
- Debt & Interest Modeling: Revolver mechanics, amortization schedules, cash sweep waterfalls
- Tax Modeling: Effective tax rates, DTAs/DTLs, NOL utilization tracking
- Sensitivity Analysis: Comprehensive scenario modeling and tornado charts
โ Data Quality & Validation
- Pandera Validation: Schema-based validation for extracted data
- Mapping Confidence Scores: Quantify the reliability of account mappings
- Reconciliation Checks: Cross-referencing between sources and balance checks
- Audit Trails: Every data point traceable to source with full provenance
๐ Professional Model Outputs
- Google Sheets Export: Production-ready spreadsheet models with proper formatting
- Professional Architecture: Modular tab structures, consistent time axis, clear sign conventions
- Color Conventions: Inputs in blue, calculations in black, outputs in green
- Balance Checks: Built-in reconciliation tables and error flags
- Scenario Tooling: Data tables for sensitivity analysis, scenario toggles, goal seek integration
๐ Complete Workflow
from waccy.extraction import ExtractorRegistry
from waccy.classification import ClassificationEngine
from waccy.modeling import ModelBuilder
from waccy.core.ontology import StandardChartOfAccounts
# 1. Extract data from QuickBooks Online (handles messy, incomplete records)
registry = ExtractorRegistry()
extractor = registry.get_extractor("quickbooks")()
credentials = {
"client_id": "your_client_id",
"client_secret": "your_client_secret",
"access_token": "your_access_token"
}
extractor.authenticate(credentials)
extracted_data = extractor.extract({
"company_id": "123456789",
"date_range": ("2022-01-01", "2024-12-31"),
"include_transactions": True
})
# 2. Classify and map to standard accounts (with LLM enhancement for ambiguity)
ontology = StandardChartOfAccounts()
classification_engine = ClassificationEngine()
for account in extracted_data.accounts:
mapped_account, confidence = classification_engine.classify_account(
source_account_name=account.name,
transaction_patterns=account.transaction_history,
context={"company_type": "SaaS", "industry": "Software"}
)
print(f"Mapped '{account.name}' to '{mapped_account.name}' (confidence: {confidence:.2f})")
# 3. Build 3-statement integrated model
builder = ModelBuilder()
model = builder.build_three_statement_model(
extracted_data=extracted_data,
forecast_periods=24
)
# 4. Generate DCF valuation
dcf_model = builder.build_dcf_model(
three_statement_model=model,
wacc=0.10,
terminal_growth_rate=0.03,
exit_multiple=12.0
)
# 5. Export to Google Sheets
builder.export_to_sheets(model, output_path="financial_model.xlsx")
builder.export_to_sheets(dcf_model, output_path="dcf_valuation.xlsx")
# 6. Generate quality report
quality_report = extracted_data.generate_quality_report()
print(f"Data completeness: {quality_report.completeness:.2%}")
print(f"Average mapping confidence: {quality_report.avg_confidence:.2f}")
print(f"Issues flagged: {len(quality_report.issues)}")
๐ Example Output
Standardized Account Mapping
Source Account: "Sales Revenue" (QuickBooks)
Mapped To: Revenue - Product Sales (WACCY Standard)
Confidence: 0.95
Validation: โ
Transaction patterns match revenue recognition
3-Statement Model Structure
Financial Model.xlsx
โโโ Assumptions
โ โโโ Revenue Drivers
โ โโโ Cost Assumptions
โ โโโ Working Capital
โโโ Income Statement
โ โโโ Historical (3 years)
โ โโโ Forecast (2 years)
โโโ Balance Sheet
โ โโโ Assets (Current & Non-Current)
โ โโโ Liabilities (Current & Non-Current)
โ โโโ Equity
โโโ Cash Flow Statement
โ โโโ Operating Activities
โ โโโ Investing Activities
โ โโโ Financing Activities
โโโ Supporting Schedules
โ โโโ Working Capital Detail
โ โโโ Debt Schedule
โ โโโ Depreciation
โโโ Checks & Reconciliations
โโโ Balance Checks
โโโ Quality Metrics
๐ Core Data Sources
QuickBooks Online (QBO)
Primary Data Source - The accounting system most commonly used by small businesses.
- Direct API integration for chart of accounts, general ledger, and financial statements
- Intelligent handling of ambiguous, inconsistently-named accounts
- Transaction-level detail extraction
- Vendor and customer data integration
- Skeptical treatment of source classifications with validation
SEC EDGAR
Pattern Learning & Reference Data - High-quality financial data for learning and benchmarking.
- Automated parsing of 10-K, 10-Q, 8-K filings
- Proxy statement and registration statement processing
- Pattern extraction for proper financial classification
- Learning causal chains from professional financial reports
- Application of learned patterns to small business data
Extension Packages
Additional data sources available as modular extensions:
waccy-google- Google Drive and Gmail integrationwaccy-xero- Xero accounting systemwaccy-sage- Sage accounting platformwaccy-stripe- Payment processor datawaccy-salesforce- CRM and sales pipeline data
๐ Project Structure
WACCY uses a monorepo structure where the core platform and first-party extensions are maintained together but built and published separately.
waccy/
โโโ src/
โ โโโ waccy/ # Core platform
โ โโโ core/
โ โ โโโ ontology.py # Standardized chart of accounts
โ โ โโโ models.py # Core data models (Pydantic)
โ โ โโโ validation.py # Data validation (Pandera)
โ โโโ extraction/
โ โ โโโ base.py # Abstract base classes
โ โ โโโ registry.py # Extension registry
โ โ โโโ mapper.py # Mapping to standard ontology
โ โโโ classification/
โ โ โโโ engine.py # LLM-enhanced classification
โ โ โโโ patterns.py # Pattern matching from EDGAR
โ โ โโโ confidence.py # Confidence scoring
โ โโโ modeling/
โ โ โโโ builder.py # Model construction
โ โ โโโ templates.py # Model templates
โ โ โโโ exporters.py # Google Sheets export
โ โโโ utils/
โ โโโ dates.py
โ โโโ formatting.py
โ โโโ validation.py
โโโ extensions/ # First-party extensions (monorepo)
โ โโโ waccy-quickbooks/ # QuickBooks Online integration
โ โ โโโ src/waccy_quickbooks/
โ โ โโโ pyproject.toml
โ โโโ waccy-edgar/ # SEC EDGAR filing parser
โ โโโ src/waccy_edgar/
โ โโโ pyproject.toml
โโโ tests/
โ โโโ unit/
โ โโโ integration/
โ โโโ fixtures/
โโโ scripts/
โ โโโ publish.py # Publish core package
โ โโโ build-extension.py # Build extension packages
โ โโโ publish-extension.py # Publish extension packages
โโโ docs/
โ โโโ 0-MISSION.md
โ โโโ 1-ARCHITECTURE.md
โ โโโ 2-EXPERIENCE.md
โ โโโ skills_models.md
โโโ pyproject.toml # Core package configuration
Monorepo Benefits
- Unified Development: All packages in one repository for easier coordination
- Independent Publishing: Each package builds and publishes separately to PyPI
- Shared Standards: Consistent code quality, linting, and testing across packages
- Easier Refactoring: Changes to core can be tested with all extensions immediately
See extensions/README.md for details on working with extensions.
๐งช Testing
# Install development dependencies
uv sync --dev
# Run all tests
uv run pytest
# Run with coverage
uv run pytest --cov=waccy --cov-report=html
# Run specific test suites
uv run pytest tests/unit/
uv run pytest tests/integration/
# Format and lint with ruff
uv run ruff format
uv run ruff check --fix
# Type checking with mypy
uv run mypy src/waccy
๐ Documentation
- Mission Statement - Project goals, philosophy, and roadmap
- Architecture - Technical architecture and design principles
- Experience Guide - User experience and workflows
- Skills & Models - Financial modeling capabilities
๐๏ธ Design Principles
-
Simplicity and Focus: Core platform maintains a simple, focused design with essential data sources (QBO and EDGAR)
-
Standardized Ontology First: All financial data maps to a standardized WACCY chart of accounts, ensuring consistency and comparability
-
Modular Extensibility: New data sources and model types added as separate packages that conform to core interfaces
-
Accuracy First: Deterministic functions preferred over probabilistic models. LLMs used for parsing and classification, not financial calculations
-
Transparency and Auditability: Every data point traceable to source, every calculation explainable, every assumption documented
-
Professional Standards: Models adhere to institutional-quality standards for architecture, formatting, and presentation
-
Quality Quantification: Standardized ontology enables measurement and reporting of data quality and model output reliability
-
Small Business Focus: Designed primarily for small businessesโfrom sole proprietorships to growing companiesโhandling messy, incomplete data
๐ค Contributing
We welcome contributions! WACCY is built for the small business community, and we'd love your help making financial modeling more accessible.
Getting Started
- Fork the repository on GitHub
- Create a feature branch for your contribution
- Set up development environment:
git clone https://github.com/your-username/waccy.git cd waccy uv sync --dev
- Make your changes following our coding standards:
- Use
rufffor formatting and linting - Add type hints with
mypycompliance - Write tests for new functionality
- Update documentation as needed
- Use
- Run tests and linting:
uv run pytest uv run ruff check uv run mypy src/
- Submit a pull request with a clear description of your changes
Creating Extensions
Want to add a new data source? Create an extension package:
- Create a new package:
waccy-yourdatasource - Implement the
Extractorinterface fromwaccy.extraction.base - Register your extension via entry points
- Follow the Extension Development Guide
See our Architecture Documentation for detailed extension development guidelines.
Code of Conduct
Please read and follow our Code of Conduct to ensure a welcoming environment for all contributors.
๐ Project Status
- Phase 1: ๐ Planned - Core foundation and 3-statement models
- Phase 2: ๐ Planned - Public market data and pattern learning (EDGAR)
- Phase 3: ๐ Planned - Advanced valuation models (DCF, M&A, LBO)
- Phase 4: ๐ Planned - Specialized model types (SaaS, REIT, project finance)
- Phase 5: ๐ Planned - Advanced analysis and decision support
Current Status: Early development - Architecture and core platform design
Python Version: 3.13+
Package Manager: uv
CI/CD: GitHub Actions (coming soon)
๐ License
This project is licensed under the MIT License - see the LICENSE file for details.
๐ Acknowledgments
- Small Business Community - Built for entrepreneurs and small business owners who need professional financial modeling
- Financial Modeling Community - Inspired by institutional-quality modeling standards
- Open Source Tools:
Made with ๐คช for small businesses that deserve institutional-quality financial modeling
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file waccy-0.0.2.tar.gz.
File metadata
- Download URL: waccy-0.0.2.tar.gz
- Upload date:
- Size: 2.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
36cd8fb004961de53078b5fe36bc45e4dda4b54fb5a756470b8320262908ca88
|
|
| MD5 |
71f1ceb6ff89e5595621051e4b3f622c
|
|
| BLAKE2b-256 |
082387bf574999db2f14dbfdb8c558024689dd751bcbb87cabcb6890689b2e57
|
File details
Details for the file waccy-0.0.2-py3-none-any.whl.
File metadata
- Download URL: waccy-0.0.2-py3-none-any.whl
- Upload date:
- Size: 20.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e588f67d8346b2722d4820da3bd2ec59d0beb3862e1b2a86e39baddae8d575f1
|
|
| MD5 |
da03e0a1b3d40cc94486f8157fb75b98
|
|
| BLAKE2b-256 |
4570744345d50489dc13b8b3f6cb31188e93fbf4b82f8a01a33ed183c9b9c6a1
|