A library for scraping and analyzing forecasting markets

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

vigji

These details have not been verified by PyPI

Project description

Mootlib

A Python library for finding similar questions across prediction markets.

Features

Search for similar questions across multiple prediction market platforms
Access historical market data and probabilities
Compare questions using semantic similarity
Automatic caching and data management
Direct access to market data and embeddings

Installation

pip install mootlib

Environment Setup

Required Environment Variables

The library requires several environment variables to function:

MOOTLIB_ENCRYPTION_KEY: Required for decrypting market data
DEEPINFRA_TOKEN: Required for computing embeddings
GJO_EMAIL and GJO_PASSWORD: Optional, for Good Judgment Open access

You can set these up in two ways:

1. Using a .env file (recommended for local development)

Create a .env file in your project root:

MOOTLIB_ENCRYPTION_KEY="your-key-here"
DEEPINFRA_TOKEN="your-token-here"
GJO_EMAIL="your-email@example.com"  # Optional
GJO_PASSWORD="your-password"        # Optional

Then in your Python code:

from dotenv import load_dotenv
load_dotenv()  # Load environment variables from .env

from mootlib import MootlibMatcher
matcher = MootlibMatcher()

2. Setting environment variables directly

# Unix/macOS
export MOOTLIB_ENCRYPTION_KEY="your-key-here"
export DEEPINFRA_TOKEN="your-token-here"

# Windows PowerShell
$env:MOOTLIB_ENCRYPTION_KEY="your-key-here"
$env:DEEPINFRA_TOKEN="your-token-here"

3. For GitHub Actions

Add these secrets in your repository's Settings → Secrets and Variables → Actions:

MOOTLIB_ENCRYPTION_KEY
DEEPINFRA_TOKEN
GJO_EMAIL (optional)
GJO_PASSWORD (optional)

Then use them in your workflow:

env:
  MOOTLIB_ENCRYPTION_KEY: ${{ secrets.MOOTLIB_ENCRYPTION_KEY }}
  DEEPINFRA_TOKEN: ${{ secrets.DEEPINFRA_TOKEN }}

Quick Start

from mootlib import MootlibMatcher

# Initialize the matcher
matcher = MootlibMatcher()

# Find similar questions
similar = matcher.find_similar_questions(
    "Will Russia invade Moldova in 2024?",
    n_results=3,
    min_similarity=0.7
)

# Print the results
for question in similar:
    print(f"\n{question}")

API Reference

MootlibMatcher

The main interface for finding similar questions across prediction markets.

matcher = MootlibMatcher(cache_duration_minutes=30)

Parameters:

cache_duration_minutes: How long to keep downloaded data in cache (default: 30)

Properties

markets_df

Access the raw markets DataFrame containing all prediction market data:

markets_df = matcher.markets_df

The DataFrame contains columns:

question: The market question text
source_platform: Platform where the market is from
formatted_outcomes: Current probabilities/outcomes
url: Link to the original market
n_forecasters: Number of forecasters
volume: Trading volume/liquidity
published_at: Publication datetime

embeddings_df

Access the embeddings DataFrame containing question vectors:

embeddings_df = matcher.embeddings_df

The DataFrame contains columns:

text: The question text
embedding: The numerical embedding vector

Note: Embeddings are computed on-demand and cached for future use.

find_similar_questions

similar = matcher.find_similar_questions(
    query="Will Tesla stock reach $300 in 2024?",
    n_results=5,
    min_similarity=0.5
)

Parameters:

query: The question to find similar matches for
n_results: Number of similar questions to return (default: 5)
min_similarity: Minimum similarity score 0-1 (default: 0.5)

Returns a list of SimilarQuestion objects with the following attributes:

question: The text of the prediction market question
similarity_score: How similar this question is to the query (0-1)
source_platform: The platform where this question was found
formatted_outcomes: String representation of possible outcomes and probabilities
url: URL to the original market (optional)
n_forecasters: Number of people who made predictions (optional)
volume: Trading volume or liquidity (optional)
published_at: When the market was published (optional)

Examples

Finding Similar Market Questions

from mootlib import MootlibMatcher

matcher = MootlibMatcher()

# Search for AI-related questions
ai_questions = matcher.find_similar_questions(
    "Will AGI be achieved by 2025?",
    n_results=3,
    min_similarity=0.7
)

# Search for geopolitical questions
geo_questions = matcher.find_similar_questions(
    "Will China invade Taiwan in 2024?",
    n_results=3,
    min_similarity=0.7
)

# Print results
for q in ai_questions + geo_questions:
    print(f"\n{q}\n{'=' * 80}")

Accessing Market Details

from mootlib import MootlibMatcher

matcher = MootlibMatcher()

# Find similar questions and access their details
similar = matcher.find_similar_questions("Will SpaceX reach Mars by 2025?")

for q in similar:
    print(f"\nQuestion: {q.question}")
    print(f"Platform: {q.source_platform}")
    print(f"Current Probabilities: {q.formatted_outcomes}")
    if q.url:
        print(f"Market URL: {q.url}")
    if q.n_forecasters:
        print(f"Number of Forecasters: {q.n_forecasters}")
    print("-" * 80)

Accessing Raw Data

from mootlib import MootlibMatcher

matcher = MootlibMatcher()

# Get all market data
markets_df = matcher.markets_df
print(f"Total markets: {len(markets_df)}")
print("\nMarkets by platform:")
print(markets_df["source_platform"].value_counts())

# Get question embeddings
embeddings_df = matcher.embeddings_df
print(f"\nTotal questions with embeddings: {len(embeddings_df)}")

# Filter markets by platform
manifold_markets = markets_df[markets_df["source_platform"] == "Manifold"]
print(f"\nManifold markets: {len(manifold_markets)}")

# Get high-volume markets
high_volume = markets_df[markets_df["volume"] > 1000]
print(f"\nHigh volume markets: {len(high_volume)}")

Development

Local Setup

Clone the repository

git clone https://github.com/vigji/mootlib.git
cd mootlib

Install dependencies with uv

pip install uv
uv venv
source .venv/bin/activate  # On Unix/macOS
# or
.venv\Scripts\activate     # On Windows
uv pip install -e ".[dev]"

Code Quality

We use Ruff for all Python linting and formatting:

# Format code
ruff format .

# Run linter
ruff check .

# Run linter with automatic fixes
ruff check --fix .

Repository Maintenance

Versioning and Releases

We use Git tags for versioning. The version number is automatically derived from the latest tag using hatch-vcs.

To create a new release, you have two options:

Quick Release (via Git tag):

# Create and push a new version tag (e.g., v0.1.1)
git tag -a v0.1.1 -m "Description of changes"
git push origin v0.1.1

This will automatically trigger the release workflow.

Full Release (via GitHub UI):
- Create and push a tag as above
- Go to GitHub -> Releases -> Create a new release
- Choose the tag you just pushed
- Add detailed release notes
- Click "Publish release"

In both cases, the release workflow will automatically:

Run all tests
If tests pass, build the package
Publish to PyPI using trusted publishing

Note: Using the GitHub UI method allows you to add more detailed release notes and attachments, but both methods will publish to PyPI.

Pre-commit Hooks

We use pre-commit hooks to ensure code quality. Install them with:

pre-commit install

This will automatically run Ruff and other checks before each commit.

Code Style Guidelines

Maximum line length: 88 characters (enforced by Ruff)
Use pathlib over os.path
Use functions only where you see opportunity for code reuse
Use classes sparingly and when it makes sense over functions
Use loops to streamline operations repeated more than once
Document briefly middle-length functions, fully annotate only complex ones

Running Tests

pytest

Type Checking

mypy mootlib tests

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

vigji

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.0

May 14, 2025

0.2.0

May 13, 2025

0.1.2

May 13, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mootlib-0.3.0.tar.gz (3.0 MB view details)

Uploaded May 14, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mootlib-0.3.0-py3-none-any.whl (35.3 kB view details)

Uploaded May 14, 2025 Python 3

File details

Details for the file mootlib-0.3.0.tar.gz.

File metadata

Download URL: mootlib-0.3.0.tar.gz
Upload date: May 14, 2025
Size: 3.0 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for mootlib-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`4e579b4952686dee3d00596e486e71ae5fb8c4aecea88204396c7682209e382b`
MD5	`9e504feb30399e48c7bdb2a6ea6ff06c`
BLAKE2b-256	`fe9406a51b14578a163fc89c6e74b8fa5b7f9d89f0e5634dda2988274cff3433`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mootlib-0.3.0.tar.gz:

Publisher: publish.yml on vigji/mootlib

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mootlib-0.3.0.tar.gz
- Subject digest: 4e579b4952686dee3d00596e486e71ae5fb8c4aecea88204396c7682209e382b
- Sigstore transparency entry: 212779041
- Sigstore integration time: May 14, 2025
Source repository:
- Permalink: vigji/mootlib@111b5dc13ea1cf109dbd58af83781fc58e673244
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/vigji
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@111b5dc13ea1cf109dbd58af83781fc58e673244
- Trigger Event: push

File details

Details for the file mootlib-0.3.0-py3-none-any.whl.

File metadata

Download URL: mootlib-0.3.0-py3-none-any.whl
Upload date: May 14, 2025
Size: 35.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for mootlib-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9385aadd9a7d0a76c111b0e10b01cef039444c59a39a4bf0076f1ffa9ff905c4`
MD5	`6b7cbbda46fc91d1dd20c8497b37562c`
BLAKE2b-256	`1626bce88db6d9436c4b26c9be14a356d51a876b9f06af126c6ff09b3cb10bdf`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mootlib-0.3.0-py3-none-any.whl:

Publisher: publish.yml on vigji/mootlib

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mootlib-0.3.0-py3-none-any.whl
- Subject digest: 9385aadd9a7d0a76c111b0e10b01cef039444c59a39a4bf0076f1ffa9ff905c4
- Sigstore transparency entry: 212779043
- Sigstore integration time: May 14, 2025
Source repository:
- Permalink: vigji/mootlib@111b5dc13ea1cf109dbd58af83781fc58e673244
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/vigji
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@111b5dc13ea1cf109dbd58af83781fc58e673244
- Trigger Event: push

mootlib 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Mootlib

Features

Installation

Environment Setup

Required Environment Variables

1. Using a .env file (recommended for local development)

2. Setting environment variables directly

3. For GitHub Actions

Quick Start

API Reference

MootlibMatcher

Properties

markets_df

embeddings_df

find_similar_questions

Examples

Finding Similar Market Questions

Accessing Market Details

Accessing Raw Data

Development

Local Setup

Code Quality

Repository Maintenance

Versioning and Releases

Pre-commit Hooks

Code Style Guidelines

Running Tests

Type Checking

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance