A simple Python SDK to scrape and retrieve pricing information for Large Language Models (LLMs) from an external webpage, with structured models for easy integration and usage.

Project description

LLM Price Scraper

LLM Price Scraper is a Python package that scrapes and organizes pricing information for large language models (LLMs) from external web pages such as https://www.botgenuity.com/tools/llm-pricing.

Installation

You can install the package using pip:

pip install llm-price-scraper==1.0.2

Usage

Once you have installed the scraper, you can use it to retrieve the current pricing information from the default source with a single call.

from llm_price_scraper.scrapers import LlmPricingScraper

# Get the pricing information
pricing_data = LlmPricingScraper.scrape()

# Loop through each pricing entry and print data
for entry in pricing_data:
    print(f"Model: {entry.model}")
    print(f"Provider: {entry.provider}")
    print(f"1M input tokens: {entry.input_tokens_price}$")
    print(f"1M output tokens: {entry.output_tokens_price}$")
    print(f"Context: {entry.context}")
    print(f"Source: {entry.source}")
    print(f"Updated: {entry.updated}")
    print("-" * 40)
    
# Select every entry whose model name contains "gpt-4o" (case-insensitive)
gpt_4o_models = [entry for entry in pricing_data if "gpt-4o" in entry.model.lower()]
print("GPT-4o models:")
for entry in gpt_4o_models:
    print(f"Model: {entry.model}")
    print(f"Provider: {entry.provider}")
    print(f"1M input tokens: {entry.input_tokens_price}$")
    print(f"1M output tokens: {entry.output_tokens_price}$")
    print(f"Context: {entry.context}")
    print(f"Source: {entry.source}")
    print(f"Updated: {entry.updated}")
    print("-" * 40)
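
The same filtering pattern extends to other queries, such as finding the cheapest model for each provider. The following is a minimal sketch, assuming the entries expose the attributes printed above and that the prices are numeric. `PricingEntry` and `cheapest_by_provider` are hypothetical names introduced here so the snippet runs without network access, and every sample value other than the gpt-4-32k prices is made up for illustration.

```python
from dataclasses import dataclass


# Hypothetical stand-in for the objects returned by LlmPricingScraper.scrape();
# only the attribute names are taken from the usage example above.
@dataclass
class PricingEntry:
    model: str
    provider: str
    input_tokens_price: float   # price per 1M input tokens (assumed numeric)
    output_tokens_price: float  # price per 1M output tokens (assumed numeric)


def cheapest_by_provider(entries):
    """Return a dict mapping each provider to its entry with the lowest input price."""
    cheapest = {}
    for entry in entries:
        best = cheapest.get(entry.provider)
        if best is None or entry.input_tokens_price < best.input_tokens_price:
            cheapest[entry.provider] = entry
    return cheapest


# Illustrative data only; real values come from the scraper.
sample = [
    PricingEntry("gpt-4-32k", "OpenAI", 60.0, 120.0),
    PricingEntry("example-small", "OpenAI", 1.0, 2.0),
    PricingEntry("example-large", "ExampleAI", 5.0, 15.0),
]

for provider, entry in cheapest_by_provider(sample).items():
    print(f"{provider}: {entry.model} ({entry.input_tokens_price}$ per 1M input tokens)")
```

With the package installed, passing the list returned by `LlmPricingScraper.scrape()` instead of `sample` should work the same way, provided the price attributes compare as numbers.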

You can also choose the data source to scrape by passing it as an argument to the scrape method. The available sources are defined in the DataSources enum.

from llm_price_scraper.scrapers import LlmPricingScraper, DataSources

pricing_data = LlmPricingScraper.scrape(DataSources.HUGGINGFACE)
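
Because the source is just an argument, one possible pattern is to try several sources in order and keep the first that responds. The sketch below is an illustration under assumptions, not part of the package: `scrape_first_available`, `FakeSource`, and `fake_scrape` are hypothetical names, and the stand-in enum and scrape function exist only so the snippet runs offline.

```python
from enum import Enum


def scrape_first_available(scrape_fn, sources):
    """Try each source in order; return (source, data) for the first that succeeds."""
    errors = {}
    for source in sources:
        try:
            return source, scrape_fn(source)
        except Exception as error:  # remember the failure and move on to the next source
            errors[source] = error
    raise RuntimeError(f"All sources failed: {errors}")


# Hypothetical stand-ins so the sketch runs offline; not the package's real enum.
class FakeSource(Enum):
    PRIMARY = "primary"
    BACKUP = "backup"


def fake_scrape(source):
    # Simulate the first source being unreachable.
    if source is FakeSource.PRIMARY:
        raise Exception("Failed to retrieve the webpage. Status code: 404")
    return [f"entry from {source.value}"]


source, data = scrape_first_available(fake_scrape, FakeSource)
print(source.name, data)
```

With the real package, `scrape_first_available(LlmPricingScraper.scrape, DataSources)` should behave similarly, assuming `DataSources` is a standard `Enum` that can be iterated and that `scrape` accepts the source as its first positional argument, as in the call above.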

Example Output

After running the above code, you should see an output like this:

Model: gpt-4-32k
Provider: OpenAI
1M input tokens: 60
1M output tokens: 120
Context: 32K
Source: https://www.botgenuity.com/tools/llm-pricing
Updated: March 16, 2024
----------------------------------------
...
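
Since prices are quoted per 1M tokens, the cost of a single request follows from simple proportional arithmetic. The sketch below uses the gpt-4-32k figures from the example output; `estimate_cost` is a hypothetical helper, not a function provided by the package, and the token counts are arbitrary.

```python
def estimate_cost(input_price_per_1m, output_price_per_1m, input_tokens, output_tokens):
    """Estimate the cost of one request from per-1M-token prices."""
    input_cost = input_tokens / 1_000_000 * input_price_per_1m
    output_cost = output_tokens / 1_000_000 * output_price_per_1m
    return input_cost + output_cost


# gpt-4-32k prices from the example output: 60 (input) and 120 (output) per 1M tokens.
cost = estimate_cost(60, 120, input_tokens=1_000, output_tokens=500)
print(f"Estimated cost: {cost:.2f}$")  # prints "Estimated cost: 0.12$"
```

Here 1,000 input tokens cost 0.06$ and 500 output tokens cost 0.06$, for a total of 0.12$.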

Error Handling

If the scraper cannot retrieve the webpage, it raises an exception with a message that includes the HTTP status code. For example:

Exception: Failed to retrieve the webpage. Status code: 404
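
Transient failures can often be handled by retrying. The following is a minimal sketch of a retry wrapper, assuming only what is stated above: that a failed scrape raises a generic `Exception`. `scrape_with_retry` and its parameters are hypothetical and not part of the package.

```python
import time


def scrape_with_retry(scrape_fn, attempts=3, delay_seconds=2.0):
    """Call scrape_fn, retrying after any exception, up to `attempts` calls in total."""
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return scrape_fn()
        except Exception as error:  # only a generic Exception is documented
            last_error = error
            print(f"Attempt {attempt} failed: {error}")
            if attempt < attempts:
                time.sleep(delay_seconds)  # wait before the next attempt
    raise RuntimeError(f"Scraping failed after {attempts} attempts") from last_error
```

With the package installed, a call such as `pricing_data = scrape_with_retry(LlmPricingScraper.scrape)` would retry the default source. A persistent error such as a 404 will not be fixed by retrying, so the wrapper re-raises after the final attempt instead of hiding the failure.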

Contributing

Contributions, bug reports, and feature requests are welcome! Feel free to submit a pull request or open an issue on GitHub.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Download files

Source Distribution

llm_price_scraper-1.0.2.tar.gz (7.3 kB)

Uploaded Source

Built Distribution

llm_price_scraper-1.0.2-py3-none-any.whl (9.5 kB)

Uploaded Python 3

File details

Details for the file llm_price_scraper-1.0.2.tar.gz.

File metadata

  • Download URL: llm_price_scraper-1.0.2.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for llm_price_scraper-1.0.2.tar.gz
  • SHA256: f4bc2ff4f40150c90f172c35a26735c3a4b6998b52b944d5f3ae52336af7bcb6
  • MD5: fb1fe7ffb7b94877545311e407563d61
  • BLAKE2b-256: 4e426235de7833fa7a55e758c00a94d1f1da6a2e57e22382e72bdd42eca27a7d

File details

Details for the file llm_price_scraper-1.0.2-py3-none-any.whl.

File hashes

Hashes for llm_price_scraper-1.0.2-py3-none-any.whl
  • SHA256: 3bf1ea6f61153982909ccab4732762ecfc136067caf28a98d66771a99ec25261
  • MD5: 0a285e77bedbea056316d1a20c9c1279
  • BLAKE2b-256: 8f528c56f93b7d7a6884ab75442dd608f038b3751b6dc4fe6dff7b70197be55d
