Skip to main content

This package scrapes Wikipedia for equations, parses the equations, and builds priors of operations and functions.

Project description

Overview

equation_scraper is a Python package that scrapes Wikipedia pages for mathematical equations and then parses the equations into its components to build prior distributions. Specifically, these priors include information such as the number of times an operator or function appears across all equations scraped. For example, the expression m*x+b*sin(y) would be parsed into the simple prior: {*: 2, +: 1, sin: 1}. The package includes much more information than this simple prior, for example conditional priors---a full breakdown of the included metrics is detailed on the Priors section of our documentation. The package was designed to provide equation discovery modelling techniques, such as Symbolic Regression and the Bayesian Machine Scientist, with informed priors; however, the application of this package can extend far beyond this.

Documentation

equation_scraper documentation can be found at https://autoresearch.github.io/equation-scraper/

Equation-Scraper in Research

There are no current publications using the equation-scraper; however, we have two manuscripts currently under review. They are in a double-blind review process, so we are at this point refraining from sharing preprints, but will do so as soon as we are able.

About

This project is in active development by the Autonomous Empirical Research Group, led by Sebastian Musslick. The package was built by Chad Williams. Furthermore, the package depends on another package of ours, equation-tree, which was developped by Younes Strittmatter and Ioana Marinescu.

This research program is supported by Schmidt Science Fellows, in partnership with the Rhodes Trust, as well as the Carney BRAINSTORM program at Brown University.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

equation_scraper-1.0.0.tar.gz (847.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

equation_scraper-1.0.0-py3-none-any.whl (19.6 kB view details)

Uploaded Python 3

File details

Details for the file equation_scraper-1.0.0.tar.gz.

File metadata

  • Download URL: equation_scraper-1.0.0.tar.gz
  • Upload date:
  • Size: 847.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.11

File hashes

Hashes for equation_scraper-1.0.0.tar.gz
Algorithm Hash digest
SHA256 55d26338b33e127dd70cb12e1b849f7e8b5192847099ca4906911fb25a761f67
MD5 af3b73cbecf86b5320e1a49efcaccefe
BLAKE2b-256 b25511e9b9738602ea6c839159bc5fa422dcf2d1e6ea265d9b82584f14e65da6

See more details on using hashes here.

File details

Details for the file equation_scraper-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for equation_scraper-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4c3293a4868dd54ab2f506d15a2b7467a75b58f8b2e3de225015c2fa86b988fc
MD5 313c31291e7adc899d27c89292e64845
BLAKE2b-256 a66baaae145fb6793c855b531957e5cc1c8bce0aed4b9af07636a854c4579e3b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page