Skip to main content

This package scrapes Wikipedia for equations, parses the equations, and builds priors of operations and functions.

Project description


equation_scraper is a Python package that scrapes Wikipedia pages for mathematical equations and then parses the equations into its components to build prior distributions. Specifically, these priors include information such as the number of times an operator or function appears across all equations scraped. For example, the expression m*x+b*sin(y) would be parsed into the simple prior: {*: 2, +: 1, sin: 1}. The package includes much more information than this simple prior, for example conditional priors---a full breakdown of the included metrics is detailed on the Priors section of our documentation. The package was designed to provide equation discovery modelling techniques, such as Symbolic Regression and the Bayesian Machine Scientist, with informed priors; however, the application of this package can extend far beyond this.


equation_scraper documentation can be found at

Equation-Scraper in Research

There are no current publications using the equation-scraper; however, we have two manuscripts currently under review. They are in a double-blind review process, so we are at this point refraining from sharing preprints, but will do so as soon as we are able.


This project is in active development by the Autonomous Empirical Research Group, led by Sebastian Musslick. The package was built by Chad Williams. Furthermore, the package depends on another package of ours, equation-tree, which was developped by Younes Strittmatter and Ioana Marinescu.

This research program is supported by Schmidt Science Fellows, in partnership with the Rhodes Trust, as well as the Carney BRAINSTORM program at Brown University.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

equation_scraper-1.0.0.tar.gz (847.0 kB view hashes)

Uploaded Source

Built Distribution

equation_scraper-1.0.0-py3-none-any.whl (19.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page