
Extract data from Flora of North America

Flora Data Extraction Project

This script extracts data from PDF files of genera from the book Flora of North America. For each PDF file given to the script as an argument, it creates a CSV file with a matching name. Each row of a CSV file has the format

"Species name", "Location where the species appears", "Identifier"

The easiest way to run the script is to change to a folder where the only PDF files are genera files from Flora of North America and enter:

python -m florana.extract -A

The script then runs on every PDF file in the directory and creates a CSV file for each one.
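
Once the script has run, the output files can be loaded back for inspection. The following is a minimal sketch, not part of florana itself: the helper names read_rows and collect_results are hypothetical, and it assumes that each CSV shares its PDF's name with a .csv extension, is UTF-8 encoded, and holds three fields per row in the order described above. If the files begin with a header row, that row is returned along with the data.

```python
import csv
from pathlib import Path


def read_rows(csv_path):
    """Return (species, location, identifier) tuples from one output CSV."""
    rows = []
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        # skipinitialspace tolerates a space after each comma, as in the
        # format line shown in this README.
        for record in csv.reader(handle, skipinitialspace=True):
            if len(record) != 3:
                continue  # skip blank or malformed lines
            rows.append(tuple(field.strip() for field in record))
    return rows


def collect_results(directory="."):
    """Map each PDF name in `directory` to the rows of its matching CSV."""
    results = {}
    for pdf in sorted(Path(directory).glob("*.pdf")):
        # Assumed naming: same file name as the PDF, with a .csv extension.
        csv_path = pdf.with_suffix(".csv")
        if csv_path.exists():
            results[pdf.name] = read_rows(csv_path)
    return results


if __name__ == "__main__":
    for pdf_name, rows in collect_results().items():
        print(f"{pdf_name}: {len(rows)} rows")
```

PDF files without a matching CSV are left out of the result, which makes it easy to spot files the script has not processed yet.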

Note: Python 2

If Python 2 is also installed on your system, you will probably need to run python3 instead of python.

Installing

python -m pip install florana

Note: Windows Users

If you're running Windows, you'll likely need to install poppler. Extract the latest binary from the link provided and add its bin folder to your PATH environment variable. For example, if C:\path\to\poppler is the directory where you extracted poppler, add C:\path\to\poppler\bin to your PATH environment variable.
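
To confirm that the PATH change took effect, you can check from Python whether a poppler tool is found. This is a minimal sketch under one assumption: it looks for pdftotext, a command-line tool that ships with poppler, but this README does not state which poppler tool florana actually relies on. The helper name find_poppler_tool is hypothetical.

```python
import shutil


def find_poppler_tool(tool="pdftotext"):
    """Return the full path of a poppler command-line tool, or None if it is not on PATH."""
    # shutil.which searches the directories listed in the PATH environment variable.
    return shutil.which(tool)


if __name__ == "__main__":
    location = find_poppler_tool()
    if location is None:
        print("pdftotext not found; check that poppler's bin folder is on PATH")
    else:
        print(f"pdftotext found at {location}")
```

Open a new terminal before running the check, since changes to PATH usually do not reach terminals that are already open.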

Dependencies


