Skip to main content

Agriculture and forestry statistics.

Project description

The biotrade package analyses international trade of bio-based products. It focuses on the agriculture and forestry sectors, from primary production to secondary products transformation. It loads bilateral trade data from UN Comtrade, production and trade data from FAOSTAT and socio-economic indicators from the World Bank.

Documentation

The documentation is available at: https://bioeconomy.gitlab.io/forobs/biotrade/

Installation

Base installation

Install the biotrade package from the python package index with pip:

python -m pip install biotrade

Upgrade the package to the latest version with:

python -m pip install --upgrade biotrade

To install the latest development version, use the --upgrade parameter and install from the main branch of the gitlab repository:

python -m pip install --upgrade --force-reinstall https://gitlab.com/bioeconomy/forobs/biotrade/-/archive/main/biotrade-main.tar.gz

Installation for contributors

If you plan to contribute to the development of the biotrade package, clone the biotrade repository at https://gitlab.com/bioeconomy/forobs/biotrade/. You need to tell python where the package is located by adding it to your PYTHONPATH. You can do this by changing the environment variables or by adding the following line to your shell configuration file such as .bash_aliases:

export PYTHONPATH="$HOME/repos/biotrade/":$PYTHONPATH

Specify where you want to store the data by adding the following environment variable:

export BIOTRADE_DATA="$HOME/repos/biotrade_data/"

Dependencies are listed in the install_requires argument of setup.py.

Usage

The biotrade package can download data from FAOSTAT and UN Comtrade and store it inside a database. By default it will use an SQLite database stored locally in the folder defined by the environment variable BIOTRADE_DATA. Alternatively, a PostGRESQL database can be used if a connection string is defined in the environment variable BIOTRADE_DATABASE_URL, for example by adding the following to your .bash_aliases or .bash_rc:

export BIOTRADE_DATABASE_URL="postgresql://user@localhost/biotrade"

Note that database queries are abstracted with SQL Alchemy which is what makes it possible to use different database engines. SQLite is convenient for data analysis on laptops. PostGreSQL can be used on servers.

FAOSTAT

Faostat provides agriculture and forestry data on their website https://www.fao.org/faostat/en/#data/

The biotrade package has a faostat.pump object that loads a selection of datasets. The list of downloaded datasets is visible in faostat.pump.datasets. Column names and product descriptions are reformatted to snake case in a way that is convenient for analysis. The data is then stored into an SQLite database (or PostgreSQL). The following commands download and transfer the given datasets to the database:

>>> from biotrade.faostat import faostat
>>> faostat.pump.update(["crop_production", "forestry_production"])
>>> # Loading trade data can take a long time on slow connections
>>> faostat.pump.update(["crop_trade", "forestry_trade"])
>>> faostat.pump.update(["food_balance"])
>>> faostat.pump.update(["land_use", "land_cover", "forest_land"])

List available datasets and metadata links:

>>> faostat.pump.datasets
>>> faostat.pump.metadata_link

Once the data has been loaded into the database, you can query it. For example select crop production data for 2 countries

>>> from biotrade.faostat import faostat
>>> db = faostat.db_sqlite
>>> cp2 = db.select(table="crop_production",
>>>                 reporter=["Portugal", "Estonia"])

Select forestry trade flows data reported by all countries, with Austria as a partner country:

>>> ft_aut = db.select(table="forestry_trade",
>>>                    partner=["Austria"])

Select crop trade flows reported by the Netherlands where Brazil was a partner

>>> ct_nel_bra = db.select(table="crop_trade",
>>>                        reporter="Netherlands",
>>>                        partner="Brazil")

Select the mirror flows reported by Brazil, where the Netherlands was a partner

>>> ct_bra_bel = db.select(table="crop_trade",
>>>                        reporter="Brazil",
>>>                        partner="Netherlands")

Select land use and land cover data

>>> lu = faostat.db.select("land_use")
>>> lc = faostat.db.select("land_cover")

Comtrade

See the documentation of the various methods. As an example here is how to download trade data from the Comtrade API and return a data frame, for debugging purposes:

>>> from biotrade.comtrade import comtrade
>>> # Other sawnwood
>>> swd99 = comtrade.pump.download(cc = "440799")
>>> # Soy
>>> soy = comtrade.pump.download(cc = "120190")

Display information on column names used for renaming and dropping less important columns:

>>> comtrade.column_names

Get the list of products from the Comtrade API

>>> hs = comtrade.pump.get_parameter_list("classificationHS.json")

Get the list of reporter and partner countries

>>> comtrade.pump.get_parameter_list("reporterAreas.json")
>>> comtrade.pump.get_parameter_list("partnerAreas.json")

Metadata and configuration data

Release dates

FAOSTAT release dates are available at : https://fenixservices.fao.org/faostat/static/releasecalendar/Default.aspx

Variable definitions and harmonization

Column names and product descriptions are reformatted to snake case in a way that is convenient for analysis. See example below.

  • Variables are defined and compared between the data sources in a notebook called definitions_and_harmonization

  • Variable names are harmonized between the different sources using a mapping table defined in biotrade/config_data/column_names.csv See for example how the product_code column is called PRODUCT_NC in Eurostat Comext, commodity_code or cmdcode in UN Comtrade and item_code in FAOSTAT.

  • snake_case is the preferred way of naming files and variables in the code. This follows the R tidyverse style guide for object names and the python PEP 8 style guide for function and variable names.

To illustrate the advantage of using snake case for data exploration, compare the use of column names with space which have to be quoted. Python

>>> df["Product Code"]
>>> df.product_code

R

R> df["Product Code"]
R> df$`Product Code`
R> df$product_code

Configuration data

The biotrade package stores a small amount of configuration data such as country and product mapping tables and conversion coefficients in the biotrade/config_data folder.

Licence

This software is licenced under the MIT licence. See the LICENCE.md file.

Similar projects

  • The R packages FAOSTAT and WDI

Tests

This package uses pytest for unit testing. Run the test suite with

pytest

Run pytest with code coverage

cd python_project_dir
coverage run --source=. -m pytest

Followed by

coverage html

To generate a report. These tests are run as part of the Continuous Integration.

Acknowledgements

The authors would like to acknowledge ideas and feedback received from the following persons: Giovanni Bausano, Noemi Cazzaniga, Mirco Migliavacca, Roberto Pilli, Lucas Sinclair.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biotrade-0.3.16.tar.gz (160.5 kB view details)

Uploaded Source

Built Distribution

biotrade-0.3.16-py3-none-any.whl (189.7 kB view details)

Uploaded Python 3

File details

Details for the file biotrade-0.3.16.tar.gz.

File metadata

  • Download URL: biotrade-0.3.16.tar.gz
  • Upload date:
  • Size: 160.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.7

File hashes

Hashes for biotrade-0.3.16.tar.gz
Algorithm Hash digest
SHA256 dcbccc466330b7ed8550aef316c077660e82e30055b08911455141893fdf4965
MD5 60f674b99f0f8c99df783076f8fec6e5
BLAKE2b-256 ef1571ad0b9c353a9518905c06e7bf3f15164e887173a3d2705feadc847500d2

See more details on using hashes here.

File details

Details for the file biotrade-0.3.16-py3-none-any.whl.

File metadata

  • Download URL: biotrade-0.3.16-py3-none-any.whl
  • Upload date:
  • Size: 189.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.7

File hashes

Hashes for biotrade-0.3.16-py3-none-any.whl
Algorithm Hash digest
SHA256 a1a0703168e599e7e8b9509bba1e86a0c5fa6fa7fa69248f4837db4d1da6572d
MD5 3fadccae93a2fd667b412c4664847860
BLAKE2b-256 5b7d856c8e513decad48f4e00f0beddb05a84b50b0fd07777b3edd9827be8e5d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page