Skip to main content

Statistical tools for the ASReview project

Project description

ASReview-statistics

ASReview extension for generating statistics on log files and datasets.

General

Install the package with:

pip install asreview-statistics

The general usage of the package is to inspect files related to the systematic review done with ASReview. It can be used to inspect your dataset that you would like to review (or have reviewed).

General usage:

asreview stat path_to_file

Datasets

Use the following command on your command line:

asreview stat path_to_your_dataset

It should give some general properties of the dataset, e.g.:

************  ptsd.csv  ************

Number of papers:            5782
Number of inclusions:        38 (0.66%)
Number of exclusions:        5744 (99.34%)
Number of unlabeled:         0 (0.00%)
Average title length:        101
Average abstract length:     1339
Number of missing titles:    64 (of which 0 included)
Number of missing abstracts: 747 (of which 0 included)

Your dataset should be in a format that is readable by the ASReview software. Documentation on how to create such a dataset is in the main project.

Log files

Another use is the quick analysis of either one log file, or multiple log files in the same directory:

asreview stat path_to_your_log_files

This will give output similar to:

************  ptsd_nb  *******************

-----------  general  -----------
Number of runs           : 16
Number of papers         : 5782
Number of included papers: 38
Number of excluded papers: 5744
Number of queries        : 233

-----------  settings  -----------
model             : nb
query_strategy    : max_random
balance_strategy  : double
feature_extraction: tfidf
n_instances       : 25
n_prior_included  : 1
n_prior_excluded  : 1
mode              : simulate
model_param       : {'alpha': 3.822}
query_param       : {'strategy_1': 'max', 'strategy_2': 'random', 'mix_ratio': 0.95}
feature_param     : {}
balance_param     : {'a': 2.155, 'alpha': 0.94, 'b': 0.789, 'beta': 1.0}
abstract_only     : False

-----------  WSS/RRF  -----------
WSS@95 : 91.50 %
WSS@100: 87.56 %
RRF@5  : 97.30 %
RRF@10 : 97.64 %

Multiple log files/directories are accepted. Currently, the amount of information displayed is limited, help/suggestions are welcome!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asreview-statistics-0.2.0.tar.gz (6.1 kB view details)

Uploaded Source

Built Distribution

asreview_statistics-0.2.0-py3-none-any.whl (11.2 kB view details)

Uploaded Python 3

File details

Details for the file asreview-statistics-0.2.0.tar.gz.

File metadata

  • Download URL: asreview-statistics-0.2.0.tar.gz
  • Upload date:
  • Size: 6.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.7

File hashes

Hashes for asreview-statistics-0.2.0.tar.gz
Algorithm Hash digest
SHA256 1cac9d58b4c49de2807b6b697d73d9731c0aea93cdd8f4026d5c9885b244f59a
MD5 8d7da219df1e66662509fb98c5090183
BLAKE2b-256 76f80969671cd72c029b66958bd5eec477ed17d40d98edc69d7a19bb1bcad664

See more details on using hashes here.

File details

Details for the file asreview_statistics-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: asreview_statistics-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 11.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.7

File hashes

Hashes for asreview_statistics-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a054e802dadf32f82dd98c8be7f5dbcea79c37de0668ef94bc86c0033a3a6c14
MD5 32b55cf2fc54f37180a2dc3f748c5dd7
BLAKE2b-256 b89babaadbd32091ae32297dec7a1fb66337803774fff54e76dbf197f600410a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page