Skip to main content

A python package to partially automate search term selection and writing search strategies for systematic reviews

Project description

Ananse

This project is a collaboration between Dr. Effah Antwi,Research Scientist, Natural Resources Canada, Canadian Forest Service and Dr Wiafe Owusu-Banahene, Department of Computer Engineering, School of Engineering Sciences, University of Ghana, Legon, Accra, Ghana. The Ananse package is a python package designed to partially automate search term selection and writing search strategies for systematic reviews. Read the documentation at baasare.github.io/ananse and ananse.readthedocs.io/

Setup

Ananse requires python 3.7 or higher

Using pip

pip install ananse

Directly from the repository

git clone https://github.com/baasare/ananse.git
python ananse/setup.py install

Quick start

Writing your own script

from ananse import Ananse

min_len = 1 # minimum keyword length
max_len = 4 # maximum keyword length


# Create an instance of the package
test_run = Ananse()


# Import your naive search results from the current working directory
imports = test_run.import_naive_results(path="./")

# Columns to deduplicate imported search results
columns =  ['title', 'abstract']


#de-duplicate the imported search results
data = test_run.deduplicate_dataframe(imports, columns)


#extract keywords from article title and abstract as well as author and database tagged keywords
all_terms = test_run.extract_terms(data, min_len=min_len, max_len=max_len)


#create Document-Term Matrix, with columns as terms and rows as articles
dtm, term_column = test_run.create_dtm(data.text, keywords=all_terms, min_len=max_len, max_len=max_len)


#create co-occurrence network using Document-Term Matrix
graph_network = test_run.create_network(dtm, all_terms)


#plot histogram and node strength of the network
test_run.plot_degree_histogram(graph_network)
test_run.plot_degree_distribution(graph_network)


#Determine cutoff for the relevant keywords
cutoff_strengths = test_run.find_cutoff(graph_network, "spline", "degree", degrees=3, knot_num=1, percent=0.879956,
                                            diagnostics=True)


#get suggested keywords and save to a csv file
suggested_keywords = test_run.get_keywords(graph_network, "degree", cutoff_strengths, save_keywords=True)


#Print suggested keywords
for word in suggested_keywords:
   print(word)

Using Ananse Test Script

python tests/ananse_test

References

This is a python implementation of the R package as mentioned in paper An automated approach to identifying search terms for systematic reviews using keyword co‐occurrence networks by Eliza M. Grames, Andrew N. Stillman Morgan W. Tingley and Chris S. Elphick

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ananse-1.1.2.tar.gz (11.2 kB view details)

Uploaded Source

Built Distribution

ananse-1.1.2-py3-none-any.whl (9.7 kB view details)

Uploaded Python 3

File details

Details for the file ananse-1.1.2.tar.gz.

File metadata

  • Download URL: ananse-1.1.2.tar.gz
  • Upload date:
  • Size: 11.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6

File hashes

Hashes for ananse-1.1.2.tar.gz
Algorithm Hash digest
SHA256 c97cc4b339b1c00a85804e124d0eae7d7b6c31454b363bb8a41d57610fac02b5
MD5 e62923891761f778a19aea8f9a05e1ff
BLAKE2b-256 5c57eb78e081cdab578c9ae40fcf11568b01c93caf601f1664ae73034cce6ecd

See more details on using hashes here.

File details

Details for the file ananse-1.1.2-py3-none-any.whl.

File metadata

  • Download URL: ananse-1.1.2-py3-none-any.whl
  • Upload date:
  • Size: 9.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.6

File hashes

Hashes for ananse-1.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 89b439f42fed067bbe2ad15a7ef39639c18a03c7688b28d53a50902b00b906cf
MD5 74274d98901f42848c8749ccadfee9a2
BLAKE2b-256 ca5f43b42e0dc5a25be36efe112b3444082fd2cfffb3dc397ea20c42026f37a7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page