Skip to main content

Recommender system for Python libraries

Project description

PipRec - Recommender system for Python libraries

PipRec is a robust recommender system targeted for users starting on new projects. Often people want to install a number of libraries when they start working on a project. With so many libraries to choose from (over 200 thousand from PyPI alone) it is often difficult to make an informed decision. Our recommender system is aimed at solving this problem.

Background

We use data provided by libraries.io (https://libraries.io/data) to compute our recommendations. From the data we compute association rules which are then used as a basis for our recommendations.

The number of association rules is relatively small. Around 8500 libraries have recommendations based on these rules. Compared to the amount of libraries out there we employ another method to increase our range of recommendations.

For libraries which we do not have information for, we try to find a similar library for which we already have recommendations. This is based purely on library descriptions.

Features

  • Each recommendation has a quality indicator
  • Ability to choose between 2 quality metrics (Lift and Certainty Factor)
  • Ability to choose between 2 output formats (list-like and tree-like)
  • Ability to choose between 2 similarity methods for unseen libraries (Latent Dirichlet Allocation and PyPI search engine)

Getting started

To get a recommendation for a library simply type:

piprec <library_name>

To show the list of available flags type:

piprec -h

Quality metrics

By default, PipRec uses Lift to indicate the quality of a recommendation.

To switch to a Certainty Factor metric simply add the -cf flag to your command:

piprec <library_name> -cf

Output formats

By default, PipRec outputs the recommendations in a simple list-like format.

To enable the tree-like output add the flag -t to your command:

piprec <library_name> -t

Similarity methods

By default, PipRec uses a trained LDA (Latent Dirichlet Allocation) model to find a similar library for which we have data about. We assume that we have a large enough dataset that the most similar library found will have relevant recommendations for your library.

Alternatively, you can use PyPI to find a similar library. The way this works is that we query search function on the PyPI website and take the top result which we know of. To enable this add the flag -pse to your command:

piprec <library_name> -pse

Evaluation

We would be very pleased if you spent a little bit of your time to tell us how well we did. We do not collect any personal information and your feedback would help us in our future endeavours. Just click the link below to complete a survey.

https://docs.google.com/forms/d/1BzGa8oBFSSmS9NctdtDVNFhOiDJO-S5sRXm6NNm0oWc

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

piprec-1.0.0.tar.gz (7.9 MB view details)

Uploaded Source

Built Distribution

piprec-1.0.0-py3-none-any.whl (8.0 MB view details)

Uploaded Python 3

File details

Details for the file piprec-1.0.0.tar.gz.

File metadata

  • Download URL: piprec-1.0.0.tar.gz
  • Upload date:
  • Size: 7.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for piprec-1.0.0.tar.gz
Algorithm Hash digest
SHA256 5620fd3f8d4796b1570086e32ff96351e229b95a4678b490a3a5b85be92aa8b4
MD5 639d65f59568ad5690ef82154aad5544
BLAKE2b-256 4ac0efaa3c7cf38473a4548718fd0c74bbc69554ed877fead728080ad77df77a

See more details on using hashes here.

File details

Details for the file piprec-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: piprec-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 8.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for piprec-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b8f460d2e28cb7db1d944630cb134471ca8f39a8e3d37e9b4303d920e2664544
MD5 e01eea95f7f9ec34889c98ac88d9a7f7
BLAKE2b-256 5d32d33ed56712840beac344c770ece3a07d98e92544cf77c1c22527749d0f12

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page