Skip to main content

Mine implicit features using a generative feature language model.

Project description

GFLM: mine implicit features using a generative feature language model


This package implements a Generative Feature Language Models for Mining Implicit Features.

Given the following input:

  • a text dataset
  • a set of predefined features

Compute the following:

  • mapping of explicit and implicit features on the data
  • using both gflm_word and gflm_section algorithms


pip install feature_mining

Sample Usage

    from feature_mining import FeatureMining
    fm = FeatureMining()

    - prediction using 'section': fm.gflm.gflm_section
    - prediction using 'word': fm.gflm.gflm_word

Display result:
    print(fm.gflm_section_result.sort_values(by=['gflm_section'], ascending=False)[['feature', 'section_text']].head(20))

Package created based on the following paper

S. Karmaker Santu, P. Sondhi and C. Zhai, "Generative Feature Language Models for Mining Implicit Features from Customer Reviews", Proceedings of the 25th ACM International on Conference on Information and Knowledge Management - CIKM '16, 2016.

Pydocs (Code Documentation)

Accessible via this link:

(Apologies for the color scheme - it was the default)


See Jupyter notebook tutorial

Video presentation and tutorial

Link to YouTube:

Package on PyPi


Known Issues

Explicit feature mentions not removed from GFLM word/sentence:

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

feature_mining-0.1.1.tar.gz (296.8 kB view hashes)

Uploaded source

Built Distribution

feature_mining-0.1.1-py3-none-any.whl (581.4 kB view hashes)

Uploaded py3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page