Skip to main content

Extract themes from verbatims

Project description

Matcher-pcx-synomia

Extract themes from verbatims.

Version 1.1.2

Release date: 2019-09-18

Getting Started

pip install ...

Prerequisites

  • flashtext: $ pip install flashtext

Usage

  • You would need a lexicon.txt with the following structure:

manque de place=><manque de place;place;-1< personnel désagréable=><personnel désagréable;personnel;-1< train direct=><train direct;train;1<

  • Init matcher:

themes_matcher = matcher.ThemesMatcher('lexicon.txt')

  • Extract themes from verbatims:

verbatims = ["Le confort et la propreté","Rapidité (Train Direct), mais personnel désagréable","c'est catastrophique… retards chroniques, personnel désagréable","Le manque de place ds le tgv"] vbs2matches = themes_matcher.match(verbatims) print(vbs2matches) {'Rapidité (Train Direct), mais personnel désagréable': [['train direct', 'train', '1'], ['personnel désagréable', 'personnel', '-1']], 'Le confort et la propreté': [], 'Le manque de place ds le tgv': [['manque de place', 'place', '-1']], "c'est catastrophique… retards chroniques, personnel désagréable": [['personnel désagréable', 'personnel', '-1']]}

Authors

  • Synomia

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

matcher_pcx_synomia-1.1.2.tar.gz (2.2 kB view hashes)

Uploaded source

Built Distribution

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page