Skip to main content

Automatic FAQ Mining Framework

Project description

AutoFAQ

PyPI version

AutoFAQ is a sophisticated FAQ dataset mining framework designed as an interactive CLI tool for rapid and efficient data collection. With AutoFAQ, you can compile comprehensive FAQ datasets in just minutes!

Process Overview

AutoFAQ's streamlined process follows these steps to generate a complete dataset:

  • Expand Keywords: Provide a few initial keywords, and AutoFAQ will automatically expand the list to generate a comprehensive query list.
  • Web Search: AutoFAQ performs searches using your choice of designated search engines to create a list of relevant webpages.
  • Information Extraction: Once all webpages are downloaded, AutoFAQ extracts QA pairs using the available extractor engines.
  • Data Cleaning: AutoFAQ's cleaning modules allow you to refine the dataset using semantic entailment of QA pairs, titles, or pages.
  • Rendering: Finally, you can render the dataset into your desired format with ease. Success!

Getting Started

  1. Install the autofaq CLI tool using:
    pip install auto-faq
    
  2. Usage: autofaq [OPTIONS] COMMAND [ARGS]...
    
    Options:
    --help  Show this message and exit.
    
    Commands:
    clean     Refines the dataset
    embed     Computes vector embeddings of the dataset
    extract   Collects webpages and extracts QA pairs
    init      Initiates a new mining project
    keywords  Generates an expanded list of keywords
    render    Transforms the dataset into human-readable formats
    search    Compiles search results for keywords
    

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

auto-faq-0.4.1.tar.gz (14.7 kB view hashes)

Uploaded Source

Built Distribution

auto_faq-0.4.1-py3-none-any.whl (25.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page