Skip to main content

A crawler for structured product information from TDCC (Taiwan).

Project description

Taiwan Structured Product Information Crawler

PyPI version PyPI license Package Version Github Last Commit

This is a repository that offers a StructuredProductCrawler class to crawl Taiwan TDCC website for the product information.


from tdcc import StructuredProductCrawler
crawler = StructuredProductCrawler()
all_products = crawler.crawl()

Installation

To install this verson from PyPI, type:


pip install tdcc

To get the newest one from this repo (note that we are in the alpha stage, so there may be frequent updates), type:


pip install git+git://github.com/jn8029/tdcc.git

To-do

  • TBC

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tdcc-0.0.1.tar.gz (3.2 kB view hashes)

Uploaded Source

Built Distribution

tdcc-0.0.1-py3-none-any.whl (5.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page