A crawler for structured product information from TDCC (Taiwan).

Taiwan Structured Product Information Crawler

This repository provides a StructuredProductCrawler class that crawls the Taiwan TDCC website for structured product information.


from tdcc import StructuredProductCrawler

crawler = StructuredProductCrawler()
# crawl() fetches the product listing and returns it as a pandas DataFrame
all_products = crawler.crawl()

crawl() returns a pandas DataFrame with the following columns:

Column Name            Description
URL                    the product's partial URL
UID                    product ID
NAME                   product name
CURRENCY               product denomination
MATURITY               maturity date
UNDERLYING             underlying asset type
PRINCIPAL_PROTECTION   % of principal protection
PI                     professional investor
ISSUE_DATE             issue date
ISSUER                 issuer
MASTER_AGENT           master agent
DISTRIBUTOR            distributor
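
Since the project is in the alpha stage and the column set may change between releases, it can help to check a crawled result against the column names listed above before relying on them. The following is a minimal sketch of such a check; EXPECTED_COLUMNS and missing_columns are hypothetical names introduced here for illustration and are not part of the tdcc package.

```python
# Column names copied from the table above (hypothetical constant, not from tdcc).
EXPECTED_COLUMNS = (
    "URL", "UID", "NAME", "CURRENCY", "MATURITY", "UNDERLYING",
    "PRINCIPAL_PROTECTION", "PI", "ISSUE_DATE", "ISSUER",
    "MASTER_AGENT", "DISTRIBUTOR",
)


def missing_columns(columns):
    """Return the documented column names absent from `columns`, in table order."""
    present = set(columns)
    return [name for name in EXPECTED_COLUMNS if name not in present]
```

With a crawled frame, missing_columns(all_products.columns) should return an empty list when the result matches the table; any names it does return point to columns that were renamed or dropped.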


To install the released version from PyPI, type:

pip install tdcc

To install the latest version from this repository (the project is in the alpha stage, so updates may be frequent), type:

pip install git+git://
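
After either install, it may be useful to confirm which version of the package ended up in the environment. The sketch below uses the standard library's importlib.metadata (Python 3.8+); installed_version is a hypothetical helper name, and the distribution name "tdcc" is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# Prints the installed tdcc version, or None if tdcc is not installed.
print(installed_version("tdcc"))
```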



Download files

Source Distribution

tdcc-0.0.2.tar.gz (5.2 kB)

Built Distribution

tdcc-0.0.2-py3-none-any.whl (6.9 kB)
