Skip to main content

A crawler for structured product information from TDCC (Taiwan).

Project description

Taiwan Structured Product Information Crawler

PyPI version PyPI license Package Version Github Last Commit

This is a repository that offers a StructuredProductCrawler class to crawl Taiwan TDCC website for the product information.

Tutorial


from tdcc import StructuredProductCrawler
crawler = StructuredProductCrawler()
all_products = crawler.crawl()

crawl() returns a Pandas DataFrame. Data columns include:

Column Name Are
URL the product's partial url
UID product id
NAME product name
CURRENCY product denomination
MATURITY maturity date
UNDERLYING underlying asset type
PRINCIPAL_PROTECTION % of principal protection
PI professional investor
ISSUE_DATE issue date
ISSUER issuer
MASTER_AGENT master agent
DISTRIBUTOR distributor

Installation

To install this verson from PyPI, type:


pip install tdcc

To get the newest one from this repo (note that we are in the alpha stage, so there may be frequent updates), type:


pip install git+git://github.com/jn8029/tdcc.git

To-do

TBC

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tdcc-0.0.2.tar.gz (5.2 kB view hashes)

Uploaded Source

Built Distribution

tdcc-0.0.2-py3-none-any.whl (6.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page