Skip to main content

A crawler for structured product information from TDCC (Taiwan).

Project description

Taiwan Structured Product Information Crawler

PyPI version PyPI license Package Version Github Last Commit

This is a repository that offers a StructuredProductCrawler class to crawl Taiwan TDCC website for the product information.

Tutorial


from tdcc import StructuredProductCrawler
crawler = StructuredProductCrawler()
all_products = crawler.crawl()

crawl() returns a Pandas DataFrame. Data columns include:

Column Name Are
URL the product's partial url
UID product id
NAME product name
CURRENCY product denomination
MATURITY maturity date
UNDERLYING underlying asset type
PRINCIPAL_PROTECTION % of principal protection
PI professional investor
ISSUE_DATE issue date
ISSUER issuer
MASTER_AGENT master agent
DISTRIBUTOR distributor

Installation

To install this verson from PyPI, type:


pip install tdcc

To get the newest one from this repo (note that we are in the alpha stage, so there may be frequent updates), type:


pip install git+git://github.com/jn8029/tdcc.git

To-do

TBC

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for tdcc, version 0.0.2
Filename, size File type Python version Upload date Hashes
Filename, size tdcc-0.0.2-py3-none-any.whl (6.9 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size tdcc-0.0.2.tar.gz (5.2 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page