A crawler for structured product information from TDCC (Taiwan).
Project description
Taiwan Structured Product Information Crawler
This repository provides a StructuredProductCrawler class that crawls the Taiwan Depository & Clearing Corporation (TDCC) website for structured product information.
Tutorial
from tdcc import StructuredProductCrawler
crawler = StructuredProductCrawler()
all_products = crawler.crawl()
crawl() returns a pandas DataFrame.
Data columns include:
Column Name | Description |
---|---|
URL | the product's partial URL |
UID | product ID |
NAME | product name |
CURRENCY | product denomination |
MATURITY | maturity date |
UNDERLYING | underlying asset type |
PRINCIPAL_PROTECTION | % of principal protection |
PI | professional investor |
ISSUE_DATE | issue date |
ISSUER | issuer |
MASTER_AGENT | master agent |
DISTRIBUTOR | distributor |
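Because the crawler depends on the layout of an external website, it can be worth checking that the result still has the documented columns before using it. The following is a minimal sketch, not part of the tdcc package: `EXPECTED_COLUMNS` and `missing_columns` are hypothetical names introduced here, and the column list is simply copied from the table above.

```python
# Column names copied from the table above (hypothetical constant,
# not exported by the tdcc package).
EXPECTED_COLUMNS = (
    "URL", "UID", "NAME", "CURRENCY", "MATURITY", "UNDERLYING",
    "PRINCIPAL_PROTECTION", "PI", "ISSUE_DATE", "ISSUER",
    "MASTER_AGENT", "DISTRIBUTOR",
)


def missing_columns(columns):
    """Return the documented column names absent from `columns`, in table order."""
    present = set(columns)
    return [name for name in EXPECTED_COLUMNS if name not in present]
```

With a crawled result, this could be called as `missing_columns(all_products.columns)`; an empty list means every documented column is present.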
Installation
To install this version from PyPI, type:
pip install tdcc
To install the latest version from this repository (the project is in the alpha stage, so updates may be frequent), type:
pip install git+https://github.com/jn8029/tdcc.git
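After installing, one way to confirm which version is present is to query the package metadata from the standard library. This is a small sketch that assumes Python 3.8 or newer; `installed_version` is a hypothetical helper, not something the tdcc package provides.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# Prints the installed tdcc version, or None if it is not installed.
print(installed_version("tdcc"))
```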
To-do
TBC
Download files
Source Distribution
tdcc-0.0.2.tar.gz (5.2 kB)
Built Distribution
tdcc-0.0.2-py3-none-any.whl (6.9 kB)