Skip to main content

A Quantitative-research Platform

Project description

Python Versions Platform PypI Versions Upload Python Package Github Actions Test Status Documentation Status License Join the chat at https://gitter.im/Microsoft/qlib

Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment.

It contains the full ML pipeline of data processing, model training, back-testing; and covers the entire chain of quantitative investment: alpha seeking, risk modeling, portfolio optimization, and order execution.

With Qlib, user can easily try ideas to create better Quant investment strategies.

For more details, please refer to our paper "Qlib: An AI-oriented Quantitative Investment Platform".

Framework of Qlib

At the module level, Qlib is a platform that consists of the above components. The components are designed as loose-coupled modules and each component could be used stand-alone.

Name Description
Infrastructure layer Infrastructure layer provides underlying support for Quant research. DataServer provides high-performance infrastructure for users to manage and retrieve raw data. Trainer provides flexible interface to control the training process of models which enable algorithms controlling the training process.
Workflow layer Workflow layer covers the whole workflow of quantitative investment. Information Extractor extracts data for models. Forecast Model focuses on producing all kinds of forecast signals (e.g. alpha, risk) for other modules. With these signals Portfolio Generator will generate the target portfolio and produce orders to be executed by Order Executor.
Interface layer Interface layer tries to present a user-friendly interface for the underlying system. Analyser module will provide users detailed analysis reports of forecasting signals, portfolios and execution results
  • The modules with hand-drawn style are under development and will be released in the future.
  • The modules with dashed borders are highly user-customizable and extendible.

Quick Start

This quick start guide tries to demonstrate

  1. It's very easy to build a complete Quant research workflow and try your ideas with Qlib.
  2. Though with public data and simple models, machine learning technologies work very well in practical Quant investment.

Installation

Users can easily install Qlib by pip according to the following command

  pip install pyqlib

Note: pip will install the latest stable qlib. However, the main branch of qlib is in active development. If you want to test the latest scripts or functions in the main branch. Please install qlib with the methods below.

Also, users can install the latest dev version Qlib by the source code according to the following steps:

  • Before installing Qlib from source, users need to install some dependencies:

    pip install numpy
    pip install --upgrade  cython
    

    Note: Please pay attention that installing cython in Python 3.6 will raise some error when installing Qlib from source. If users use Python 3.6 on their machines, it is recommended to upgrade Python to version 3.7 or use conda's Python to install Qlib from source.

  • Clone the repository and install Qlib as follows.

    • If you haven't installed qlib by the command pip install pyqlib before:
      git clone https://github.com/microsoft/qlib.git && cd qlib
      python setup.py install
      
    • If you have already installed the stable version by the command pip install pyqlib:
      git clone https://github.com/microsoft/qlib.git && cd qlib
      pip install .
      

    Note: Only the command pip install . can overwrite the stable version installed by pip install pyqlib, which the command python setup.py install can't.

Data Preparation

Load and prepare data by running the following code:

python scripts/get_data.py qlib_data --target_dir ~/.qlib/qlib_data/cn_data --region cn

This dataset is created by public data collected by crawler scripts, which have been released in the same repository. Users could create the same dataset with it.

Please pay ATTENTION that the data is collected from Yahoo Finance and the data might not be perfect. We recommend users to prepare their own data if they have high-quality dataset. For more information, users can refer to the related document.

Auto Quant Research Workflow

Qlib provides a tool named qrun to run the whole workflow automatically (including building dataset, training models, backtest and evaluation). You can start an auto quant research workflow and have a graphical reports analysis according to the following steps:

  1. Quant Research Workflow: Run qrun with lightgbm workflow config (workflow_config_lightgbm.yaml) as following.

      cd examples  # Avoid running program under the directory contains `qlib`
      qrun benchmarks/LightGBM/workflow_config_lightgbm.yaml
    

    The result of qrun is as follows, please refer to please refer to Intraday Trading for more details about the result.

    'The following are analysis results of the excess return without cost.'
                           risk
    mean               0.000708
    std                0.005626
    annualized_return  0.178316
    information_ratio  1.996555
    max_drawdown      -0.081806
    'The following are analysis results of the excess return with cost.'
                           risk
    mean               0.000512
    std                0.005626
    annualized_return  0.128982
    information_ratio  1.444287
    max_drawdown      -0.091078
    

    Here are detailed documents for qrun and workflow.

  2. Graphical Reports Analysis: Run examples/workflow_by_code.ipynb with jupyter notebook to get graphical reports

    • Forecasting signal (model prediction) analysis

      • Cumulative Return of groups Cumulative Return
      • Return distribution long_short
      • Information Coefficient (IC) Information Coefficient
        Monthly IC IC
      • Auto Correlation of forecasting signal (model prediction) Auto Correlation
    • Portfolio analysis

      • Backtest return Report

Building Customized Quant Research Workflow by Code

The automatic workflow may not suite the research workflow of all Quant researchers. To support a flexible Quant research workflow, Qlib also provides a modularized interface to allow researchers to build their own workflow by code. Here is a demo for customized Quant research workflow by code.

Quant Model Zoo

Here is a list of models built on Qlib.

Your PR of new Quant models is highly welcomed.

The performance of each model on the Alpha158 and Alpha360 dataset can be found here.

Run a single model

All the models listed above are runnable with Qlib. Users can find the config files we provide and some details about the model through the benchmarks folder. More information can be retrieved at the model files listed above.

Qlib provides three different ways to run a single model, users can pick the one that fits their cases best:

  • User can use the tool qrun mentioned above to run a model's workflow based from a config file.

  • User can create a workflow_by_code python script based on the one listed in the examples folder.

  • User can use the script run_all_model.py listed in the examples folder to run a model. Here is an example of the specific shell command to be used: python run_all_model.py --models=lightgbm, where the --models arguments can take any number of models listed above(the available models can be found in benchmarks). For more use cases, please refer to the file's docstrings.

Run multiple models

Qlib also provides a script run_all_model.py which can run multiple models for several iterations. (Note: the script only support Linux for now. Other OS will be supported in the future. Besides, it doesn't support parrallel running the same model for multiple times as well, and this will be fixed in the future development too.)

The script will create a unique virtual environment for each model, and delete the environments after training. Thus, only experiment results such as IC and backtest results will be generated and stored.

Here is an example of running all the models for 10 iterations:

python run_all_model.py 10

It also provides the API to run specific models at once. For more use cases, please refer to the file's docstrings.

Quant Dataset Zoo

Dataset plays a very important role in Quant. Here is a list of the datasets built on Qlib:

Dataset US Market China Market
Alpha360
Alpha158

Here is a tutorial to build dataset with Qlib. Your PR to build new Quant dataset is highly welcomed.

More About Qlib

The detailed documents are organized in docs. Sphinx and the readthedocs theme is required to build the documentation in html formats.

cd docs/
conda install sphinx sphinx_rtd_theme -y
# Otherwise, you can install them with pip
# pip install sphinx sphinx_rtd_theme
make html

You can also view the latest document online directly.

Qlib is in active and continuing development. Our plan is in the roadmap, which is managed as a github project.

Offline Mode and Online Mode

The data server of Qlib can either deployed as Offline mode or Online mode. The default mode is offline mode.

Under Offline mode, the data will be deployed locally.

Under Online mode, the data will be deployed as a shared data service. The data and their cache will be shared by all the clients. The data retrieval performance is expected to be improved due to a higher rate of cache hits. It will consume less disk space, too. The documents of the online mode can be found in Qlib-Server. The online mode can be deployed automatically with Azure CLI based scripts. The source code of online data server can be found in Qlib-Server repository.

Performance of Qlib Data Server

The performance of data processing is important to data-driven methods like AI technologies. As an AI-oriented platform, Qlib provides a solution for data storage and data processing. To demonstrate the performance of Qlib data server, we compare it with several other data storage solutions.

We evaluate the performance of several storage solutions by finishing the same task, which creates a dataset (14 features/factors) from the basic OHLCV daily data of a stock market (800 stocks each day from 2007 to 2020). The task involves data queries and processing.

HDF5 MySQL MongoDB InfluxDB Qlib -E -D Qlib +E -D Qlib +E +D
Total (1CPU) (seconds) 184.4±3.7 365.3±7.5 253.6±6.7 368.2±3.6 147.0±8.8 47.6±1.0 7.4±0.3
Total (64CPU) (seconds) 8.8±0.6 4.2±0.2
  • +(-)E indicates with (out) ExpressionCache
  • +(-)D indicates with (out) DatasetCache

Most general-purpose databases take too much time on loading data. After looking into the underlying implementation, we find that data go through too many layers of interfaces and unnecessary format transformations in general-purpose database solutions. Such overheads greatly slow down the data loading process. Qlib data are stored in a compact format, which is efficient to be combined into arrays for scientific computation.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the right to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

pyqlib-0.6.1-cp38-cp38-win_amd64.whl (285.8 kB view details)

Uploaded CPython 3.8 Windows x86-64

pyqlib-0.6.1-cp38-cp38-manylinux2010_x86_64.whl (680.7 kB view details)

Uploaded CPython 3.8 manylinux: glibc 2.12+ x86-64

pyqlib-0.6.1-cp38-cp38-manylinux1_x86_64.whl (680.7 kB view details)

Uploaded CPython 3.8

pyqlib-0.6.1-cp38-cp38-macosx_10_14_x86_64.whl (277.7 kB view details)

Uploaded CPython 3.8 macOS 10.14+ x86-64

pyqlib-0.6.1-cp37-cp37m-win_amd64.whl (284.0 kB view details)

Uploaded CPython 3.7m Windows x86-64

pyqlib-0.6.1-cp37-cp37m-manylinux2010_x86_64.whl (644.0 kB view details)

Uploaded CPython 3.7m manylinux: glibc 2.12+ x86-64

pyqlib-0.6.1-cp37-cp37m-manylinux1_x86_64.whl (644.0 kB view details)

Uploaded CPython 3.7m

pyqlib-0.6.1-cp37-cp37m-macosx_10_14_x86_64.whl (277.1 kB view details)

Uploaded CPython 3.7m macOS 10.14+ x86-64

pyqlib-0.6.1-cp36-cp36m-win_amd64.whl (284.2 kB view details)

Uploaded CPython 3.6m Windows x86-64

pyqlib-0.6.1-cp36-cp36m-manylinux2010_x86_64.whl (641.0 kB view details)

Uploaded CPython 3.6m manylinux: glibc 2.12+ x86-64

pyqlib-0.6.1-cp36-cp36m-manylinux1_x86_64.whl (641.0 kB view details)

Uploaded CPython 3.6m

pyqlib-0.6.1-cp36-cp36m-macosx_10_14_x86_64.whl (278.4 kB view details)

Uploaded CPython 3.6m macOS 10.14+ x86-64

File details

Details for the file pyqlib-0.6.1-cp38-cp38-win_amd64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp38-cp38-win_amd64.whl
  • Upload date:
  • Size: 285.8 kB
  • Tags: CPython 3.8, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.6

File hashes

Hashes for pyqlib-0.6.1-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 0c962e49b3cd72baefa6b995b9d625e60d34ef6ddb337c82c2650072baab7524
MD5 fb8de9e5f2698244df82a97d439a50c7
BLAKE2b-256 e46a60d57bd52cfc2fba359f687f847c7746355a5cd4a40002b6fcd8344ad260

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp38-cp38-manylinux2010_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp38-cp38-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 680.7 kB
  • Tags: CPython 3.8, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp38-cp38-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 990ff4ec69ddeda49f90a1cc0fb2575fb82a6600aeec289d93b6f885ec35ad63
MD5 b8a491fac6e2b0a80058d7c92f356584
BLAKE2b-256 70b66c6db8069c5a38d50ac651f7107005bc4d092bf4787e11d640d602819b45

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp38-cp38-manylinux1_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp38-cp38-manylinux1_x86_64.whl
  • Upload date:
  • Size: 680.7 kB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp38-cp38-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 d3c25711b582b743aa507370456e6be55bea9d1783a009bcc5a99f9860c29a95
MD5 56857c00d188d3ed7343329230adb9b0
BLAKE2b-256 117887d8fa049b43b06654ec9ea98014d5701be9f93457c275432eede807cf0d

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp38-cp38-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp38-cp38-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 277.7 kB
  • Tags: CPython 3.8, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.6

File hashes

Hashes for pyqlib-0.6.1-cp38-cp38-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 99d4592701edcd7f610e2fca7b02ca7608ce617ea8f8e07923771831ba013493
MD5 5877a982b8bdf1b420c4ce63ed4bf51b
BLAKE2b-256 ccbdcac25a95dd82ce698dccc9554379602fa5dd2f42de72403d54f29b977961

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 284.0 kB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 85ab9608544ee07a06a0d693ce8c42411da499f245838b8d074d675dedc02f33
MD5 a96e889e74c3272edcf437766dfd9d9f
BLAKE2b-256 53c35b0c04bb076ebba62909d05bf7890c39b5c563ac58133e6ae10c4e0fa779

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp37-cp37m-manylinux2010_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp37-cp37m-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 644.0 kB
  • Tags: CPython 3.7m, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp37-cp37m-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 7bb2fcd3d269e04459ee4baea924ace784060dd6159d5899fc3f506c7b1af1af
MD5 2886fd859971c52a9c1908f7fbf0eae6
BLAKE2b-256 d95b31071be40d5f5b931299beb8da1b8519cc529485752e33e6664d27bc0fbd

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 644.0 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 ac0718b6922566bb6fa45882cc60d32483ccd59ab670d783a123eba22c3907c2
MD5 b77a55487d6be1c15ce058bbc3242b18
BLAKE2b-256 396d25787a1ac8395f758c20e405788caffab25067515c0e2ea056440fcc0205

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp37-cp37m-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp37-cp37m-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 277.1 kB
  • Tags: CPython 3.7m, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp37-cp37m-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 c5739fcd34270f121eb49b132982e502e0e1c6ed29ca1cc960e5d5c5e54af1bb
MD5 c5deb5a9baf464a4876ca1c5676009e5
BLAKE2b-256 e232da54e89247ee33d79e91f0afca37fec2fc3bb82682b46a3cea23ab9270b5

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 284.2 kB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.6.8

File hashes

Hashes for pyqlib-0.6.1-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 752aa92b50224a2164fafbb9d5ff63d93e4780d29c89d0e2f1134654b7b73129
MD5 f2ba0e36010d1c46aa0ffabf83d09cf1
BLAKE2b-256 23ade82dcf126f412fd487cae7f9a31152675138de2d342b964a07440599af3d

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp36-cp36m-manylinux2010_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp36-cp36m-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 641.0 kB
  • Tags: CPython 3.6m, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp36-cp36m-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 216373c8efa622368a486602a763094fc6ac1c8a56d0d8db86c712c9488a8286
MD5 ea8601c66e42edcced30fb9ceda685a2
BLAKE2b-256 6de4e3b3d0908c97270820d0080274332fca8b252e1bde0d112d53e4af9886bc

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 641.0 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for pyqlib-0.6.1-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 0b9d34ab9a8cbd026b8765b3575d506a85afaaee16063fa412e8c0806762d726
MD5 366b5ac22bdc63c418e5fcf0cbc1f22f
BLAKE2b-256 4894b04af72fbecca795fb68a86e486455c96e9ca80a5348434563da3a77fb60

See more details on using hashes here.

File details

Details for the file pyqlib-0.6.1-cp36-cp36m-macosx_10_14_x86_64.whl.

File metadata

  • Download URL: pyqlib-0.6.1-cp36-cp36m-macosx_10_14_x86_64.whl
  • Upload date:
  • Size: 278.4 kB
  • Tags: CPython 3.6m, macOS 10.14+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.6.12

File hashes

Hashes for pyqlib-0.6.1-cp36-cp36m-macosx_10_14_x86_64.whl
Algorithm Hash digest
SHA256 6fe2b0a5cd9df1978f608cda7339981d6954071ed3674b610397434685c8d3bd
MD5 16279d034eba8c482466b2f154e97e80
BLAKE2b-256 472bb72814c4fe4bb390bae7db31ceb1e88619da3da782385dfd172485ffdc41

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page