mirror of
https://github.com/microsoft/qlib.git
synced 2026-07-02 02:21:18 +08:00
36 lines
1.5 KiB
Markdown
36 lines
1.5 KiB
Markdown
# High-Frequency Dataset
|
|
|
|
This dataset is an example for RL high frequency trading.
|
|
|
|
## Get High-Frequency Data
|
|
|
|
Get high-frequency data by running the following command:
|
|
```bash
|
|
python workflow.py get_data
|
|
```
|
|
|
|
## Dump & Reload & Reinitialize the Dataset
|
|
|
|
|
|
The High-Frequency Dataset is implemented as `qlib.data.dataset.DatasetH` in the `workflow.py`. `DatatsetH` is the subclass of [`qlib.utils.serial.Serializable`](https://qlib.readthedocs.io/en/latest/advanced/serial.html), whose state can be dumped in or loaded from disk in `pickle` format.
|
|
|
|
### About Reinitialization
|
|
|
|
After reloading `Dataset` from disk, `Qlib` also support reinitializing the dataset. It means that users can reset some states of `Dataset` or `DataHandler` such as `instruments`, `start_time`, `end_time` and `segments`, etc., and generate new data according to the states.
|
|
|
|
The example is given in `workflow.py`, users can run the code as follows.
|
|
|
|
### Run the Code
|
|
|
|
Run the example by running the following command:
|
|
```bash
|
|
python workflow.py dump_and_load_dataset
|
|
```
|
|
|
|
## Benchmarks Performance
|
|
### Signal Test
|
|
Here are the results of signal test for benchmark models. We will keep updating benchmark models in future.
|
|
| Model Name | Dataset | IC | ICIR | Rank IC | Rank ICIR | Long precision| Short Precision | Long-Short Average Return | Long-Short Average Sharpe |
|
|
|---|---|---|---|---|---|---|---|---|---|
|
|
| LightGBM | Alpha158 | 0.3042±0.00 | 1.5372±0.00| 0.3117±0.00 | 1.6258±0.00 | 0.6720±0.00 | 0.6870±0.00 | 0.000769±0.00 | 1.0190±0.00 |
|