Blaze the Way of Quantitative Trading

阿新 • Published: 2019-02-02

http://www.quantstart.com/articles/Basics-of-Statistical-Mean-Reversion-Testing

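Trend following/momentum and mean reversion are the two most basic approaches to strategy design. For the latter, we first need to know whether the time series is actually mean reverting. Mathematically, a continuous mean-reverting process is the Ornstein-Uhlenbeck (OU) process, as opposed to Brownian motion. We can use pandas and statsmodels to run the Augmented Dickey-Fuller (ADF) test and decide whether a series is mean reverting (mean reversion is a necessary condition for stationarity). The null hypothesis is that the series has a unit root: if the test statistic is greater than the critical value, we cannot reject the null hypothesis, i.e. the series is not mean reverting but behaves like a random walk. If the data is fetched from an online database, the process can be written as: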

# Import the Time Series library
import statsmodels.tsa.stattools as ts

# Import Datetime and the Pandas DataReader
# (pandas.io.data was moved out of pandas into the separate pandas_datareader package)
from datetime import datetime
from pandas_datareader.data import DataReader

# Download the Google OHLCV data from 1/1/2000 to 1/1/2013
goog = DataReader("GOOG", "yahoo", datetime(2000, 1, 1), datetime(2013, 1, 1))

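First, run a unit-root test on the time series (here the adjusted close price, with a lag order of 1):

ts.adfuller(goog['Adj Close'], 1)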
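To make the decision rule concrete, here is a minimal sketch of the regression behind the test, applied to simulated data. It is a simplified Dickey-Fuller regression with a constant and no lagged difference terms, not the statsmodels implementation. The function name `dickey_fuller_tstat`, the AR(1) coefficient 0.5 and the seed are illustrative assumptions; -2.86 is the standard asymptotic 5% critical value for the constant-only case.

```python
import numpy as np

def dickey_fuller_tstat(y):
    """t-statistic of gamma in  dy_t = alpha + gamma * y_{t-1} + e_t."""
    y = np.asarray(y, dtype=float)
    dy = np.diff(y)                                   # dy_t = y_t - y_{t-1}
    X = np.column_stack([np.ones(len(dy)), y[:-1]])   # constant and lagged level
    coef, *_ = np.linalg.lstsq(X, dy, rcond=None)     # OLS estimate of (alpha, gamma)
    resid = dy - X @ coef
    sigma2 = resid @ resid / (len(dy) - X.shape[1])   # residual variance
    cov = sigma2 * np.linalg.inv(X.T @ X)             # covariance of the estimates
    return coef[1] / np.sqrt(cov[1, 1])

rng = np.random.default_rng(42)
n = 2000

# Random walk: has a unit root, so the null hypothesis is true
random_walk = np.cumsum(rng.standard_normal(n))

# AR(1) with coefficient 0.5: a discrete-time analogue of an OU process
mean_reverting = np.zeros(n)
eps = rng.standard_normal(n)
for t in range(1, n):
    mean_reverting[t] = 0.5 * mean_reverting[t - 1] + eps[t]

CRITICAL_5PCT = -2.86  # asymptotic 5% critical value (constant, no trend)

for name, series in [("random walk", random_walk), ("mean reverting", mean_reverting)]:
    stat = dickey_fuller_tstat(series)
    verdict = "reject unit root" if stat < CRITICAL_5PCT else "cannot reject unit root"
    print(f"{name}: t = {stat:.2f} -> {verdict}")
```

A strongly negative statistic, below the critical value, is evidence against the unit root. A statistic above the critical value leaves the random-walk hypothesis standing.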

We can also use the Hurst exponent to test for stationarity:

H<0.5 - The time series is mean reverting
H=0.5 - The time series is a Geometric Brownian Motion
H>0.5 - The time series is trending

from numpy import cumsum, log, polyfit, sqrt, std, subtract
from numpy.random import randn

def hurst(ts):
    """Returns the Hurst Exponent of the time series vector ts"""
    # Create the range of lag values
    lags = range(2, 100)
    # Calculate the array of the variances of the lagged differences
    tau = [sqrt(std(subtract(ts[lag:], ts[:-lag]))) for lag in lags]
    # Use a linear fit to estimate the Hurst Exponent
    poly = polyfit(log(lags), log(tau), 1)
    # Return the Hurst exponent from the polyfit output
    return poly[0] * 2.0

# Create a Geometric Brownian Motion, Mean-Reverting and Trending Series
gbm = log(cumsum(randn(100000)) + 1000)
mr = log(randn(100000) + 1000)
tr = log(cumsum(randn(100000) + 1) + 1000)

# Output the Hurst Exponent for each of the above series
# and the price of Google (the Adjusted Close price) for
# the ADF test given above in the article
print("Hurst(GBM):   %s" % hurst(gbm))
print("Hurst(MR):    %s" % hurst(mr))
print("Hurst(TR):    %s" % hurst(tr))

# Assuming you have run the above code to obtain 'goog'!
# .values passes a plain array, so the lagged slices are subtracted by
# position instead of being aligned on the date index
print("Hurst(GOOG):  %s" % hurst(goog['Adj Close'].values))
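An estimated exponent never comes out at exactly 0.5, so in practice the three regimes listed earlier need a tolerance band. The helper below is a minimal sketch of that mapping. Both the name `classify_hurst` and the default band of 0.05 are assumptions made for illustration, not values from the referenced article.

```python
def classify_hurst(h, tol=0.05):
    """Map an estimated Hurst exponent to one of the three regimes.

    tol is an assumed tolerance band around 0.5; tune it to the
    estimation noise of the series at hand.
    """
    if h < 0.5 - tol:
        return "mean reverting"
    if h > 0.5 + tol:
        return "trending"
    return "random walk (GBM-like)"

# Illustrative inputs only; these are not measured values
for h in (0.1, 0.5, 0.9):
    print(h, "->", classify_hurst(h))
```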

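Next we look at testing for cointegration with the Cointegrated Augmented Dickey-Fuller (CADF) test. The essence of cointegration is that a linear combination of two series is stationary.

# In statistical arbitrage we often need to draw scatter plots and run cointegration tests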

# cadf.py

import datetime
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.dates as mdates
import pandas as pd
# pandas.io.data was moved out of pandas into the separate pandas_datareader package
import pandas_datareader.data as web
import pprint
import statsmodels.tsa.stattools as ts


# Daily-price version with monthly ticks
def plot_price_series(df, ts1, ts2):
    months = mdates.MonthLocator()  # could also be DayLocator or HourLocator
    fig, ax = plt.subplots()
    ax.plot(df.index, df[ts1], label=ts1)
    ax.plot(df.index, df[ts2], label=ts2)
    ax.xaxis.set_major_locator(months)
    ax.xaxis.set_major_formatter(mdates.DateFormatter('%b %Y'))
    ax.set_xlim(datetime.datetime(2012, 1, 1), datetime.datetime(2013, 1, 1))
    ax.grid(True)
    fig.autofmt_xdate()

    plt.xlabel('Month/Year')
    plt.ylabel('Price ($)')
    plt.title('%s and %s Daily Prices' % (ts1, ts2))
    plt.legend()
    plt.show()


# Intraday version of the same script. The DataFrame used in the main block
# is d; the processed df corresponds to results such as mergeDF and Pnl.
import datetime
import pprint
import matplotlib.pyplot as plt
import matplotlib.dates as mdates
import pandas as pd
# pandas.stats.api.ols was removed from pandas; statsmodels OLS is used instead
import statsmodels.api as sm
import statsmodels.tsa.stattools as ts


def plot_price_series(df, ts1, ts2):
    hours = mdates.HourLocator()  # could also be DayLocator or MinuteLocator
    fig, ax = plt.subplots()
    # The index has to be reset first, because the original index is (pTime, sym)
    ax.plot(df.index, df[ts1], label=ts1)
    ax.plot(df.index, df[ts2], label=ts2)
    ax.xaxis.set_major_locator(hours)
    ax.xaxis.set_major_formatter(mdates.DateFormatter('%b %Y'))
    # Start and end must match the sample (start has to come before end)
    ax.set_xlim(datetime.datetime(2014, 1, 1), datetime.datetime(2014, 1, 2))
    ax.grid(True)
    fig.autofmt_xdate()

    plt.xlabel('Hour/Year')
    plt.ylabel('TrdPriceLast ($)')
    plt.title('%s and %s Prices' % (ts1, ts2))
    plt.legend()
    plt.show()


def plot_scatter_series(df, ts1, ts2):
    plt.xlabel('%s TrdPriceLast ($)' % ts1)
    plt.ylabel('%s TrdPriceLast ($)' % ts2)
    plt.title('%s and %s Price Scatterplot' % (ts1, ts2))
    plt.scatter(df[ts1], df[ts2])
    plt.show()


def plot_residuals(df):
    hours = mdates.HourLocator()
    fig, ax = plt.subplots()
    ax.plot(df.index, df["res"], label="Residuals")
    ax.xaxis.set_major_locator(hours)
    ax.xaxis.set_major_formatter(mdates.DateFormatter('%b %Y'))
    ax.set_xlim(datetime.datetime(2014, 1, 1), datetime.datetime(2014, 1, 2))
    ax.grid(True)
    fig.autofmt_xdate()

    plt.xlabel('Hour/Year')
    plt.ylabel('TrdPriceLast ($)')
    plt.title('Residual Plot')
    plt.legend()

    plt.plot(df["res"])
    plt.show()


if __name__ == "__main__":
    # NOTE: d is the author's own DataFrame and is not defined in this post.
    # It is assumed to be indexed by (pTime, sym) with a "TrdPriceLast" column.
    d = d.reset_index()
    d = d.set_index(["pTime"])

    # Build the two legs on a shared time index and keep only the
    # timestamps where both symbols have a price
    df = pd.DataFrame(index=d[d["sym"] == "D"].index)
    df["D"] = d[d["sym"] == "D"]["TrdPriceLast"]
    df["E"] = d[d["sym"] == "E"]["TrdPriceLast"]
    df = df.dropna()

    # Plot the two time series
    plot_price_series(df, "D", "E")

    # Display a scatter plot of the two time series
    plot_scatter_series(df, "D", "E")

    # Calculate optimal hedge ratio "beta" (OLS of D on E with an intercept)
    res = sm.OLS(df["D"], sm.add_constant(df["E"])).fit()
    beta_hr = res.params["E"]

    # Calculate the residuals of the linear combination
    df["res"] = df["D"] - beta_hr * df["E"]

    # Plot the residuals
    plot_residuals(df)

    # Calculate and output the CADF test on the residuals
    cadf = ts.adfuller(df["res"])
    pprint.pprint(cadf)
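Because the script depends on data that is not included here, the hedge-ratio step can be tried on a synthetic pair instead. The sketch below is an illustration under stated assumptions: the series are simulated, the true ratio of 2.0 and the seed are arbitrary, and `np.polyfit` stands in for the OLS call. The resulting residuals are what would then be passed to `ts.adfuller`.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 1000

# Leg "E": a random walk, so it is non-stationary on its own
e = 100.0 + np.cumsum(rng.standard_normal(n))

# Leg "D": 2 * E plus stationary noise, so D - 2 * E is stationary by construction
true_beta = 2.0  # assumed value, chosen only for this simulation
d_leg = true_beta * e + rng.standard_normal(n)

# OLS of D on E with an intercept; polyfit returns [slope, intercept]
beta_hr, intercept = np.polyfit(e, d_leg, 1)

# Residuals of the linear combination, built the same way as in the script
res = d_leg - beta_hr * e

print("estimated hedge ratio:", round(beta_hr, 3))
print("std of residuals:", round(res.std(), 3), "vs std of D:", round(d_leg.std(), 3))
```

Each leg wanders freely, yet the residual series stays in a narrow band around a constant. That bounded spread is the property the CADF test checks for.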

http://www.quantstart.com/articles/Forecasting-Financial-Time-Series-Part-1

http://matplotlib.org/api/dates_api.html
MinuteLocator
HourLocator

http://blog.sina.com.cn/s/blog_02cf67f00101iuuh.html
