Demystifying pairs trading: The role of volatility and ... Pairs Trading - … · Electronic copy available at : http ://ssrn.com /abstract = 2774063 Demystifying pairs trading: The

Electronic copy available at: http://ssrn.com/abstract=2774063

Demystifying pairs trading: The role of volatility and

correlation

Stephanie Sarah Riedinger

Catholic University Eichstaett-Ingolstadt

Auf der Schanz 49, 85049 Ingolstadt, Germany

Phone: 0049 841 937 219 26, Mail to: [email protected]

Abstract

This paper investigates how the two technical drivers, volatility and correlation, in-

fluence the algorithm of the pairs trading investment strategy. We model and em-

pirically prove the connection between the rule-based pair selection, the trading al-

gorithm, and the total return. Our insights explain why pairs trading profitability

varies across markets, industries, macroeconomic circumstances, and firm charac-

teristics. Furthermore, we critically evaluate the power of the traditionally applied

pair selection procedure. In the US market, we find risk-adjusted monthly returns of

up to 76bp for portfolios, which are double sorted on volatility and correlation be-

tween 1990 and 2014. Our findings are robust to liquidity issues, bid-ask spread,

and limits of arbitrage.

Keywords: Pairs trading, Relative-value arbitrage, Volatility, Limits of arbitrage

JEL Classification: G11, G12, G17

Electronic copy available at: http://ssrn.com/abstract=2774063

1. Introduction

Does the act of measuring influence the outcome of an experiment? In science, this question is

frequently asked to identify the Hawthorne effect, also called observer effect. Empirical find-

ings lose their credibility, if the applied methodology interferes with the outcome.1 We trans-

fer this issue to investing and ask whether the return of a technical, rule-based strategy is driv-

en by the algorithm itself. In this paper, we answer the initial question for the highly recog-

nized pairs trading investment strategy, which exploits arbitrage profits from temporary mar-

ket inefficiencies.2 Gatev/Goetzmann/Rouwenhorst (2006) find that certain industry portfolios

outperform others. Do/Faff (2010) explain a superior return in times of market distress, be-

tween January 2000-December 2002 and July 2007-June 2009, with a decrease of market effi-

ciency. Jacobs/Weber (2015) observe varying returns across different countries and propose

the number of eligible pairs and limited investor’s attention as possible explanations. Without

doubt, less efficient markets or industries allow more mispricing opportunities, which in turn

increase the strategy’s return. Yet, this paper’s main purpose is to demonstrate that at least

part of the additional return originates from the interaction of the algorithm with higher mar-

ket volatility and correlation levels over time, across countries, and across industries.

We extend the current knowledge about pairs trading with two important findings. Firstly, we

provide evidence that the traditional formula to select pairs privileges pairs with high correla-

tion and low stock volatility, while disregarding other pairs. However, this selection fails to

exploit the full return potential. Therefore, we also study the returns of pairs with different

pair volatility and correlation levels. Secondly, our results demonstrate that pair volatility and

correlation significantly influence the two return dimensions, the return per trade and the trad-

ing frequency, through the trading algorithm. This paper contributes furthermore to the litera-

ture on gradual information diffusion, volatility timing, and limits to arbitrage. Altogether, our

findings contribute to better understand previous academic findings by providing a technical

foundation, how the input factors, pair volatility and pair correlation, interact with pairs trad-

ing in the cross-section and over time. In this context, pair volatility and pair correlation are

1 For instance, in quantum physics it is impossible to detect an electron without interacting with a photon. How-

ever, this interaction biases the electron’s path. 2 In contrast to the classical Hawthorne effect, the observed object, the return, does not adapt its behavior as it is

observed. Rather, the observation system, that is the algorithm, influences the return.

the technical executing factors that translate illiquidity, market distress and other factors into

higher returns. Furthermore, our findings are of high relevance for practitioners, who wish to

implement profit increasing modifications. Finally, our results impressively emphasize the

importance of critically reflecting the impact of a rule-based procedure on the outcome.

The profitability of pairs trading is often explained by the information diffusion hypothesis

(Gatev/Goetzmann/Rouwenhorst (2006), Engelberg/Gao/Jagannathan (2009); Chen/Chen/Li

(2013), Jacobs/Weber (2013, 2015)). The hypothesis argues that pairs trading exploits tempo-

rary mispricing, which arises as one stock incorporates common news faster than another one.

Based thereupon, the idea of univariate pairs trading is simple. Each month, a new pairs trad-

ing period, consisting of a twelve-months identification period and a subsequent six-months

trading period starts. Gatev/Goetzmann/Rouwenhorst (2009) suggest a simple measure to se-

lect pairs for trading, the sum of squared price differences (SSD). At the beginning of each

identification period, they normalize all stock prices to one and compute the daily price dif-

ference between any possible pair combination in the subsequent twelve months. For each

pair, the SSD cumulates the daily price spreads: SSD = ∑ (Pt,a-Pt,b)2T

t=1 , where Pt,a is the nor-

malized stock price of stock a at day t. Furthermore, Gatev/Goetzmann/Rouwenhorst (2009)

compute the standard deviation of a pair’s price spread. The twenty pairs with the lowest

SSD form a trading portfolios, and they are eligible for trading in the consecutive six-months

trading period. Low SSD stocks are chosen, because a small SSD indicates similar price de-

velopment of a pair’s stocks in the past and is expected to predict further commove in the fol-

lowing trading period. At the beginning of the trading period, all prices are again normalized

to one. During the next six months, the author’s check every day whether the price spread of

any pair included in the trading portfolio exceeds two times the standard deviation , as com-

puted during the identification period (2-rule). If the 2-rule applies, they initiate a one unit

long position in the worse performing ‘loser’ stock and a one unit short position in the better

performing ‘winner’ stock. The underlying idea is that the price spread is a temporary mis-

pricing, as both stocks are expected to commove. The positions exploit the anticipated price

reversion and are neutralized as soon as both normalized prices fully converge. On average

during January 1991 and December 2014, a traditional pairs trading portfolio earns a monthly

risk adjusted return of 37bp in the US market. As the complete procedure is highly rule-

driven, we question how the strictly defined process influences the strategy’s return.

This paper challenges the traditional selection procedure and trades pairs that are sorted on

pair volatility and pair correlation. As previously mentioned, we demonstrate that the selec-

tion procedure picks highly correlated pairs with low volatility. Therefore, we are highly in-

terested how traditionally disregarded pairs, with various volatility and correlation levels, per-

form. We suspect that volatility and correlation not only influence the selection formula, but

also the trading formula, due to the technical nature of both formulas. During the twelve-

month identification period, we calculate the volatility of each stock and the correlation of all

possible stock combinations based on daily prices. We refer to this correlation as pair correla-

tion ρAB. Furthermore, we define the pair volatility σAB of a pair as the sum of stock A’s and

stock B’s individual historic volatility σAB = σA2 + σB

2 during the identification period. A port-

folio, containing highly correlated pairs with low pair volatility, earns a monthly risk adjusted

pairs trading return of 47bp, whereas a portfolio with similar pair correlation, but highly vola-

tile, stocks earns even 60% more. Even a portfolio with low pair volatility and negatively cor-

related pairs earns a return of 46bp. We wonder whether this astonishing outperformance is

incidental. Therefore, we systematically investigate the return of different levels of pair vola-

tility and pair correlation. Each month, we allocate all pair combinations based on their pair

volatility σAB and their correlation ρAB to one of five correlation quintiles and one of five pair

volatility quintiles. Afterwards, we randomly select twenty pairs out of the intersection of

each pair volatility and pair correlation quintile and include them into one of twenty-five trad-

ing portfolio. These portfolios are labelled VCS Portfolios (volatility and correlation sorted

portfolios) in the following. Pairs within these VCS Portfolios are eligible for trading in the

consecutive trading period. The most surprising result from the analysis is that twenty, out of

twenty-five, VCS Portfolios significantly outperform the return of the traditional portfolio,

with monthly risk-adjusted returns of 39bp up to 209bp. The return increases for higher pair

volatility and higher correlation, although both effects are not linear across the quintile levels.

Actually, the traditional pairs trading algorithm was designed to perfectly exploit arbitrage

opportunities. However, the demonstrated failure to outperform alternatively formed portfoli-

os emphasizes the need to better understand how the algorithm really works.

To demystify the ‘black box’ pairs trading, we model the technical interdependencies of the

rule-based selection and trading algorithm with the strategy’s return generating components in

figure 1. We argue that the algorithm influences two major components, the selection of pairs

eligible for trading and the maximum return per trade.

[Insert Figure 1]

Firstly, we claim that the strategy’s SSD selection criterion determines the stock selection.

The SSD picks stock pairs, whose historic price spread is low. Our findings indicate that pair

volatility AB and correlation ρAB explain up to 88% of the SSD’s variance. Furthermore, high

pair volatility and low pair correlation increase the SSD. Hence, we conclude that the SSD

selection procedure selects pairs with a low volatility and a high correlation level. To elimi-

nate this selection bias effect in the following analyses, we consider pairs with all pair volatili-

ty and correlation levels.

Secondly, we argue that the traditional algorithm limits the maximum return per trade via the

2σ-rule. The trading algorithm initiates positions as soon as the price difference of a pair’s

stocks exceeds 2σAB. Positions are closed afterwards, as soon as both prices fully converge.

Hence, we approximately earn the initial price spread. The return per trade is obviously high-

er, if the initial price spread 2σAB is high. We conclude that the size of 2σAB restricts the max-

imum return per trade. We elaborate that σAB equals the standard deviation of a portfolio

comprising a long and a short stock. 2σAB can be rewritten as formula with input parameters

pair volatility and pair correlation: 2σAB = 2√σA2 +σB

2 – 2ρAB

σAσB . The first derivation reveals

that high pair volatility and negative correlation enhance the return per trade. Our empirical

results validate that negatively correlated pairs earn twice as much as per trade in comparison

to highly correlated pairs (same volatility level). Furthermore, highly volatile pairs earn up to

three times more per trade than low volatile pairs (same correlation).

Together with the return per trade, the trading frequency, or more precisely the number of

trades, determine the total return of pairs trading. For instance, Do/Faff (2010) find that se-

lecting pairs with frequent stock price crossings significantly enhances the return. We propose

that the trade opening trigger of 2σAB is defined based on the computed pair volatility and

correlation during the identification period. Yet, the likelihood to open a new trade is higher,

if the pair volatility increases or the correlation decreases in the subsequent trading period.

Consistent with a mean-reversion process, our results report more volatility increases for ini-

tially low pair volatility, and more correlation decreases for initially high correlation levels.

More trades in turn increase the trading frequency, which is beneficial for the strategy’s total

return.

Notably, high pair volatility increases both, the return per trade and also the trading frequen-

cy. In contrast, high pair correlation increases the trading frequency on the one hand, but de-

crease the return per trade on the other hand. Yet, it is not obvious, whether the influence on

the return per trade or on the trading frequency dominants. This overlaying effects explain,

why there is no linear change across different pair volatility and correlation quintile levels of

the VCS portfolio’s monthly returns.

One puzzle remains, why pairs trading is profitable for less correlated pairs, although the

strategy intends to exploit temporary mispricing of close economic substitutes. We disagree

that close economic substitutes must be highly correlated and argue, that the information dif-

fusion hypothesis demands a gradual information diffusion across two close economic substi-

tutes. Exactly this required temporary dissimilar price development causes a lower correla-

tion. Highly correlated pairs are likely to incorporate common information simultaneously and

hence offer little chance to exploit temporary mispricing.

The remaining parts of the paper are organized as follows. In section 2, we derive three re-

search propositions from theory. Section 3 introduces the data selection, the methodology of

pairs trading, the return calculation, applied modifications and the test design. Afterwards,

section 4 explores the link between the classical selection criterion, pair volatility and pair

correlation. Section 5 validates our hypotheses with empirical evidence on trade level, where-

as section 6 reports results in calendar time. Section 7 discusses the profitability of close and

non-close economic substitutes. Finally, section 8 discusses possible return sources and ad-

dresses robustness issues, before section 9 concludes.

2. Development of research questions

Although previous papers do not directly examine the influence of pair volatility and correla-

tion, their findings motivate our research. Firstly, out of fifty pairs with the lowest distance

measure (SSD), Do/Faff (2010) select the twenty pairs with the highest number of price cross-

ings during the identification period and observe a higher profitability. These authors argue

that multiple divergences and convergences of a pair within the identification period, predict

more trading opportunities. Moreover, they find an increased profitability during times of

market distress from January 2000-December 2002 and July 2007-June 2009. Both bear mar-

ket phases were accompanied by high market volatility. Secondly, the success of pairs trading

was explored in many countries, for instance Andrade/diPietro/Seashole (2005), Perlin (2009)

and Bolgün/Kurun/Güven (2010) among others. Most notable, Jacobs/Weber (2015) provide a

broad overview for 34 countries worldwide. They observe that less developed markets usually

realize a higher return. These markets are normally less liquid and hence possess higher mar-

ket volatility. Thirdly, Engelberg/Gao/Jagannathan (2009) provide a number of in-depth anal-

yses about the drivers of opening, horizon and divergence risks. Among other findings, they

demonstrate the positive influence of idiosyncratic volatility on the opening probability and a

negative impact of idiosyncratic volatility on the time till convergence and also on the diver-

gence probability. Similarly, Jacobs/Weber (2015) examine the influence of idiosyncratic vol-

atility as proxy for limits to arbitrage and find a positive influence of volatility on the return.

Finally, Huck (2015) investigates the effect of total market volatility over time. He initializes

positions, if the VIX is categorized into a certain regime, in addition to the traditional opening

signal. However, the return is only significant at times of an increasing or high 3-month mov-

ing average VIX.

We understand pair volatility and pair correlation as technical drivers of the pairs trading re-

turns. The total return consists of two return building blocks, the return per trade and the trad-

ing frequency. Firstly, the return per trade is defined as the average return per single round-

trip of a pair. Obviously, a high return per trade increases the total return. Secondly, the trad-

ing frequency is defined as the number of trades of a pair during one trading period. Earning

the return per trades more often raises the strategy’s total profitability. We proceed with intro-

ducing our three research proposals. In section 2.1, we derive why we expect pairs, which are

picked by the traditional selection criterion SSD, to possess certain pair volatility and correla-

tion levels. Afterwards, we explain how the two two drivers, pair volatility and correlation,

influence the return per trade in section 2.2 and the trading frequency in section 2.3.

2.1 The SSD selection criterion

A broad stream of literature applies the distance measure (SSD), as proposed by Gat-

ev/Goetzmann/Rouwenhorst (2006), to identify close economic substitutes. The SSD com-

putes the sum of squared differences of the normalized prices of two stocks and selects the

twenty pairs with the lowest SSD:

Distance Measure (SSD) = ∑ (PA,t-PB,t)2

=Tt=1 (1.1)

= ∑ (PA,t2 + PB,t

2 − 2ρA,B)Tt=1 (1.2)

where ρA,B is the Pearson correlation coefficient of the standardized price time series.3 From

the mathematical term in 1.2 we see that maximizing the correlation coefficient equals mini-

mizing the SSD (Krauss (2015)). Based thereupon, we conclude that the SSD typically selects

highly correlated pairs.

Moreover, we argue that low stock volatility is also crucial to minimize the SSD. Term 1.2

furthermore reveals that high stock prices for PA,t or PB,t also increase the SSD. Both prices

are normalized to one in t = 0. Thus, high values for PA,t and PB,t can only develop, if stock

volatility is high. Whether the minimizing influence of correlation on the SSD or the increas-

ing effect of stock volatility dominates, strongly depends on the magnitude of both factors.

Term 1.1 helps to develop a good intuition. For instance, consider three stocks A, B, and C,

whose prices PA, PB and Pc are all normalized to one in t = 0. PA and Pc are uncorrelated with

ρA,C = 0, but both are independent and normally distributed random variables with the same

standard deviation . Hence, the their confidence interval for the price realizations of PA and

PC is identical with [1 − 𝓏1−𝛼

2

𝜎

√𝑛, 1 + 𝓏1−

𝛼

2

𝜎

√𝑛]. PB is perfectly correlated with PA, however a

△ change in PA translates into a 4△ of PB.4 Consequently, PA and PB are perfectly correlated

with ρA,B = 1. However, the confidence interval of PB is significantly larger, so the deviation

from the expected value is usually higher. The daily price difference between PA and PC might

therefore be smaller than the price difference between PA and PC, although ρA,C = 0 and

3 The Pearson correlation coefficient equals: 𝜌𝐴,𝐵 =

1

𝑡−1∑ (

𝑃𝐴,𝑡−𝑃𝐴̅̅ ̅̅

𝑠𝑃𝐴

) (𝑃𝐵,𝑡−𝑃𝐵̅̅ ̅̅

𝑠𝑃𝐵

)𝑇𝑡=1 , where 𝑃𝑖,𝑡 =

𝑃𝑖,𝑡−𝑃�̅�

𝑠𝑃𝑖

are the

daily z-transformed normalized prices. 4 For instance, if PA raises from 1$ to 1.01$, PB raises from 1$ to 1.04$.

ρA,B = 1. For example in t = 1, Stock A rises from PA,0 = 1 to PA,1 = 1.01. Hence, Stock B

rises from PB,0 = 1 to PB,1 = 1.04. The price difference between stock A and B is: |PA,1-PB,1| =

0.04. Stock C is uncorrelated with stock A and does not move at all, PC,0 = PC,1 = 1. The price

difference between stock A and C is therefore: |PA,1-PC,1| = 0.01. If we cumulate the daily

squared price differences according to 1.1, we come to the following conclusion: The SSD for

two uncorrelated stocks (A and C) can be smaller than the SSD of two highly correlated

stocks (A and B), if the highly correlated pair includes at least one highly volatile stock. Alto-

gether, we expect a positive link between the SSD and correlation, and a reverse link for the

SSD and pair volatility.

Research proposition 1:

Pairs, with a high correlation and a low pair volatility, are associated with a low SSD.

A validation of research proposition 1 would indicate, that applying the SSD results in trading

with highly correlated pairs with little volatility. However, it is unclear, whether these pair

volatility and correlation combinations are beneficial for the return per trade and the trading

frequency.

2.2 Return per trade

We initialize positions as soon as the stock pair’s price difference exceeds two historic stand-

ard deviations 2hist, as calculated during the identification period. The return is continuously

earned while the pair is ‘open’, until stock prices fully converge. At that point in time, we

neutralize positions. We distinguish four different closing types. Firstly, trades that fully con-

verge during the trading period are denoted as ‘natural trades’. Secondly, pairs which not fully

converge on the last day of the trading period, are forcefully closed and labelled ‘incomplete

trades’. Thirdly, as we trade with highly volatile stocks, investors might be concerned about

the asymmetric return profile5 of pairs trading. Therefore, we close a pair, if the price differ-

ence exceeds 4hist on any day while a trade is open, similar to Engelberg/Gao/Jagannathan’s

(2009) 10-day-maximum strategy. Hence, the return potential is symmetric within the range

of -2ơ and +2ơ. These trades are labelled ‘overshooting trades’. Affected pairs are blocked for

5 The positive return per trade is limited to 2ơ, while pairs that further diverge might generate an infinite loss.

the rest of the trading period.6 Finally, ‘delisted trades’ are trades that automatically close if

one stock is delisted.

We conjecture that a pair always earns a positive return, if prices fully converge (natural

trade).7 The return per trade equals the stock price difference at the time of the opening. Posi-

tions are initialized as soon as prices diverge by more than 2hist. Thus, the return of a natural

trade equals 2hist plus an overshooting component. The overshooting component occurs, if

the price difference exceeds the 2ơhist trigger during the day and prices further diverge until

closing prices are set. We derive:

Return per natural trade = 2σhist+ Overshooting. Accordingly, the return of a natural trade is

higher, if the historical standard deviation is larger.

Pairs trading can be understood as a portfolio, consisting of one unit of stock A (long) and one

unit of stock B (short). Therefore, the pair’s historic standard deviation can easily be comput-

ed. A simple mathematic transformation uncovers the relationship between trade return, vola-

tility of stock A and B and the pair’s correlation coefficient pAB:

Return per natural trade = 2√σA2 +σB

2 - 2ρAB

σAσB + Overshooting, (2)

The formula validates our conjecture that the return of a successful trade is always positive.

The overshooting component is always positive, as otherwise the total price difference would

not exceed 2ơhist. As stock A is unequal to stock B, we conclude (σA-σB)2>0. Hence, the in-

fluence of volatility on the return per trade is always positive, as σA2 +σB

2 > 2ρAB

σAσB with

ρAB∈[-1,1]. Based upon this inequality, we argue that a higher volatility level of stock A or

stock B (or both) increases the historic standard deviation and thus the return per trade.

On the contrary, higher pair correlation decreases 2ơhist. Neglecting the overshooting compo-

nent, the first differentiation of the function regarding the historical correlation coefficient

6 Otherwise, a new trade would open on the next day, as the price spread is still above 2ơhist because prices are

not set back to one during the trading period. We also assume that a price spread of 4ơhist indicates a permanent

price spread and thus refrain from further trading with these pairs. 7 A possible unprofitable price development of one stock is always overcompensated by the return of the other

stock that overcomes the previous price difference. For instance, if the stock price of the short position is rising,

the return of the long position will over compensate the negative return of the short position as otherwise, stock

prices would not fully converge.

equals:

∂f(σA,σB,ρAB)

∂ρAB

= - 2σAσB

√σA2 + σB

2 - 2ρABσAσB

, (3)

The first differentiation is always negative, as the numerator and the denominator are positive.

The return should therefore increase with declining historical correlation. We conclude that

more volatile stocks and a negative or low stock correlation enhance the return per trade.

In contrast, overshooting trades, which are closed if the price spread exceeds 4ơhist, always

yield a negative return. Recall that positions are initialized at a price spread of 2ơhist and are

closed at 4ơhist. The potential loss is hence limited to 2ơhist and therefore symmetric to the re-

turn potential of natural trades. The link between return on the one side and volatility and cor-

relation on the other side is reverse to natural trades. Hence, low volatility and high positive

correlation reduce the downside risk.

Incomplete and delisted trades neither fully converge nor diverge by more than 4ơhist. These

trades earn a positive return, if prices converge. However, they generate a loss, if prices di-

verge. In conclusion, the link depends upon the direction of the stock movement.


Highly volatile pairs and negatively correlated pairs increase the return per trade of converg-

ing trades. In contrast, low volatility and high correlation reduce the downside risk of diverg-

ing trades.

2.3 Trading frequency

Turning now to the second return dimension, the trading frequency, we argue that a high trad-

ing frequency is beneficial for the strategy’s total return. The findings of Do/Faff (2010), who

observe a higher profitability for pairs whose prices frequently intersect each other, motivate

us to investigate the drivers of the trading frequency. Obviously, more trades with positive

return are beneficial, whereas many trades with a negative return are harmful. A pair can gen-

erate several natural trades within one trading period. Earning the price spread several times

during one trading period successively increases the total return. In contrast, incomplete

trades, overshooting trades and delisted trades always represent the last trade within a trading

period. A pair cannot open after an incomplete trade, as the incomplete trade is closed on the

last day of the trading period. Pairs are blocked for further trades after an overshooting trade,

as otherwise pairs are opened and closed daily, while the price spread still exceeds 4ơhist.

Likewise, a particular pair obviously cannot open after the delisting of one incorporated stock.

So, as unprofitable trades are limited to one, increasing the trading frequency only raises the

number of natural trades, the closing type which always yields a positive return.

The trading frequency is determined by the probability to open a pair and the time till conver-

gence. The probability to generate loss-making overshooting trades is linked to the probability

of further price divergence, while a pair is open. Engelberg/Gao/Jagannathan (2009) explore

the influence of various variables on the former probabilities, including average mean idio-

syncratic volatility. The authors observe a positive influence of idiosyncratic volatility on the

opening probability and a negative influence on the time till convergence and on the probabil-

ity of further price divergence.

We concentrate on the probability of a pair opening, and we argue that a volatility increase or

a pair correlation decrease between the identification and the trading period raise the opening

probability. Let Xt be a normally distributed random variable,8 which describes the price

spread, the difference between the normalized prices of stock A and B, on day t. The left

graph of figure 2 displays the density function of Xt during the identification period (Den-

sityID_Period) with Xt~N(0; σhist). Pairs trading expects both stocks to strictly co-move, so the

expected value μ of the price spread Xt should be zero. The distribution of Xt is symmetric, as

it is equally likely that stock A outperforms stock B and vice versa. Following the traditional

pairs trading algorithm, the trigger point to open a trade is 2ơhist. More precisely, a pair is

opened, if Xt falls below μ-2ơhist or exceeds μ+2ơhist.

[Insert Figure 2]

The dashed area in the left figure, between μ-2ơhist and μ+2ơhist, represents the density to not

exceed the trigger points. We derive from the density function that this probability is 95.45%.

If the volatility does not change between the identification and the trading period, the proba-

bility to open a new trade is hence 4.55%. However, the opening probabilities might change

8 This is a simplified assumption to demonstrate the idea. In reality, Xt is a function of Xt-1, the return of stock A,

and the return of stock B.

dramatically, if Xt’s volatility increases to ơTrade. The former trigger points of μ +/- 2ơhist,

based on the volatility during the identification period ơhist, is still active. The probability of Xt

to exceed the trigger points, illustrated by the grey area in the right figure, is significantly

higher than before, with Xt~N(0; σTrade). In an extreme case the opening probability could

increase from 4.55% to 31.73%, if the volatility doubles (ơTrade = 2ơhist) between the identifi-

cation and the trading period. On the contrary, a drop in the volatility of Xt (ơTrade < ơhist) re-

duces the probability to open a new trade.

We now turn to the drivers of Xt’s volatility. The volatility of the price spread Xt (ơhist and

ơTrade) is positively influenced by the stock volatility σA and σB. Furthermore, Xt’s volatility is

negatively affected by the correlation of stock A and B, ρA,B.9 Considering all links together,

we conjecture that an increase of stock volatility σA or σB, or a decrease of pair correlation

ρA,B, between the identification and trading period, increase the volatility of Xt. The increase

of Xt’s volatility, in turn, raises the chance to open a new trade and hence raises the trading

frequency, which is beneficial for the total return. In contrast, we expect that a decrease of

stock volatility and a pair correlation increase reduce the trading frequency. Therefore, select-

ing promising pairs is also a matter of correctly forecasting volatility and correlation changes.

In the volatility quintile, which includes the pairs with lowest pair volatility, we expect to find

some stock pairs with permanently low pair volatility. But we also reckon to find some pairs,

whose pair volatility is only temporarily low and which are hence expected to rise in the fu-

ture. Likewise, the pair volatility quintile, including the pairs with the highest pair volatility,

might also include some pairs with temporarily high pair volatility. We hypothesize that at

least some pairs with temporarily low (high) pair volatility will exhibit a pair volatility in-

crease (decrease) in the near future. Therefore, we argue that pairs within the lowest pair vola-

tility quintile are more likely to observe a favorable volatility increase, and hence generate

more trades, than pairs within the highest pair volatility quintile. We argue similarly for corre-

lation and expect to observe more correlation decreases for pairs within the highest correlation

quintile, in contrast to pairs within the lowest correlation quintile.


9 We derive these connection from the following mathematical transformation: V(X) = V(PA − PB) = V(PA) +

V(PB) − 2Cov(PA, PB) = V(PA) + V(PA) − 2ρA,B√V(PA)√V(PB).

The number of natural trades is increased by low stock volatility and high pair correlation

during the identification period, coupled with higher volatility and lower correlation during

the trading period. Pair volatility increases are more likely for pairs with currently low pair

volatility, and correlation decreases are more likely for pairs with a currently high correla-

tion.

Altogether, we conclude that high pair volatility increases the return per trade, but decrease

the trading frequency. Low correlation increases the return per trade, but decreases the num-

ber of trades. So the puzzle remains, which volatility-correlation combination of pairs is the

best for the strategy’s total return. To address this puzzle, we compute the monthly return of

pairs trading to evaluate whether high correlation is beneficial for the total return.

3. Methodology and data selection

After introducing the three central research propositions of this paper in section 2, we pro-

gress with introducing our data selection (3.1) and a detailed description of the traditional

pairs trading algorithm (3.2). Afterwards, we precisely explain how the return is calculated on

the per trade level (3.3.1) and in calendar time (3.3.2). Finally, section 3.4 motivates applied

modifications and describes the test design. Several streams of literature coexist, which apply

different algorithms for selecting pairs or trading. Most prominent are the univari-

ate/multivariate distance approach (Gatev/Goetzmann/Rouwenhorst (2006), Engel-

berg/Gao/Jagannathan (2009), Jacobs/Weber (2015)), the cointegration approach (Vidya-

murthy (2004), Lin/McCrae/Gulati (2006)), the copula based algorithm (Stand-

er/Marais/Botha (2013)), the stochastic approach (Tourin/Yan (2013)) and mixed models. For

further reference, Krauss (2015) provides a detailed overview and evaluation.

Hauck/Afawubo (2015) and Huck (2013, 2015) empirically evaluate different approaches. We

concentrate on the most prominent univariate distance approach, first introduced by Gat-

ev/Goetzmann/Rouwenhorst (2006).

3.1 Data Selection

We obtain the historical index constituents of the S&P 1500 between January 1990 and De-

cember 2014 from the Compustat Monthly Updates - Index Constituents file. Furthermore, we

draw daily stock price data from the WRDS CCM merged database. We exclude stocks for a

particular pairs trading cycle, if any of the three: bid, ask, or closing price, is unavailable on

more than one day during the identification period. After this preselection, approximately

1900 stocks on average are eligible for trading each month, forming around 1.8 million possi-

ble pair combinations. In contrast to fellow papers, we keep stocks in our sample, which are

not traded on one day during the identification period, as bid and ask prices are available and

trading is hence possible. This modification allows us to include less liquid, more volatile

stocks, which is necessary to accurately evaluate our research propositions two and three. All

computations are executed in Stata and the implemented matrix language Mata.

Gatev/Goetzmann/Rouwenhorst (2006) and follow-up papers implement a one-day-waiting

strategy to consider a potential upward bias in returns caused by the bid-ask bounce.10

After

the initial trading signal, they wait for one day until they trade. We refrain from approximat-

ing the bid-ask-spread and trade with the exact last bid or last ask price. At trade initialization,

we buy the long position at the ask price and sell the short position at the bid price. We neu-

tralize the long position at the bid price and the short position at the ask price. Similarly, we

use these latter prices for the daily evaluation. Additional computations, like the average vola-

tility or correlation, are executed with closing prices.

3.2 The Pairs Trading Algorithm

Following Gatev/Goetzmann/Rouwenhorst (2006), one pairs trading period in our paper con-

sists of two phases – first, a 12-months identification period and a subsequent 6-month trading

period.

At the beginning of the identification period, all prices are normalized to one. During the con-

secutive twelve months, we compute three key figures for each possible pair combination AB

in the stock universe: the pair volatility σAB = σA2 + σB

2, the pair correlation ρA,B, and the tra-

ditional selection measure, the sum of squared differences of normalized prices (SSD).

As introduced in the previous section, Gatev/Goetzmann/Rouwenhorst (2006) calculate the

10

In an upward trend, the closing price most likely represents the ask price. As pairs trading sells the increasing

“winner stock”, an investor receives only the lower bid price. Likewise, the closing price of a decreasing stock is

more likely to be the bid price, which must be purchased at the higher ask price. Hence, using closing prices

might overestimate the real return.

SSD as sum of the squared deviations between the normalized price movements of two stocks

over the identification period:

Distance Measure (SSD) = ∑ (PA,t-PB,t)2T

t=1 (4)

where t is the day index during the identification period, Pa,t is the normalized price of stock A

(or B) at day t. At the end of the identification period, the authors form a portfolio, consisting

of the twenty pairs with the lowest SSD, which is traded during the next six months.

To avoid an in-sample bias, the trading period starts on the first day after the identification

period. Again, all prices are normalized to one. At the end of each day, the trading algorithm

checks for each pair, included in the trading portfolio, whether to open a new trade or close a

currently open trade. If the normalized prices of a pair diverge by more than two historic

standard deviations 2σhist, as calculated during the identification period, we initialize the self-

financing trade as follows: We sell one monetary unit of the better performing ‘winner’ stock

and simultaneously buy one monetary unit of the less successful ‘loser’ stock. As soon as both

prices completely converge, we neutralize both positions. Thereafter, a pair might open and

close several times during the trading period. Recall from section 2, that all open positions are

closed, if prices do not fully converge until the end of the trading period. Furthermore, trades

are also closed, if the price spread exceeds 4σhist (overshooting trade) or one stock is delisted

on any day, while the trade is active (delisted trade). A new pairs trading period starts every

month, so at each point in time, six portfolios are trading simultaneously11

.

3.3 Return Calculation

3.3.1 Return per Trade Calculation

To empirically assess research proposition 2, we calculate the return per trade, which is the

payoff of one particular trade. A pairs trader earns a positive return, if the underlying price of

the short position decreases or the long position rises. Consider the following example: The

historical standard deviation of a pair’s price spread is σhist = 0.05. Thus, we open the pair as

soon as the difference (|PA - PB|) between the normalized price of stock A (PA) and the nor-

malized price of stock B (PB) exceeds 0.1 (=2 σhist). For instance, PA,0 = 1 and PB,0 = 1 in t = 0,

11

As a result of the overlapping trading periods, at each point in time one of the portfolios starts to trade in the

particular month, the second one is active for already two months, the third one for three months and so on.

and PA,1 = 1.05 and PB,1 = 0.95 in t = 1. Thus, we open the trade in t = 1. We initiate a short

position in the ‘winner’ stock A with a capital commitment of CA,1 = (-1)$ and a long position

in the ‘loser’ stock B with a capital commitment of CB,1 = +1$. In t = 2, the normalized prices

of stock A and B fully converge to one: PA,2 = 1 (decrease of -4.76%) and PB,2 = 1 (increase of

5.26%). The capital commitments are CA,2 = −1$ ∗ (1 + (−0.0476)) = −0.9524, and

CB,2 = +1$ ∗ (1 + 0.0526) = 1.0526. As both normalized prices of stock A and B fully

converge, we close the trade and receive a payoff of: (CA,2 − CA,1) + (CB,2 − CB,1) =

0.05$ + 0.05$ = 0.10$. The return of 0.10$ represent the return per trade.

3.3.2 Calendar Time Return calculation

Furthermore, we also calculate the monthly return in calendar time. We closely follow Gat-

ev/Goetzmann/Rouwenhorst (2006) return calculation: All positions are marked-to-market

daily. We calculate the return 𝑟𝑖,𝑡, which is the return that stock i realizes between day t-1 and

t, for all stocks in our portfolio. The stock weight wi,t can be interpreted as capital investment

in stock i at day t-1 or as buy-and-hold strategy, which reinvested daily returns until day t-1.

At the opening day t = 1, the stock weight wi,t is defined as follows:

wi,1 = I = {

1 for a long position−1 for a short position0 if the pair is closed

(5)

At the following days t >1, the stock weigths wi,t are calculated as the product of the previous

days’ capital investment:

wt,i = I ∗ wt−1,i ∗ (1 + rt−1,i) = I ∗ (1 + rt−1,i) … (1 + r1,i) = I ∗ ∏ (1 + rt−1,i)t−1t=1 (6)

The pairs trader’s payoff of stock i on day t equals the weight (or capital investment in t-1) wi,t

multiplied with the stock return ri,t between t-1 and t:

Payofft,i = wt,i ∗ rt,i (7)

For instance, the stock commitment in stock A in t-1 is +1.25$ (long position) and stock A

increased from 20$ to 21$ (5% increase). Hence, the stock commitment in A in t is +1.25$ ∗

(1 + 0.05) = 1.3125$. The payoff is: 1.3125$ − 1.25$ = 1.25$ ∗ 0.05 = 0.0625$.12

The daily return of a pair P, which includes stocks 𝑖 ∈ 𝑃, equals:

rP,t =∑ wt,i∗rt,ii∈P

∑ |wt,i|i∈P=

∑ payofft,ii∈P

∑ |wt,i|i∈P (8)

The daily pair return can be understood as sum of the daily payoffs divided by the total pair

capital commitment.13

The daily returns are cumulated to monthly returns afterwards. Follow-

ing Gatev/Goetzmann/Rouwenhorst (2006) and Do/Faff (2010), we report the return on com-

mitted capital. The measure scales the portfolios payoff by the number of actively traded

pairs.14

Six portfolios, each starting in a subsequent month, are traded simultaneously at any time. As

stock prices are likely to diverge further from their normalized prices over time, pair are more

likely to open late in the trading period. Therefore, we average the monthly return of the six

simultaneously traded portfolios. Figure 3 illustrates this process.

[Insert Figure 3]

Gatev/Goetzmann/Rouwenhorst (2006) interpret the return of pairs trading as payoffs to a

proprietary trading desk, where different traders manage six pairs trading portfolios, whose

identification and trading periods are each staggered by one month. Alternatively, the return

can be interpreted as average return of one active pair across all open pairs within the same

portfolio and across six portfolios, which trade simultaneously, but started in consecutive

month. We refer to Gatev/Goetzmann/Rouwenhorst (2006) for further details on the return

calculation.

12

Another example for a short position: The stock commitment in stock B in t-1 is -2$ (short position) and stock

B decreases from 40$ to 37.6$ (6% decrease). Hence, the stock commitment in B in t is: −2$ ∗ (1 + (−0.06)) =

−1.88$. The payoff is: −1.88$ − (−2$) = −2$ ∗ −0.06 = 0.12$. 13

The pair return of our example pair P1, including stock A and B, equals: r1,t =1.25$∗0.05+(−2$∗−0.06)

|1.25$|+|−2$|=

0.0625$+0.12$

|1.25$|+|−2$|=

0.1812$

3.25$= 0.0562.

14 Inactive pairs are neglected, as they do not require a capital commitment and can hence be invested in a risk-

free asset.

The calendar time return might considerably differ from the return per trade for three reasons.

Firstly, the monthly return reflects the combined effect of trading frequency and return per

trade. Secondly, we scale the monthly return to the number of included active pairs in the

portfolio. Thirdly, a trade might be open for several months. Therefore, the pair earns only a

fraction of the return per trade within one month.

3.4 Modifications and test design

To systematically investigate the impact of volatility and correlation, we slightly modify the

classical pair selection algorithm: Firstly, we compute the pair correlation ρA,B

and a proxy for

the combined volatility, the pair volatility (σA2 +σB

2 ) during the twelve month identification

period. Afterwards, we define five pair correlation and five pair volatility quintiles and classi-

fy pairs accordingly. Quintile Corr_Q1 (Corr_Q5) includes the pairs with the lowest (highest)

pair correlation, quintile Vola_Q1 (Vola_Q5) includes the pairs with the lowest (highest) level

of pair volatility. Thirdly, we construct 25 pair groups from the intersection of the five pair

volatility and five pair correlation quintiles groups. For example, Corr_Q1/Vola_Q1 includes

the pairs, whose pair correlation and also pair volatility are among the lowest 20% compared

to all other pair combinations. The quintile and group affiliation of a pair is updated every

identification period. Fourthly, we randomly pick twenty pairs out of each group and include

them into our pair volatility and pair correlation double-sorted (VCS) portfolios. These pairs

are included in the portfolio during the subsequent six month trading period. For each trading

period, we define one trading portfolio for each of the 25 volatility-correlation-quintile com-

binations. Finally, we repeat the random pair selection in the previous step ten times to elimi-

nate a selection bias.

Altogether, we analyze 1200 pairs of each group, at any point in time (20 pairs in each portfo-

lio, 6 simultaneously traded portfolios resulting from the overlapping trading periods, and 10

sets with randomly chosen pairs per group). If not otherwise stated, we report the average

monthly return of the ten simultaneously traded sets (time series) or alternatively the average

return per trade for all trades realized by the 1200 selected pairs (cross-section) in the follow-

ing sections. Following Engelberg/Gao/Jagannathan (2009), the former definition is labelled

calendar time, whereas the latter one is defined as event time.

Table 1 reports the average pair correlation and pair volatility per quintile during the identifi-

cation period. For instance, the average pair correlation of all Corr_Q5 pairs (highest pair cor-

relation quintile) is 0.794.

[Insert Table 1]

The average pair correlation ranges from -0.468 to 0.794. So, the sample also includes uncor-

related (Corr_Q2) and negatively correlated pairs (Corr_Q1). The average pair volatility rang-

es from 0.016 to 0.545. We use normal volatility for the classification, instead of idiosyncratic

volatility, as it is more convenient for practitioners to apply a simple, observable measure. To

calculate idiosyncratic volatility, we follow Xu/Malkiel’s (2003) direct decomposition method

and extract the residuals, obtained from a time series regression of daily stock returns on

Fama/French’s (1993) 3-factor model. We compare the results for normal and idiosyncratic

volatility and find that the allocation to quintiles is almost identical.

4 Link between the classical selection distance measure, volatility

and correlation

What are the implications of applying the traditional pair selection criterion (SSD)? This sec-

tion investigates whether low pair volatility and high correlation influence the SSD, and if to

what extent. The results are important to understand. A significant influence of both factors

indicates that selecting the twenty pairs with the smallest SSD equals selecting pairs with a

specific pair volatility-correlation combination. This specific combination could in turn influ-

ence the trading behavior and hence the return potential. Furthermore, we benchmark our

monthly returns of a traditional pairs trading portfolio in the US between 1990 and 2014 with

the returns of fellow studies.

The classical distance measure was introduced by Gatev/Goetzmann/Rouwenhorst (2006) and

applied by several other authors. It calculates the sum of the squared deviations between the

normalized price movements of two stocks. Research proposition 1 derives, that low pair vol-

atility and high correlation minimize the SSD. We regress the SSD on pair volatility and cor-

relation to investigate the explanation power of both factors. Our sample includes all possible

pair combinations across all identification periods. We utilize period and pair clustered stand-

ard errors. The results are displayed in Table 2, Panel A.

[Insert Table 2]

Pair volatility and correlation explain 88% of the SSD’s total variation, verifying a strong

influence of both factors. As expected, the SSD and correlation are positively linked, whereas

SSD and volatility are inversely linked. It is unclear from the SSD’s mathematical formula,

whether the effect of pair correlation or the effect of single volatility dominates. The standard-

ized regression coefficients reveal that low pair volatility dominates the effect of high correla-

tion. To check the robustness of our results, we conduct the same regression with our ten re-

duced data sets and confirm the previous results separately for each set. We conclude that the

traditional selection algorithm picks pairs with low pair volatility and high correlation. More-

over, we examine into which volatility and correlation quintiles the twenty pairs with the low-

est SSD are classified. Unsurprisingly, almost 67% of all SSD selected pairs are classified as

Corr_Q5/Vola_Q1, the portfolio with the lowest pair volatility and highest correlation. The

remaining pairs are all allocated to one of the other Corr_Q5 or Vola_Q1 portfolios.

The SSD portfolio allows a direct comparison with selected fellow studies. We exclude papers

that apply major modifications, conceal raw returns or one-day-waiting returns15

. Do/Faff

(2010) detect a declining trend in pairs trading return over time. Therefore, we must consider

the covered time period, when comparing the returns. Furthermore, minor adaptions of the

original algorithm might cause small differences, like the pairs preselection in Papa-

dakis/Wysocki (2007). Table 2, Panel B summarizes the monthly U.S. returns for similar

stock universes. Our return of 37bp is consistent with the return of Do/Faff (2010) of 37bp for

1989-2002 and 24bp for 2003-2009, as well as 36bp of Papadakis/Wysocki (2007) for 1994-

2006, suggesting the reliability of our computation and our bid-ask price approach.

In conclusion, this section’s empirical results confirms research proposition 1. This finding

implicates that an investor commits to trade with highly correlated pairs with low pair volatili-

ty, when applying the traditional SSD measure. The results of the following analyses will re-

veal whether the SSD’s pair selection is optimal regarding the return per trade, the trading

frequency, and the monthly return.

15

Previous studies apply a one-day-waiting strategy to model the bid-ask-spread (see Gatev/Goetzmann/

Rouwenhorst (2006) for further details). In contrast, we directly use closing bid prices for selling and closing ask

prices for buying stocks.

5 The effect of volatility and correlation on the return per trade

and the trading frequency

The return of pairs trading consists of two major building blocks, the return per trade and the

trading frequency. The return per trade equals the payoff of one particular pair’s trade. The

return can be earned over a time period between one day and six months. The trading fre-

quency equals the number of trades of one specific pair within one six months trading period,

and it states the frequency of desired trading opportunities. To answer the paper’s main re-

search question, whether part of the return is explained by the pairs trading algorithm itself,

this section separately investigates the influence of the two factors, pair volatility and correla-

tion, on both return building blocks. A significant influence of both factors reinforces our the-

sis, that specific levels of pair volatility and correlation explain at least part of higher observed

returns across industries, countries, and over time.

The previous section uncovered that the traditional selection procedure picks pairs with low

pair volatility and high correlation. In this section’s analysis, we are however interested in the

isolated effect of pair volatility and correlation on the trading algorithm. Therefore, we study

the trading behavior of all twenty-five pair volatility and correlation sorted portfolios (VCS

portfolios), which were previously introduced in section 3. This procedure allows us to ana-

lyze the pure influence of both factors on the trading procedure itself, without the distracting

interference of the SSD’s preselected pair volatility and correlation combination. Section 5.1

starts with investigating the effect of pair volatility and correlation on the first return building

block, the return per trade. Afterwards, section 5.2 analyzes the effect on the second return

building block, the trading frequency.

5.1 Return per Trade

Research proposition 2 conjectures a positive influence of high volatility and a negative influ-

ence of high correlation for profitable natural trades and vice versa for loss-making overshoot-

ing trades. The influence on the return per trade of incomplete and delisted trades depends on

whether prices converge or diverge. Like natural trades, high volatility and low correlation are

beneficial for converging prices, whereas low volatility and high correlation are preferable for

diverging prices.

Table 3 compares the return per trade for each VCS Portfolio separately for each closing type

in panel A to D. Panel A reports that the average return per natural trade of a Vo-

la_Q1/Corr_Q5 pair (lowest volatility and highest correlation quintiles) is 0.14 units for a one-

unit commitment in the long and the short position. The average return per natural trade rang-

es between 0.1441 units for Vola_Q1/Corr_Q5 pairs and 0.7256 units for Vola_Q5/Corr_Q1

pairs. As expected, the return per trade for natural trades rises for higher volatility levels

(within each correlation quintile) and decreases for higher correlation (within each volatility

quintile). Although incomplete trades are forcefully closed on the last day, Panel B reports an

average positive return per trade. Similar to natural trades, higher volatility and lower correla-

tion significantly enhance the profitability of incomplete successful trades. The return ranges

between 0.0118 units for Vola_Q1/Corr_Q5 and 0.0338 units for Vola_Q5/Corr_Q1. Panel C

displays that higher volatility and lower correlation increase the loss per trade for overshoot-

ing trades. The return per trade ranges between -0.4668 units, for Vola_Q5/Corr_Q1, and -

0.1125 units, for Vola_Q1/Corr_Q5. As expected, the link between volatility, correlation and

return of overshooting trades is reverse to natural trades. Panel D reports that delisted trades

are on average negative for highly volatile and low correlated pairs. Yet, the return is increas-

ing and positive for less volatile and stronger correlated pairs. The average return per trade

ranges between -0.1322 units, for Vola_Q5/Corr_Q1, and 0.0190, for Vola_Q2/Corr_4. How-

ever, the return of 8 out of 25 portfolios is insignificant, indicating that there is no clear vola-

tility and correlation impact pattern for delisted trades.

[Insert Table 3]

Altogether, our findings support research assumption 2. As expected, high volatility and low

correlation levels increase the return of converging pairs, but also increase the loss of diverg-

ing pairs. We learn that the volatility and correlation levels during the identification period

dictate the average return level per trade. Hence, pairs in more volatile markets or in times of

general higher market volatility should earn more per trade. Furthermore, our findings imply

that the SSD’s preselection of highly correlated pairs with low volatility earn less per trade

than less correlated pairs with higher volatility.

5.2 Trading Frequency

After examining the return per trade in the previous section, we now turn to the second return

building block, the trading frequency. This section explores the link between pair volatility,

correlation, and trading frequency to evaluate research proposition 3. We argue that raising

the trading frequency translates into earning the return per trade multiple times during one

trading period. Recall that a high trading frequency increases only the number of always prof-

itable, closing type 1 trades (natural trades). Closing type 2, 3 and 4 are always the last trade

within a specific trading period and are therefore unaffected by a higher trading frequency.

The first analysis investigates the influence of pair volatility and correlation on the probability

of a level shift. A level shift describes one pair’s change of pair volatility or correlation, be-

tween the identification period and the trading period. We expect to register more volatility

decreases for pairs with high pair volatility (Vola_Q5 quintile) and less volatility decreases

for pairs with little volatility (Vola_Q1 quintile). Likewise, we argue that correlation increases

are more likely for low correlated pairs (Corr_Q1) and less likely for originally high strongly

correlated pairs (Corr_Q5). The price spread Xt, which is defined as the difference between

the two normalized stock prices of the pair on day t, determines, whether a new trade is ini-

tialized. The opening probability rises, if the volatility of Xt is low during the identification

period and high during the trading period. As derived for research proposal 3, we conjecture

that volatility increases and a correlation decreases raise the volatility of Xt, which in turn

raise the desired opening probability.

We first compute the average pair volatility and the correlation coefficient of each pair during

the identification period and repeat the computation separately for the trading period. Based

thereupon, we generate two dummies for each pair: the Volatility Dummy (Correlation Dum-

my) indicated whether the pair experienced a pair volatility increase (correlation decrease) or

not. Afterwards, we merge the information with our trade dataset, where each trade represents

one observation. So we link each trade observation with the information, whether the trade

generating pair experienced a volatility increase or correlation decrease. Finally, we calculate

for each VCS Portfolios the average percentage of trades, which were generated by pairs with

a desired volatility increases and correlation decreases. Table 4, Panel A reports the results for

volatility increases. For instance, in the Corr_Q1/ Vola_Q1 Portfolio 61.08% of all trades are

executed by pairs, whose volatility level increased. The far right column represents the differ-

ence between the quintile with the lowest and the highest correlation, for a given pair volatili-

ty level. Statistical significance of a two sample mean comparison test at the 10%, 5%, and

1% level is indicated by *, **, and ***. We observe across all volatility levels, that pair vola-

tility decreases are more likely for reversely correlated pairs (Corr_Q1 quintile), as Corr_Q5 -

Q1 is negative for all volatility levels. The bottom row calculates the difference between the

quintile with the lowest and the highest pair volatility, for a given pair correlation level. Con-

sistent with the mean-reversion theory, pair volatility increases are more likely for low pair

volatility levels (Vola_Q1) across all correlation levels as Vola_Q5 - Q1 < 0. Overall, the ta-

ble suggests that most trades are generated by pairs, with a low pair volatility and low correla-

tion.

[Insert Table 4]

Furthermore, we conjecture that correlation decreases raise the trading frequency. Similar to

Panel A, Panel B displays the percentage of trades, originated by pairs whose correlation de-

creased between the identification and trading period. For instance, in the Corr_Q1/ Vola_Q1

Portfolio 23.68% of all trades are executed by pairs, whose correlation level increased. Again,

the far right column and the bottom row show the inter-quintile differences. Beneficial corre-

lation decreases are more likely for pairs with high pair correlation (Corr_Q5 - Q1 > 0) and

more likely for pairs with low volatility (Vola_Q5 - Q1 > 0). Notably, the inter-quintile dif-

ference between the highest and lowest correlation quintile is especially high, indicating an

especially strong effect of correlation on the probability of correlation decreases. Altogether,

we conclude that high volatility and high correlation raise the trading frequency.

So far, we learnt that high pair volatility and high correlation are positive for the correlation-

decrease-effect, but harmful for the volatility-increase-effect. These findings raise the obvious

question, whether the aggregated influence of both factors on the trading frequency is positive

or negative. To answer this question, we assess whether the influence of the correlation-

decrease-effect on the trading frequency dominates the volatility-increase-effect or the other

way round. Therefore, we count the number of trades over all periods for each VCS Portfolio.

The data in Panel C reveals that high pair volatility portfolios generate fewer trades than low

pair volatility portfolios. High correlation portfolios produce significantly more trades than

low correlation portfolios. We conclude that the effect of low pair volatility in the volatility-

increase-effect dominates the effect of high pair volatility in the correlation-decrease-effect.

Furthermore, the effect of high correlation in the correlation-decrease-effect is stronger than

the effect of low correlation in the volatility-increase-effect. Altogether, low pair volatility

and high correlation raise the total number of trades. The SSD, which selects pairs with low

pair volatility and high correlation, is a convenient selection measure to increase the trading

frequency. So far this paper has analyzed the influence of pair volatility and correlation on the

two return building blocks, return per trade and trading frequency. The next section continues

to analyze the aggregated effect of both factors on the total return.

6 Empirical results in calendar time

Our previous results are antithetic so far. High pair volatility and negative correlation increase

the return per trade on the one hand (section 5.1), but reduce the trading frequency on the oth-

er hand (section 5.2). The trading algorithm conceals which effect dominates. However, only

the aggregated effect is of importance for an investor. This section therefore takes a closer

look at the monthly pairs trading returns. The monthly return represents the payoff to an in-

vestor, and it considers the aggregated effect of pair volatility and correlation on the return per

trade and the trading frequency. It provides not only the possibility to compute a risk-adjusted

return, but also allows us to compare the monthly return of a traditionally formed portfolio

and portfolios with alternating volatility-correlation combinations. Additionally, the results

help us to ultimately judge whether the SSD is superior in selecting profitable pairs.

The analysis of the monthly returns between January 199116

and December 2014 uncovers,

whether the positive effect of high pair volatility and low correlation on the return per trade,

or the negative on the trading frequency dominates. Table 5, Panel A reports the average

monthly return for each VCS Portfolio between January 1991 and December 201417

. All re-

turns are positive, significant and range between 19bp to 234bp. For instance, portfolio Vo-

la_Q1/Corr_Q5 (low volatility/high correlation) yields an average monthly return of 46bp.

The most striking result to emerge from Panel A is the successive increase in monthly returns

for higher volatility levels from Vola_Q1 to Vola_Q4 in correlation quintiles Corr_Q2 to

Corr_Q5. The return increase is particularly stronger in higher correlation levels.

16

The monthly returns start in January 1991, as the first 12-months identifications period starts in January 1990. 17

The first identification period starts in January 1990.

[Insert Table 5]

Notably, even correlation quintile Corr_Q1 portfolios earn significant returns. In other words,

pairs trading with negatively correlated pairs is also profitable. The return differences between

Corr_Q5 and Corr_Q1 portfolios are insignificant in three out of five cases, indicating equally

high portfolio returns. This finding contradicts the classical pairs trading idea of exploiting a

temporary mispricing of two close economic substitutes. We address this puzzle in the next

section.

Furthermore, twenty-four out of twenty-five VCS Portfolios outperform the SSD portfolio

with a monthly return of 37bp. Even the direct peer portfolio Vola_Q1/Corr_Q5 (low volatili-

ty/high correlation) earns 10bp more. We speculate that the superior performance of the Vo-

la_Q1/Corr_Q5 portfolio results from randomly selecting twenty out of approximately 72000

pairs within the particular quintile intersection group. Therefore, the selected pairs exhibit an

average level of correlation and volatility within the VCS group. In contrast, the pairs of the

SSD portfolio observe the absolutely lowest volatility and the highest correlation. We con-

clude that even slightly higher volatility and lower correlation levels increase the pairs trading

return.

To account for well-established return patterns (Fama/French (1993), Conrad/Kaul (1989),

Jegadeesh/Titman (1993), Carhart (1997)), we regress monthly returns on a six factor model,

including Fama/French’s three factor model, a momentum factor and a short-term-reversal

factor. All data is obtained from Kenneth French’s website. As our sample includes stocks

with different liquidity levels, we further extend our factor model with Pastor/Stambaughs’

(2003) liquidity factor. Panel B reports the alphas of the regression for each VCS Portfolio.

Like fellow papers, we use Newey-West Standard errors with lag 6. The results confirm our

previous findings. All alphas are positive and almost always significant. Again, high volatility

and strong correlation increase alpha.

Altogether, the results suggest that pairs with high pair volatility and high correlation exploit

the maximal return potential. We conclude that the positive volatility effect on the return per

trade and the positive effect of strong correlation on the trading frequency dominate. This im-

plicates that the SSD, which selects pairs with low volatility, is suboptimal to identify the

most promising pairs. The monthly return of the traditionally selected portfolio with 37bp is

considerably lower than the risk adjusted return of most alternatively formed VCS Portfolios.

7 The information diffusion hypothesis and non-close economic

substitutes

The previous section revealed that also portfolios, consisting of low or negatively correlated

pairs, earn a significantly positive return. At first sight, this finding contradicts the natural idea

of exploiting temporary mispricing of close economic substitutes. We argue that close eco-

nomic substitutes must not necessarily be perfectly correlated. Seemingly uncorrelated pairs

may also share a common economic link, like industry affiliation or the dependence on similar

input factors or identical customers. In the light of the widely accepted information diffusion

hypothesis, pairs trading with negatively correlated pairs can also be successfully, if link spe-

cific information is gradually incorporated. We rather argue that successful pairs trading re-

quires pairs, which are not perfectly correlated. Highly correlated pairs incorporate common

news exactly at the same time, and hence they fail to offer any temporary mispricing. In sum,

we derive that successful pairs trading must share a common sensitivity to certain risk factors.

This section will therefore examine, whether seemingly non-close economic substitutes share

these common links. To quantify the economic closeness of two stocks, we develop the close

economic substitute score (CESS), which indicates the extent to which two stocks react similar

to five risk factors. Furthermore, we are interested whether pairs with many common links

trade more often within a given portfolio than pairs with less common links. This insight is

important to understand why the SSD might be suboptimal. We expect to observe more trades

by less correlated pairs within a portfolio, if perfectly correlated pairs indeed create fewer

trading opportunities.

To measure the economic closeness of two stocks, we develop the close economic substitute

score (CESS), which considers multiple pricing factors. Previous studies like Chen/Chen/Li

(2013) concentrate on similar firm characteristics. However, the demanded stock comovement

is generated by similar price sensitivities to exogenous shocks. We apply the highly recog-

nized five factor model including Fama and French’s three factors model (1993), the WML

factor and the Short-Term Reversal factor to quantify price sensitivities. All data is obtained

from Kenneth French’s website. Firstly, we regress daily closing prices of all stocks on the

five factor model and extract the regression coefficients, which indicate the sensitivity to the

respective factor. Each factor represents one pricing category. Secondly, we calculate the ab-

solute difference between the factors of stock A and B for each category, for all possible pair

combinations, and all identification period. The factor difference quantifies the similarity of

both stock’s price reaction to the respective pricing factor. Thirdly, we sort the absolute factor

difference and allocate the pair accordingly to one of five quintiles each month. This step is

repeated for each of the five categories. For example, a pair is allocated to the lowest market

news quintile. So, this pair consists of two stocks, which react more similar to market shocks

than 80% of all other pair combinations. We denote to the quintile as Ci,n,t, where i is the cate-

gory (i= Market, SMB, HML, Mom and STR), n the particular pair, and t the current identifi-

cation period. As we form five quintiles for each category, Ci,n,t takes on values between 1 and

5. Finally, we calculate the CESS score for each pair combination n in each identification pe-

riod t. The score cumulates the quintile numbers Ci,n,t und subtracts 5.

CESSn,t= ∑ Ci,n,t-55i=1 (9)

The score ranges between zero (the difference between the price sensitivities is among the

lowest 20% in all five categories) and twenty (the difference between the price sensitivities is

among the highest 20% in all five categories). The pair’s CESS score is low, if both stocks

react very similar to common shocks in all five categories, and it is high if both stocks react

differently. For example, the score is equal to ten, if the quintiles of all five categories equal

three. The CESS score is incremented by one point, if the factor difference in one category

increases so that the pair is allocated to the next higher quintile. The CESS measures the simi-

larity to common pricing factors, whereas the correlation coefficient quantifies the pure math-

ematical relationship. As both measures are closely related, we expect to observe a higher

CESS for negatively correlated pairs.

We argued previously that even negatively pairs might share some common links and might

hence exploit temporary mispricing. The previous section revealed that all VCS Portfolio,

including the portfolios with negatively correlated pairs, yield a positive return. Therefore, we

are now interested whether these pair’s stocks react similar to at least common risk factors,

measured by the CESS. Table 6, Panel A reports the average equally weighted CESS for all

VCS Portfolio across all pairs and over time. The average CESS ranges from 7.16

(Corr_Q5/Vola_Q1) to 13.48 (Corr_Q5/Vola_Q5). The CESS of the SSD portfolio is 7.61 and

smaller than 92% of the other portfolios. So, the SSD indeed selects pairs, with similar sensi-

tivity to common risk factors, which can also be called close economic substitutes. The CESS

score is higher for portfolios with negatively correlated pairs (Corr_Q1). The average CESS

score of Corr_Q1 portfolios is between 9.85 and 13.48. A pair with no similarities would be

classified into the highest quintile in each category and have a CESS score of 20. Hence, a

CESS between 9.85 and 13.48 indicates that the pair shares at least some common links. The

mathematical negative correlation of Corr_Q1 pairs is an aggregated measure, which might

conceal that stocks temporary co-move. We argue that as long as a common link exists, link

specific news might create trading opportunities. Therefore, pairs trading with negatively

pairs might also be profitable.

[Insert table 6]

To assess whether close economic substitutes trade more often than non-close economic sub-

stitutes within a portfolio, the next analysis compares the portfolio CESS with the average

CESS per trade.

Therefore, we assign a pair’s CESS to each executed trade by the pair. Afterwards, we com-

pute the average CESS across all trade within a portfolio. The CESS per portfolio can be un-

derstood as equally weighted average of all pairs included in the portfolio. The CESS per

trade in contrast the trade weighted average CESS. The CESS per trade and the portfolio

CESS are equal, if all pairs are equally likely to trade, irrespective of whether they are close

economic substitutes or not. In contrast, if the CESS per trade and the portfolio CESS are un-

equal, than a certain group of pairs is more likely to generate new trades. We derive that close

economic substitutes (lower CESS) trade more frequently, if the per trade CESS is smaller

than the portfolio CESS and vice versa. For the next analysis, we calculate the difference be-

tween the portfolio CESS and per trade CESS, and divide the difference by the portfolio

CESS to scale it. Panel B reports this percentage difference between portfolio CESS and per

trade CESS. The data reveals that close economic substitutes trade more frequently than non-

close economic substitutes, as the percentage is negative for all VCS Portfolios. For instance,

in the Corr_Q1/Vola_Q1 portfolio the trade weighted CESS is 4.27% smaller than the average

CESS of all pairs included in the portfolio. Hence, relatively closer economic substitutes must

trade more frequently than relatively less close economic substitutes to decrease the trade-

weighted CESS. However, we observe a reverse pattern for the SSD portfolio with a CESS

increase of +19.70%. So, within a portfolio of highly correlated pairs, pairs which are relative-

ly less similar trade more often. We conclude that perfectly correlated pairs indeed generate

less trades than less correlated pairs. Furthermore, we found that even negatively correlated

pairs share common links and hence may earn a positive pairs trading return.

8 Robustness checks

The previous results naturally raise a number of questions. Section 8.1 discusses the question,

whether arbitrage opportunities are actually exploitable or only theoretical. Furthermore, we

investigate whether the returns of highly volatile or low correlated portfolios simply represent

a liquidity premium. Afterwards, section 8.2 evaluates, whether specific attributes of non-

close substitutes generate the volatility and correlation effect. Finally, section 8.3 answers the

question, whether the return is mainly driven by the short position, because of short selling

constraints.

8.1 Limits of arbitrage

The literature on limits of arbitrage suggests that market frictions deprive arbitrageurs from

practically exploiting trading opportunities (Shleifer/Vishny (1997)) due to price inefficien-

cies. In the absence of an economical foundation, one possible explanation for abnormal re-

turns is the persistence of such unexploitable trading opportunities. In the context of pairs

trading, Engelberg/Gao/Jagannathan (2009) emphasize the importance of trading costs, short

selling constraints and idiosyncratic volatility. Do/Faff (2012) investigate whether pairs trad-

ing is profitable, after controlling for commission, market impact and short selling constraints.

They find decreasing but still significant positive returns.

However, some of our pairs might be more volatile and possibly more and hence demand

higher trading costs. We address the most crucial issue of the bid-ask bounce, by directly trad-

ing with bid and ask prices. This procedure is more accurate than the traditional one-day-

waiting procedure.

Engelberg/Gao/Jagannathan (2009) examine divergence risk as possible trading barrier. Inves-

tors might be confronted with margin calls, if the short stock further declines. Subsequently,

some arbitrageurs must liquidate their positions to meet the demanded additional capital

commitment. In our analysis, divergence risk affects overshooting trades (closing type 3) in

particular. Similar to Table 3, Panel C, we examine the volatility and correlation effect on the

trading frequency of overshooting trades. Our unreported results indicate, that high volatility

and low correlation reduce the number of overshooting trades and hence divergence risk. This

finding is consistent with Engelberg/Gao/Jagannathan (2009), who find that high idiosyncratic

volatility decreases the divergence probability. Furthermore, the maximum stock divergence

is limited to 4hist, thus investors are unaffected by margin calls beyond this barrier. We con-

clude that divergence risk is not stronger for our pair combinations compared to other papers.

D’Avolio (2002) finds that only 16% of all stocks included in the monthly CRSP file can

eventually not be shorted. 91% of all stocks, including almost all S&P 500 constituents, cost

less than 1% to borrow and have a value-weighted mean fee of 17%. The remaining 9%

stocks, also called “special stocks”, have a mean fee of 4.5% per annum. Less than 1% of the

special stocks demand negative rebate rates and charge a fee of up to 50%. Not surprisingly,

smaller stocks demand higher fees. However, these special stocks account for less than 1% of

the market by value. We assume that the probability of “specials” in our dataset is relatively

low for two reasons. First, our stock universe is restricted to current or former members of the

S&P 1500. These stocks cover around 90% of the total market capitalization and should be

under regular investor’s attention. Second, our stock universe is restricted to stocks with a

listed bid and ask price. These restrictions secure the liquidity and tradability of our stocks.

A further threat, while shorting a stock, is the probability of a stock recall before prices fully

converge. Recalls are more likely, when prices are falling. So in pairs trading when prices

start to converge, the shorted stock of a pair is expected to fall and is hence more likely to be

recalled. However, D’Avolio (2002) observes a low recall rate of around 2% per month. Fur-

thermore, pairs traders can still earn a profit, if the price already converged until the time

where the stock was recalled.

To decrease the probability of hard-to-short stocks and recalls, we test our strategy with large

stocks included at the time in the S&P 100 or the NASDAQ 100. Stocks are excluded, if they

lose their index membership to consider possible liquidity declines. This robustness check

tests, whether our findings persist in an almost limits of arbitrage free setting, as all stocks are

liquid, easy-to-short, and highly efficient due to high analyst cover. Table 7, Panel A (B)

shows the results for the S&P 100 (NASDAQ 10018

).

[Insert Table 7]

All returns are smaller compared to previous returns. So, part of the previous returns can be

explained with short selling constraints, a liquidity premium, and efficiency issues. However,

all returns are still positive and significant. High correlation levels decrease the returns of low

volatility portfolios. The volatility effect persists, especially for strong correlation portfolios.

Moreover, the NASDAQ 100 portfolios outperform the corresponding S&P 100 portfolios.

This result is consistent with our previous results, as NASDAQ 100 stocks are more volatile

than the S&P 100 stocks. Taking all results into consideration, we conclude that the volatility

and correlation effect are robust to limits of arbitrage and liquidity issues.

8.2 The influence of volatility and correlation on Close Economic Substitutes

As fellow papers predominantly trade with close economic substitutes, this section investi-

gates, whether specific attributes of non-close economic substitutes cause the observed vola-

tility and correlation effect. Therefore, we construct a stock pair universe which exclusively

comprises close economic substitutes. The difference between all price sensitivities of both

stocks must be among the lowest 20% in each pricing factor category (CESS = 0), under the

constraint that both stock companies operate within the same 49 Fama/French industries. Sim-

ilar to the original procedure, we divide pairs into four volatility and four correlation quartiles

each month. Afterwards, sixteen double sorted groups are formed. 20 pairs are arbitrarily

drawn from each group and are eligible for trading in the next trading period. The trading pro-

cedure and return calculation remain identical.

Table 8, Panel A displays the monthly mean returns for all portfolios. The returns are all sig-

nificant, positive, and consistent with previous results. Most volatility interquartile differences

are highly positive and significant. At the most, the return incline exceeds 100% between the

lowest Q1 and the highest Q4 volatility quartile. Two conclusions can be drawn. Firstly, the

results establish the positive link between volatility and profitability. Secondly, volatility sig-

18

Bid and ask prices are reported starting in Jan 1995.

nificantly increases the return potential of close economic substitutes, while not contradicting

the original idea of trading with close economic substitutes.

[Insert Table 8]

In contrast, the differences between correlation quartile Q4 (high correlation) and Q1 (low

correlation) and also Q2 (low-medium correlation) are not significant. This observation is

little surprising, as our rigorous pair selection only picks highly correlated pairs. Hence, the

variation of correlation might be too small to cause a significant effect. Although the inter-

quartile differences are not significant, lower correlation slightly increases the return especial-

ly for low and medium volatility. This finding weakly supports the inverse link between cor-

relation and return per trade.

As a consequence of the insignificant correlation effect, we refrain from controlling for corre-

lation in the next analysis. This allows us to increase the number of volatility levels to explore

the volatility effect more precisely. For Panel B, we divide our restricted stock universe into

ten volatility deciles and repeat the previous selection and trading procedure. Again, all

monthly returns are significant and positive, ranging between 49bp in decile Q1 (lowest vola-

tility) and 125bp in decile Q9 (second highest volatility). In comparison to table 5 in section 6,

volatility decile Q1 combined with decile Q2 can be regarded as former Q1 quintile, and dec-

ile Q9 combined with decile Q10 as the former Q5 quintile.

Altogether, the inter-decile difference is sometimes insignificant. Larger inter-decile differ-

ences are however usually significant. Even the decile Q6 return (medium volatility) signifi-

cantly outperforms the decile Q1 and Q2 returns. These results confirm the volatility effect

even for close economic substitutes.

8.3 Short vs. long leg

Stambaugh/Yu/Yuan (2015) argue that return asymmetry of long-short strategies might origi-

nate from short selling constraints of overpriced stocks compared to easily exploitable under-

priced stocks. In this case, the isolated short leg return exceeds the long leg return. Alterna-

tively, Gatev/Goetzmann/Rouwenhorst (2006) ponder that long leg profits might represent a

compensation for an unrealized bankrupt. In this alternative case, the long leg contributes

more to the total return than the short leg. To shed further light on this topic, we compare the

isolated return of the short and the long leg, similar to Gatev/Goetzmann/Rouwenhorst (2006)

and Jacobs (2015).

Table 9 reports the median percentage contribution of the long leg to the total return. The per-

centage is below 50% in twenty-four out of twenty-five portfolios, indicating a higher contri-

bution of the short leg. The long legs of Vola_Q5/Corr_Q1&2 portfolios contribute relatively

little with a share of 31.64%-32.60%. As the return asymmetry is especially pronounced for

highly volatile stocks, the short leg contribution might originate from practically not exploita-

ble mispricing. However, the median long leg of all other VCS Portfolios contributes more

than 40% to the total return. It is therefore unlikely, that the return solely arise from unex-

ploitable mispricing.

[Insert Table 9]

9 Conclusion

We demonstrate that part of the pairs trading return is determined by the applied rule-based

algorithm itself. The strategy’s total return consists of the return per trade and the trading fre-

quency. We link volatility and correlation levels with both return building blocks. Our empiri-

cal results reveal that reverse correlated pairs earn twice as much as highly correlated pairs

per converging trade. Higher volatility can even triple the average return per converging trade.

Furthermore, we observe eight times more trades for pairs with extremely high correlation and

low volatility compared to negatively correlated pairs. Overall, the effect of high volatility

and low correlation is positive for the return per trade, but negative for the trading frequency.

To investigate the aggregated effect of pair volatility and correlation across both return build-

ing blocks, we calculate the monthly return. We find that high correlation (beneficial for the

return per trade) and high correlation (beneficial for the trading frequency) dominate. These

portfolios with high volatility and high correlation are the most superior ones, with a monthly

risk-adjusted return of up to 76bp in the US market between 1990 and 2014. The traditionally

formed portfolio yields only a monthly risk-adjusted return of 37bp during the same time pe-

riod. The results are robust to short selling constraints, liquidity issues, and exposure to com-

mon risk factors.

We find that perfectly correlated pairs are not necessarily the best pairs, and that close eco-

nomic substitutes must not be perfectly correlated. All portfolios include pairs which share at

least some similarities, which can generate temporary mispricing, if similarity specific infor-

mation is incorporated gradually. Merely, the distribution of high and low close economic

substitute pairs varies across portfolios. The comparison between average CESS and CESS

per trade reveals that pairs need economic similarities, but also some dissimilarity, to create

trading opportunities. Further research could concentrate on the specific attributes of news

that generate trades of less close economic substitutes.

Focusing on the stocks of one particular industry or country, or applying a certain selection

procedure like the SSD results in a preselection of pairs eligible for trading. This preselection

of pairs involves the determination of a given pair volatility and correlation combination. The

determination, in turn, affects the total return via the return per trade and the trading frequen-

cy.

Altogether, we learn that some part of the variation of pairs trading returns across industries,

countries, and over time, can be explained by the variation in pair volatility and correlation. In

the future, we should always consider the role of pair volatility and correlation, before deriv-

ing any economical deductions from pairs trading research.

References

[1.] Andrade, Sandro C.; di Pietro, Vadim; Seasholes, Mark S. (2005): Understanding the

Profitability of Pairs Trading. Working paper, UC Berkeley Haas School and Northwest-

ern University.

[2.] Antón, Miguel; Polk, Christopher (2014): Connected Stocks. The Journal of Finance,

Vol. 69, pp.1099-1127.

[3.] Bolgün, Kaan E.; Kurun, Engin; Güven, Serhat (2010): Dynamic Pairs Trading Strategy

for the Companies Listed in the Istanbul Stock Exchange. International Review of Ap-

plied Financial Issues and Economics, Vol. 2, No. 1, pp. 37-57.

[4.] Brennan, Michael; Jegadeesh, Narasimhan; Swaminathan, Bhaskaran (1993): Investment

Analysis and the Adjustment of Stock Prices to Common Information. The Review of Fi-

nancial Studies, Vol. 6, No. 4, pp. 799-824

[5.] Carhart, M. (1997): On Persistence in Mutual Fund Performance. The Journal of Finance,

Vol. 52, pp. 57–82.

[6.] Chen, H.; Chen, S.; Li, F. (2013): Empirical investigation of an equity pairs trading strat-

egy. Working Paper, University of British Columbia, University of Michigan.

[7.] Cohen, Lauren; Lou, Dong (2012): Complicated firms. Journal of Financial Economics,

Vol. 104, pp. 383–400.

[8.] Conrad, Jennifer; Kaul, Gautam (1989): Mean Reversion in Short-horizon Expected Re-

turns. Review of Financial Studies, Vol. 2, pp. 225–240.

[9.] D'Avolio, G. (2002): The Market for Borrowing Stock. Journal of Financial Economics,

Vol. 66, pp. 271-306.

[10.]Do, Binh; Faff, Robert (2010): Does Simple Pairs Trading Still Work? Financial Ana-

lysts Journal, Vol. 66, No. 4, pp. 83–95.

[11.]Do, Binh; Faff, Robert (2012): Are Pairs Trading Profits Robust to Trading Costs? The

Journal of Financial Research. Vol. 35, No. 2, pp.261–287.

[12.]Engelberg, Joseph; Gao, Pengjie; Jagannathan, Ravi (2009): An Anatomy of Pairs Trad-

ing: The role of idiosyncratic news, common in-formation and liquidity. Working Paper,

University of North Carolina, University of Notre Dame, Northwestern University.

[13.]Fama, Eugene; French, Kenneth (1993): Common risk factors in the returns on stocks

and bonds. Journal of Financial Economics, Vol. 33, pp. 3–56.

[14.]Krauss, Christopher (2015): Statistical arbitrage pairs trading strategies: Review and out-

look. IWQW Discussion Paper Series, No. 09/2015.

[15.]Gatev, Evan; Goetzmann, William N.; Rouwenhorst, K. Geert (2006): Pairs Trading:

Performance of a Relative-Value Arbitrage Rule. The Review of Financial Studies, Vol.

19, No. 3, pp. 797–827.

[16.]Huck, N. (2013): The high sensitivity of pairs trading returns. Applied Economics Let-

ters, Vol. 20, No. 14, pp. 1301–1304.

[17.]Huck, Nicolas (2015): Pairs trading: does volatility timing matter? Applied Economics,

forthcoming.

[18.]Jacobs, Heiko; Weber, Martin (2015): On the determinants of pairs trading profitability.

Journal of Financial Markets, Vol. 23, pp. 75-97.

[19.]Jacobs, H. (2015): What explains the dynamics of 100 anomalies? Journal of Banking &

Finance, Vol. 57, pp. 65–85.

[20.]Jegadeesh, N.; S. Titman (1993): Returns to Buying Winners and Selling Losers: Impli-

cations for Stock Market Efficiency. Journal of Finance, Vol. 48, pp. 65–91.

[21.]Lin, Y.-X.; McCrae, M.; Gulati, C. (2006): Loss protection in pairs trading through min-

imum profit bounds: A cointegration approach. Journal of Applied Mathematics and De-

cision Sciences, Vol. 1, pp. 1–14.

[22.]Lo, Andrew; MacKinlay, Craig (1990): When are Contrarian Profits Due to Stock Mar-

ket Overreaction. The Review of Financial Studies, Vol. 3, No. 2, pp. 175-205.

[23.]Papadakis, George; Wysocki, Peter (2007): Pairs Trading and Accounting Information.

Working Paper, Boston University School of Management/ MIT Sloan School of Man-

agement

[24.]Pastor, L.; Stambaugh, R. (2003): Liquidity risk and expected stock returns. Journal of

Political Economy, Vol. 111, pp. 642-685.

[25.]Perlin, Marcelo S. (2009): Evaluation of pairs-trading strategy at the Brazilian financial

market. Journal of Derivatives & Hedge Funds, Vol. 15, No. 2, pp. 122-136.

[26.]Stander, Y.; Marais, D.; Botha, I. (2013): Trading strategies with copulas. Journal of

Economic and Financial Sciences, Vol. 6, No. 1, pp. 83–107.

[27.]Shleifer, Andrei; Vishny, Robert (1997): The Limits of Arbitrage. The Journal of Fi-

nance, Vol. 52, pp. 35-55.

[28.]Stambaugh, Robert; Yu, Jiangeng; Yuan, Yu (2015): Arbitrage Asymmetry and the Idio-

syncratic Volatility Puzzle. The Journal of Finance, Vol. 70, pp. 1903-1948.

[29.]Tourin, A.; Yan, R. (2013): Dynamic pairs trading using the stochastic control approach.

Journal of Economic Dynamics and Control, Vol. 37, No. 10, pp. 1972–1981.

[30.]Vidyamurthy, G. (2004): Pairs trading: Quantitative methods and analysis. John Wiley &

Sons, Hoboken, N.J.

[31.]Xu, Yexiao; Malkiel, Burton (2003): Investigating the Behavior of Idiosyncratic Volatili-

ty. The Journal of Business, Vol. 76, No. 4, pp. 613-645.

Appendix

Figure 1: The interdependencies of the pairs trading algorithm and the strategy’s return

Figure 1 illustrates the interdependencies of pairs trading. The algorithm influences the pairs

selection and the maximum return per trade. The pairs selection, in turn, determines the corre-

lation and common volatility level. These levels influence the return per trade and the trading

frequency. The strategy’s total return is a function of the return per trade and the trading fre-

quency.

Figure 2: Distribution of the prices spread Xt during the identification and trading period

Figure 2 shows the density function of Xt during the identification period (left & right hand

side) and during the trading period (right hand side). Xt is the spread of the two normalized

prices of a pair on day t. 2hist is calculated as two times the standard deviation of Xt during

the identification period, and it sets the trigger barrier to open a new trade.

Figure 3: The return calculation of the monthly portfolio return

Figure 3 displays the calculation of the monthly portfolio returns. The daily return of a portfo-

lio is averaged across all active pairs within the portfolio. Afterwards, the daily returns are

cumulated to the monthly return. This return is again averaged across the six parallel traded

portfolios, where the start of each trading portfolios is staggered by one month.

Table 1: Correlation and volatility levels over quintiles

Table 1 reports the average correlation and the average common volatility per quintile for all

selected pairs as computed during the identification periods between January 1990 and June

2014. The sample includes all pairs of all twenty-five VCS portfolios of all 10 repetitions. The

volatility of a pair is defined as sum of stock A’s and B’s volatility.

Table 2: Classical selection criterion (SSD)

Table 2, Panel A displays findings from a panel regressions of the SSD on the standardized

pair volatility (Vola_AB_std) and the standardized correlation coefficient (CorrelCoef_std).

We control for pair fixed and time fixed effects. Standard errors are clustered at the pair level

and p-values (pvar) are reported in parentheses and adjusted for heteroscedasticity and clus-

tered by pair combination. ***, **, * denote significance at the 1%, 5% and 10% level.

Panel B benchmarks the monthly excess returns of the SSD portfolio with the returns of fellow

papers.

Statistical significance at the 10%, 5%, and 1% level is indicated by *, **, and ***, respec-

tively.

Table 3: Return per trade

Table 3 shows the average return per trade for double sorted portfolios on volatility and correlation

(VCS portfolios) between January 1990 and December 2014.

Panel A displays the average return per trade for natural trades, which close after the full conver-

gence of both stocks within the trading period.

Panel B displays the average return per trade for incomplete trades, which are forcefully closed on

the last day of the trading period.

Panel C displays the average return per trade for overshooting trades, which are closed if price

diverge by more than 4hist.

Panel D displays the average return per trade for delisted trades, which are close if one stock is

delisted while the pair is open. The trade universe includes all trades of pairs from a particular

VCS Portfolio of all pair sets over time.

Statistical significance at the 10%, 5%, and 1% level is indicated by *, **, and ***, respectively.

Table 4: Level shifts and frequency

Table 4, Panel A (B) reports the percentage of pairs that experience a volatility increase (correla-

tion decrease) between the identification and the trading period for each double sorted portfolios

on volatility and correlation (VCS portfolio). The analysis includes all trades of 57600 pairs for

each VCS portfolio (20 pairs per portfolio*10 sets*288 identification periods). Corr Q5-Q1 (Vola

Q5-Q1) shows the return of a long position in the highest correlation (volatility) quintile portfolio

and a short position in the lowest correlation (volatility) quintile portfolio. Statistical significance

at the 10%, 5%, and 1% level is indicated by *, **, and ***, respectively.

Panel C reports the total number of trades for each VCS Portfolios over all trading periods.

Table 5: Average monthly trading return

Table 5, Panel A reports the average monthly raw returns of trading portfolios for each double

sorted portfolios on volatility and correlation (VCS portfolio) between January 1991 and Decem-

ber 2014. For each quintile level of volatility and correlation, Q5-Q1(2) reports the monthly return

of a strategy that invests in the Q5 portfolio and sells the according Q1(2) portfolio.

Panel B displays the alphas from a time-series regression of the one-month pairs trading return on

a six factor model, including Fama/French’s three factor model, a momentum factor, a short term

reversal factor, and Stambaugh/Pastor’s liquidity factor. We use Newey-West Standard errors with

lag 6. Statistical significance at the 10%, 5%, and 1% level is indicated by *, **, and ***, respec-

tively.

Table 6: Close Economic Substitute Score

Table 6, Panel A displays the average Close Economic Substitute Score (CESS) within each dou-

ble sorted portfolio on volatility and correlation (VCS portfolio) and the SSD portfolio for all in-

cluded pairs between January 1991 and December 2014. The CESS is defined on a scale between 0

(highly similar price sensitivity to pricing factors) and 20 (highly dissimilar price reaction to com-

mon pricing factors). Common pricing factors include Fama/French’s three factor model, the mo-

mentum factor and the short-term reversal factor.

Panel B displays the percentage difference between the average CESS of a VCS or SSD portfolio

and the average CESS of all pairs that actually traded within the respective VCS or SSD portfolio.

A negative percentage indicates that pairs with a lower CESS traded more often than the average

pair within a portfolio.

Table 7: Pairs trading in highly liquid markets

Table 7, Panel A reports the average monthly raw returns of trading portfolios for each double

sorted portfolio on volatility and correlation (VCS portfolio) between January 1991 and December

2014. The sample includes only at that time current members of the S&P 100 between January

1990 and December 2014. For each quintile level of volatility and correlation, Q5-Q1(2) reports

the monthly return of a strategy that invests in the Q5 portfolio and sells the according Q1(2) port-

folio. Likewise, Panel B displays the average monthly return of NASDAQ 100 members.

Statistical significance at the 10%, 5%, and 1% level is indicated by *, **, and ***, respectively.

Table 8: Pairs trading with close economic substitutes

Table 8, Panel A reports the pairs trading returns of sixteen portfolios double sorted on volatility

and correlation between January 1990 and December 2014. All included stock pairs must be close

economic substitutes with the lowest possible CESS of zero. Furthermore, both stocks of a pair

must operate within the same industry (Fama/French 49 industry classification).

Panel B shows the return of ten portfolios solely sorted on volatility. The corresponding p-values

are reported in brackets. Q-Q1 refers to the return difference between the observed portfolio and

the portfolio with the lowest volatility Q1. Likewise, Q-Q2 refers to the return difference between

the observed portfolio and the portfolio with the second lowest volatility Q2. Statistical signifi-

cance at the 10%, 5%, and 1% level is indicated by *, **, and ***, respectively.

Table 9: Return per short and long leg

Table 9 investigates the contribution of the short and long leg separately and displays the median

percentage contribution of the long leg to the total return between January 1991 and December

2014.

Documents

Demystifying pairs trading: The role of volatility and ... Pairs Trading - … · Electronic copy available at : http ://ssrn.com /abstract = 2774063 Demystifying pairs trading: The