Pricing Multi-Strike Quanto Call Options on Multiple Assets with Stochastic Volatility, Correlation, and Exchange Rates ()
1. Introduction & Method
In modern finance, tradable assets are typically modeled with stochastic volatility, which was introduced by Hull and White in 1987 [1]. Stein and Stein (1991) introduced a mean-reverting stochastic volatility model where the volatility of asset returns follows a Brownian motion mean-reverting process [2]. This model reflects the observation that volatility tends to return to a long-term average level over time, providing a more realistic depiction of market behavior compared to constant volatility models. In 1993, Heston [3] correlated the stochastic stock price and stochastic stock price volatility by correlating their Brownian motions. Another example of a stochastic volatility model was developed by Ball and Roma (1994) [4]. Their model presents a framework for option pricing that accounts for stochastic volatility by simplifying the Fourier option pricing techniques and implementing power series methods. They demonstrate that the characteristic function of the average variance is crucial in this approach, particularly when there is no correlation between security price innovations and volatility. This model corrects certain biases in the Black-Scholes model, improving on Stein and Stein’s analysis [4]. Bates’ (1996) [5] model further extends Heston’s model by incorporating jumps in the asset price process, thereby capturing sudden, large movements in the market, which is a common feature observed in financial time series data. Modeling volatility as stochastic captures the empirical observation that market volatility tends to cluster over time, reflecting periods of high and low market uncertainty, which cannot be explained by a constant volatility model. Moreover, the phenomenon of volatility smiles has been studied extensively and seems to be alleviated by models with non-constant volatility ([3] [6]-[9]).
However, it is much less common in comparison to see such models extended further with a correlation that varies stochastically over time. Some existing research concerned with modeling correlations as stochastic is Engle 2002 [10], Lu & Meissner 2014 [11], Buraschi et al. 2010 [12] [13], and Da Fonseca et al. 2007/2008 [14]. Modeling correlation as stochastic (SC) is beneficial because it reflects the reality that correlations between asset returns are not constant and can change due to varying market conditions, such as shifts in economic cycles or changes in investor sentiment. This variability in correlation can significantly egimpact the pricing and hedging of multi-asset derivatives. In his research, Pricing Foreign Equity Options with Stochastic Correlation and Volatility (2009), Jun Ma develops a novel model of this type for foreign equity option pricing. Foreign equity, FX, and currency derivatives are widely traded on a global scale. Crucially, participants incur additional risk due to exchange rate uncertainty when trading foreign equity options, as highlighted by Ma in his paper [15]. Moreover, when trading derivatives that rely on multiple underlying assets, participants also incur an additional correlation risk that has to be accounted for. As clarified by Ma, Quanto options do not yield to pricing via the BS risk-neutral framework when we incorporate stochastic correlation [15]. In cases when a simple closed-form solution is unknown, some popular alternatives for pricing such derivatives are numerical methods, simulations, or series solutions.
The primary aim of this paper is to use simulations to tackle the problem of pricing Quanto options on two and three underlying assets under stochastic correlation and volatility driven by different stochastic differential equations (SDEs). The following models are tested and compared: Heston, GARCH, GARCH-Jump, 3/2 diffusion, and Bates for volatility, and Jacobi, Wright-Fisher diffusion, Weibull diffusion, and a mean-reverting SC for correlation. The study is focused specifically on Quanto options on two or three foreign equity market indices. These options act like a basket correlation option with the payoff depending on multiple correlated assets but also on exchange rates between the currencies of the indices. We test three different models of exchange rate dynamics, with both rates being either GBM, a mean reverting SDE inspired by the OU process, or an exponential levy process that incorporates jumps. The stochastic differential equations governing all of these SV, SC, and SER models and their key features can be found in Section 2 of the paper. The most unique feature of Quanto options is the payoff structure since it is paid in the foreign currency of the underlying but then converted to the domestic currency. Details of the payoff structure and the overall model outlines are discussed fully in Section 3 of the paper. Section 4 of our paper outlines the methodology of the Monte Carlo simulation and clarifies details of the discretization schemes for the SDEs, whilst Section 5 discusses the results. Due to the additional correlation risk, it is prudent to also consider how to effectively hedge such products with Cora and Gora, which is done in Section 6.
We collect observed market prices for the indices SP500, FTSE100, and STOXX600 (underlying assets), as well as the GBP/USD, EUR/USD, and EUR/GBP exchange rates from Yahoo Finance. The code written for this paper allows the user to select what date range the option will be over and, hence, what data to collect. For the paper, we focus on 2021-2022 and 2022-2023 as two specific time periods to test. We perform the comparison of all model variants and the discretization schemes for both cases of the option and for both of these date ranges as the lifetimes of the option. The constant parameters of the SDEs of the volatilities, correlations, and exchange rates are calibrated to real market data based on summary statistics of the SP500, FTSE100, and STOXX600 indices and USD/GBP, USD/EUR exchange rates values. For example, aside from the starting points of all the processes, we also selected the volatility parameters of the FX rate SDEs based on the rolling standard deviations for the exchange rates. Similarly, the mean-reversion level (
) and rate (
) of the Ornstein-Uhlenbeck (OU) process for modeling FX rates are determined using historical average exchange rates and autocorrelation analysis. Likewise, we use historical long-run averages, rolling standard deviations, and rolling correlations to select the parameters of the SV and SC SDEs. We ensure that look-ahead bias is avoided by only using data available on the start date of each option for the calibration. To facilitate a visual comparison with the plots of paths of different processes produced by the simulation, we standardize the plots of the observed trajectories of our underlying assets and exchange rates in different years to plot them with the same starting points with 100 used as an example value (Figure S29 and Figure S30). We also plot rolling window volatilities and correlations of the assets and exchange rates for different window sizes (Figure S31 and Figure S32). We use the 13-week US Treasury Bill rate from Yahoo Finance as the US domestic interest rate (Figure S24). The Bank of England [16] and European Central Bank [17] interest rates are used for the two foreign interest rates. Figure S1 shows the observed starting values for the underlying assets and the exchange rates. These values are used as the starting parameters for the MC simulations as discussed in Section 4.
2. Models of Stochastic Volatility, Correlation & Exchange Rates
The quest for pricing models that incorporate random volatility is driven by empirical evidence from various studies of financial time series supporting the hypothesis of stochastic volatility. Ma (2009) emphasizes that implied volatilities, calculated using the Black-Scholes formula [18], exhibit random fluctuations over time, manifesting in the term structure of implied volatility. Moreover, when we fix a maturity time T, and consider varying strike price K, it can be seen that implied volatility is higher for options contracts with only the moneyness level being different, which is often referred to as the volatility smile or skew. This pattern arises because the Black-Scholes model assumes constant volatility, which does not align with its market behavior, where volatility varies over time and across strike prices. While the volatility smile is typically symmetric for both puts and calls, this pattern is more common in currency (FX) and commodity markets, where implied volatility curves form a valley or smile shape, rising at both ends for deep in-the-money (ITM) and OTM options. In contrast, equity markets often exhibit a downward-sloping implied volatility graph, commonly referred to as a skew or smirk. In such markets, OTM put options tend to have higher implied volatility than ITM put options (left to right downward slope) and OTM call options tend to have lower implied volatility than ITM call options (right to left downward slope). Also, this pattern is more pronounced for puts than for calls due to increased demand for downside protection, reflecting perceived risks of large negative movements in asset prices. According to Ma, stochastic volatility models elucidate deviations from constant implied volatility, and it has been shown that they can capture the volatility smile better.
Traditionally, in the vast body of financial and economic literature on multi-asset options, the correlation coefficient between correlated variables has been assumed to be constant (e.g., see Black and Scholes, 1973 [18]; Margrabe, 1978 [19]; Garman, 1992 [20]). However, Ma (2009) highlights that relying on long-term estimates of constant correlation can be misleading, potentially resulting in significant mispricing and risk management issues. Historical correlations must be used with caution as they can be more unstable than volatility [15]. An alternative approach involves inferring implied correlations from market prices, akin to implied volatility, which offers an estimation of stochastic correlation based on market data [15]. In this study, we test different models of stochastic volatility of the underlying assets, of stochastic correlation between their Brownian motions, and of stochastic exchange rates.
2.1. Volatility Models
As mentioned in the introduction, we test five models for stochastic volatility: Heston, 3/2 volatility, GARCH, Bates, and GARCH-Jump. The first two rely on a mean reversion drift term, with the parameters kappa and theta being the rate of mean reversion and the long-run value to which the volatility process reverts, respectively. Also, the first two have the parameter sigma to control the standard deviation of the random fluctuations. 3/2 volatility can be thought of as a higher-order extension of Heston volatility. Bates and GARCH-Jump processes introduce jumps to replicate the behavior of sudden moves of volatility observed in markets with Poisson-process-driven, normally distributed jumps. We ensure that the simulation functions do not produce negative volatility for all of the SDEs of stochastic volatility models.
2.1.1. Heston Model
(1)
Note that setting
(2)
in the Heston model ensures the process is strictly positive (Feller condition [21]).
2.1.2. GARCH Inspired Model
The time-homogeneous GARCH process satisfies the following linear SDE according to Li et al. as described in their 2018 paper [22]. Hence, this model is very similar to the Heston model discussed above, but the diffusion term is multiplied by
to raise the power on the volatility component from 1/2 to 1...
(3)
2.1.3. GARCH Inspired Model with Jumps
The GARCH-Jump model extends the standard GARCH (Generalized Autoregressive Conditional Heteroskedasticity) framework by incorporating jumps, providing a more comprehensive tool to capture the aforementioned volatility dynamics observed in financial markets. This model is particularly useful in capturing the sudden large movements or jumps in asset prices that cannot be explained by continuous processes alone. The GARCH-Jump model was developed and extensively analyzed by Duan et al. (2004), who highlighted its efficacy in better fitting historical time series data and explaining the observed volatility smile in option prices. Their research demonstrated that incorporating jumps into the GARCH framework significantly improves the model’s performance in capturing the empirical features of asset returns and volatility [23]. We implement a variant of GARCH-Jump SV by adding a jump term to our GARCH-inspired model discussed above.
(4)
In our code implementation, the jumps are modeled as a compound Poisson process where the jump sizes are normally distributed. Specifically,
is constructed by first generating a number of jumps using a Poisson distribution with intensity parameter
. For each jump, the size is drawn from a normal distribution with mean
and standard deviation
. The total jump impact
is then the sum of all individual jump sizes occurring within a given time interval
. Hence, the jumps are modeled as follows:
(5)
Here,
is a Poisson process with intensity
, and
are i.i.d. normal random variables with mean
and variance
. Duan et al. (2004) employed the GARCH-Jump model to explore option pricing under conditions where both price and volatility exhibit jump-diffusion behavior. They found that the GARCH-Jump model provides a robust framework for understanding and predicting market behaviors characterized by sudden and significant changes. The model’s ability to capture jumps makes it particularly valuable for pricing derivatives and managing financial risk in environments subject to abrupt market movements [23].
2.1.4. Bates Model
The Bates volatility model extends Heston volatility with jumps [5]. Here, the parameters kappa, theta, and sigma have the same use as for the Heston volatility model.
(6)
where
represents the jumps with normally distributed jump sizes.
and
are two more parameters in the simulation of this model, which control the mean size and standard deviation of the jumps, respectively, as discussed above. Here
is a multiplier that controls the magnitude of the effect of the jumps as above.
2.1.5. 3/2 Model
The 3/2 stochastic volatility model is an extension of the constant elasticity of variance (CEV) model and was developed by Carr and Sun (2007) to better capture the dynamics of financial markets. In this model, the volatility of the underlying asset is driven by a process that is proportional to the power 3/2 of the volatility itself. The stochastic differential equation (SDE) governing the 3/2 model is given by:
(7)
In this context:
is the speed of mean reversion, determining how quickly the process reverts to its long-term mean
.
is the long-term mean level of the variance process.
represents the volatility of the variance process, indicating the magnitude of random fluctuations.
The 3/2 model is particularly useful in capturing the empirical features of volatility observed in financial markets, such as the leverage effect and the fact that volatility tends to spike during market downturns [24]. Carr and Sun (2007) developed this model to provide a more accurate framework for pricing options and other derivative securities. According to Carr and Sun, the 3/2 model has several desirable properties. The process remains non-negative and exhibits mean-reverting behavior, where the speed of mean reversion is proportional to the level of the process. The 3/2 model also yields closed-form solutions for the joint Fourier-Laplace transform of returns and their quadratic variation, which is useful for efficiently pricing and hedging derivatives [24].
2.2. Correlation Models
Correlations can be influenced by factors such as industrial production, T-bill rates, and unanticipated inflation, often acting as a business cycle indicator. Even after adjusting for business cycle effects, correlation risk persists (Driessen, Maenhout, and Vilkov, 2006 [25]). Although the correlation coefficient between two assets is not directly tradable, it remains crucial to devise hedging strategies for correlation risk. Developing robust frameworks for constructing portfolios to hedge against correlation risk can ensure more secure risk management practices [15]. For all of the stochastic correlation models, the SDEs have mean reversion or bounds or are clipped to ensure that correlations remain within [−1, 1]. Four models for stochastic correlation of increasing complexity are implemented in this study. The simplest is a stochastic correlation SDE inspired by the modeling of processes in studies of genetics, which has a diffusion term, making sure it stays within [−1, 1]. The next simplest is the Jacobi correlation used by Ma, which gives the user the option to keep the correlation process within bounds h and f by altering this diffusion term. The second most complicated model is the mean-reverting extension of the first two simpler models. The most complex model that is tested is the Weibull distribution stochastic correlation model.
2.2.1. Wright-Fisher (WF) Model
Wright-Fisher diffusions are used in biology and biochemistry to model gene frequencies and other natural/bodily processes. As explained in their paper, A mean-reverting SDE on correlation matrices, Ahdida et al. focus on stochastic differential equations ‘valued on correlation matrices’ [26] and develop a mean-reverting extension of the Wright-Fisher SDE to model correlations in finance. In our paper, we test both the original diffusion on [−1, 1] and the mean-reverting extension developed by Ahdida et al. as alternatives for modeling correlation. We do not implement the original version of the diffusion on [0, 1] since we want the process to mimic the variability across the range [−1, 1] when modeling financial correlations. The classic WF diffusion on [−1,1] has the SDE:
(8)
Here,
represents the instantaneous correlation at time
.
is the mean reversion rate, determining how quickly the process reverts to the long-term mean correlation
.
is the long-term mean correlation towards which
reverts.
represents the volatility of the correlation process, indicating the extent of random fluctuations around the mean, and
is a standard Wiener process, as usual. Ahdida et al. highlight that the term
ensures that the correlation
remains within the interval
. This term becomes zero when
approaches the boundaries, preventing it from exceeding these limits. This bounded characteristic makes this version of the WF diffusion particularly suitable for modeling correlations in finance, where it is critical to maintain realistic correlation values [26].
2.2.2. Jacobi Process
The Jacobi process is used to model stochastic correlation and is described by the following stochastic differential equation (SDE):
(9)
where
represents the correlation at time
,
is the mean reversion rate,
is the long-term mean correlation, and
and
are the upper and lower bounds of the correlation, respectively. The term
denotes a standard Wiener process as usual. The parameter
determines the speed at which the correlation reverts to its long-term mean
. A higher value of
implies a faster reversion. The parameters
and
set the natural boundaries for the correlation, ensuring that it remains within a realistic range. The volatility parameter
controls the amplitude of fluctuations around the mean [15].
The Jacobi process is particularly useful for modeling correlation because it can capture both the mean-reverting nature and the bounded behavior of correlation coefficients. The square root term
ensures that the correlation stays within the interval
. Ma (2009) introduced the use of the Jacobi process in the context of pricing foreign equity options with stochastic correlation. This model allows for a more accurate reflection of market dynamics compared to constant correlation models. The Jacobi process can be seen as an extension of other mean-reverting processes, such as the Ornstein-Uhlenbeck process, but with the added complexity of bounded behavior. This makes it particularly suited for financial applications since correlations tend to naturally exhibit such characteristics [15].
2.2.3. Mean-Reverting Correlation
This is an extension of the WF diffusion on [−1, 1] developed by Ahdida et al. (Ahdida 2013), which includes an adaptive mean-reversion component. The mean-reverting SDE is given by:
(10)
Here
,
,
,
, are all the same as for the WF correlation model discussed above. Also, the term
has the same use of ensuring that the correlation
remains within the interval
. By adjusting the mean-reverting drift term
, Ahdida et al extend the process to capture the empirically observed tendency of correlations to revert to a long-term mean with the inclusion of the
to ensure that the reversion speed adjusts dynamically based on the current correlation level [26].
2.2.4. Weibull Model
The Weibull model is characterized by the following stochastic differential equation:
(11)
where
is a mean term derived from Weibull distribution parameters. This model was originally proposed by Miñano et al. (2013) for wind speed modeling, demonstrating its efficacy in generating wind speed trajectories with desired statistical properties [27]. In this context,
and
are diffusion terms uniquely defined to ensure the model captures the desired statistical properties. Specifically,
and
are given by:
(12)
(13)
where
is the probability density function (PDF) of the Weibull distribution,
is the scale parameter,
is the shape parameter, and
represents the Gamma function. The inclusion of
and
ensures the model’s diffusion term is appropriately scaled to reflect the nature of the underlying process. This formulation makes the Weibull model particularly suitable for phenomena where the Weibull distribution provides a good fit, such as wind speed data and potentially skewed financial data. Specifically, this model was designed to simulate wind speeds that follow a Weibull distribution and exhibit exponential autocorrelation, as discussed in the work by Miñano et al. [27]. Their objective was to accurately replicate the statistical properties of wind speed for applications in power systems, highlighting the model’s ability to generate realistic wind speed trajectories for various simulations and analyses. To ensure the mathematical validity of the model, it is essential that
paths stay non-negative and
, as the Weibull distribution is defined only for non-negative values and requires a positive shape parameter. These conditions guarantee that the diffusion terms
and
are well-defined, ensuring the stochastic process remains within the domain where the Weibull distribution accurately describes the behavior. Therefore, this SC model works in situations when we want to model positive correlations.
It has been shown that the Johnson SB distribution is a best-fit distribution for equity and default probability correlation distributions [28]. Also, Gunter Meissner et al. conducted a comprehensive analysis of correlation behavior based on daily closing prices of 30 stocks within the Dow Jones index, spanning the period from January 1972 to October 2012 [29]. Their findings indicate that correlation levels are at their lowest during periods of robust economic growth, wherein equity prices are predominantly influenced by idiosyncratic factors rather than broader macroeconomic conditions. Conversely, during recessions, correlation levels typically rise as macroeconomic factors overshadow idiosyncratic influences, leading to simultaneous downturns across multiple stocks. Furthermore, the volatility of correlations is observed to be lowest during economic expansions and higher during normal periods and recessions [29]. The study also highlighted that the Johnson SB distribution, characterized by its shape, location, and scale parameters, provided the most accurate fit for modeling these correlations. This distribution’s flexibility effectively captures the intricate properties and variations of the correlation data observed under different economic conditions [29]. Something we want to explore further in future work is to apply the approach used for the Weibull correlation model to capture statistical properties of the distribution in the SDE to develop a stochastic correlation model that captures desired properties of the Johnson SB distribution and incorporates mean reversion.
2.3. Models of Stochastic Exchange Rates
The domestic currency for the investor in these Quanto options is US dollars. In our paper, one of the underlying assets is denominated in the home currency (US dollar). For all choices of correlation and volatility, the underlying assets are modeled with the same SDEs throughout. We test three SDEs for the exchange rates (scenarios 1 - 3). The simplest model is for both exchange rates to be GBM [30] with different parameters based on observed starting values and summary statistics (as outlined in section 1). We also test a model with mean-reversion incorporated into the SDEs used for both exchange rates. It has been shown in many studies that foreign exchange rates tend to exhibit strong mean reversion [31] [32]. In the third model, both exchange rates follow an exponential Lévy process, which incorporates jumps into the GBM model of scenario 1 [33] [34]. The foreign risk-free interest rate is
. This is the risk-free rate of return in the first foreign currency, which could be the interest rate of a government bond or another risk-free security in the foreign market of GBP.
is the risk-free rate of return in the second foreign currency, and
is the domestic interest rate. In the equations below,
represents the rate of mean reversion,
is the mean level to which the process reverts,
is the volatility, and
is the increment of the Wiener process.
2.3.1. Geometric Brownian Motion (GBM)
The GBM is characterized by its exponential growth with constant drift and volatility [30].
(14)
2.3.2. Ornstein-Uhlenbeck (OU) Process
In this scenario, we model the exchange rates as an OU process, which introduces mean reversion into the drift term.
(15)
2.3.3. Exponential Lévy Process
In this scenario, the exchange rates follow an exponential Lévy process, which incorporates jumps into the SDEs, accounting for sudden and significant changes in the exchange rates [33] [35]. The key parameters in this model include
for volatility,
for the jump intensity,
for the mean jump size, and
for the jump size volatility.
(16)
where
is the jump component, modeled as before (Sections 2.1.3 and 2.1.4) for the GARCH-Jump and Bates SV. These jumps account for sudden, significant changes in the exchange rates, making the exponential Lévy process a suitable candidate for modeling FX rates with jumps [35] [36].
3. Basket Quanto Calls
We test all combinations of choices of the 5 SV, 4 SC, and 3 SER models outlined above for pricing 2 types of basket Quanto call options. Case 1 involves two underlying assets and one exchange rate, whilst Case 2 has three underlying assets and two exchange rates. Both Cases 1 and 2 include an underlying asset in the domestic currency of US dollars, and the other asset(s) must be converted into dollars using the exchange rate(s). In this paper, we model the domestic and foreign interest rates as constants throughout the lifetime of the options. We pick the values observed on the start dates of the options. However, as shown in Figure S24, the interest rates are not usually constant in practice, even over a 1-year time window. We use the 13-week US Treasury Bills rate from Yahoo Finance as the US domestic interest rate, and it can be seen to change in value significantly throughout 2022-2023 (more than 5% change) but remain relatively constant in the prior year. Similar observations can be made with the BoE rates. Thus, it could be prudent to model interest rates as a deterministic function of time, mixed jump diffusion, or discrete event/jump process (compound Poisson or Hawkes, for example), with appropriate adjustments for monthly or quarterly frequencies but this extension is left for future work. In addition, modeling interest rates with SDEs would also allow us to explore whether we should introduce correlations between them. As seen in Figure S24, all three interest rates appear to be strongly positively correlated.
The overall models for the pricing of our Quanto calls using specific choices of the SDE models for SV, SC, and SER can be written as follows:
3.1. Case 1: Two Underlying Assets-Single FX Rate
3.1.1. Variables
: Price of the 1st underlying (e.g. GBP asset).
: Price of the 2nd underlying (e.g. USD asset).
: FX rate (e.g. for USD/GBP).
3.1.2. Stochastic Differential Equations
Underlying Asset Prices
(17)
(18)
SV Choice (same process for both underlying assets)
(19)
(20)
SC Choice
(21)
SER Choice
(22)
3.1.3. Correlation between Brownian Motions
(23)
Here,
and
are independent standard Brownian motions.
3.1.4. Payoff
(24)
Here,
denotes the jump component of volatility,
denotes the jump component of correlation, and
denotes the jump component for the FX rate, as defined in Section 2. Our choice of the SV, SC, and SER models determines the m, s, and
functions above. For example, GARCH-Jump SV has
,
, and
. Similarly, we can write the general equations for our model for Case 2 as shown below.
3.2. Case 2: Three Underlying Assets: Two FX Rates
3.2.1. Variables
: Price of the GBP asset.
: Price of the USD asset.
: Price of the EUR asset.
: FX rate for USD/GBP.
: FX rate for USD/EUR.
3.2.2. Stochastic Differential Equations
Underlying Asset Prices
(25)
(26)
(27)
SV Choice (same process for all underlying assets)
(28)
(29)
(30)
SC Choice (between both pairs, USD-GBP and USD-EUR)
(31)
SER Choice (same process for both FX Rates, USD/GBP, & USD/EUR)
(32)
(33)
3.2.3. Correlation between Brownian Motions
(34)
Here,
,
,
, and
are independent standard Brownian motions.
3.2.4. Payoff
(35)
Hence, as highlighted before, we correlate the Brownian motions of the processes of the underlying assets and model this correlation as stochastic. Also, we model the volatility of the underlying as stochastic and independent from the level of the underlying. Moreover, the FX rates are modeled as stochastic and independent from each other as well as from the level of their respective underlying asset. In the simulation, we use different sets of increments of Brownian motion for all the stochastic processes and only introduce a correlation between those of the underlying assets. This correlation, between the Brownian motions of USD and GBP assets as well as between the Brownian motions of the USD and EUR assets, is the same in our experiments. In all of the SC models, we tested
as we do not investigate the presence of jumps in correlation.
4. Discretization of SDEs & MC Simulation
We test all 60 (=5 * 4 * 3) combinations of different choices of the SV, SC, and SER models with three different discretization schemes and we use the Euler-Maruyama scheme only for the SDEs of the underlying assets throughout. The Euler-Maruyama Scheme is the simplest computationally as it only includes the first three terms of the Ito-Taylor expansion applied to the SDE. However, it is expected to yield limited accuracy since this approximation expands the drift term to
but only expands the diffusion term to
. The Milstein scheme should yield improved accuracy since a second diffusion term is added, expanding the diffusion term to
as well. Although the Milstein scheme has a higher order, its main drawback is that we need to compute the first derivative of the volatility function. This may not always be possible, or it may be computationally expensive. The Runge-Kutta scheme can be used to alleviate this issue while maintaining this higher-order by leveraging the Runge-Kutta approximation of the derivative required in the Milstein scheme. Higher-order Runge-Kutta schemes can be derived by including a more detailed approximation of this derivative. The SDE of a general Itô diffusion
is
and can be discretized via the three alternatives as described below. Here, when discretizing and simulating the increments of Brownian motion, , we write
with
,
, and
. To apply the Euler-Maruyama, Milstein, and Runge-Kutta discretization schemes for SDEs that include an additional jump term, such as those in the GARCH-Jump, Bates Stochastic Volatility (SV), and Exponential Lévy FX rate models, these schemes need to be adjusted as described below. The SDE for an Itô process with an additional jump component can be written as
(36)
where
represents the jump term, often modeled as a compound Poisson process. Indeed, in our paper, the jumps are modeled as compound Poisson processes as discussed in Section 2.1.3. Here, we have
, where
represents the cumulative effect of jumps in the interval
. Specifically,
, where
represents the number of jumps occurring in the time interval
, and each
represents the jump sizes drawn from a normal distribution with mean
and variance
. Thus,
.
4.1. Euler-Maruyama Scheme
(37)
(38)
4.2. Euler-Maruyama Scheme with Jumps
(39)
4.3. Milstein Scheme
(40)
(41)
4.4. Milstein Scheme with Jumps
(42)
4.5. Runge-Kutta Scheme
(43)
(44)
where
(45)
4.6. Runge-Kutta Scheme with Jumps
(46)
(47)
In summary, to handle an additional jump term in our SDEs we add the jump component directly to the discretization for the Euler-Maruyama and Milstein schemes, and the Runge-Kutta scheme also requires calculating the intermediate values including the jump component, and then adjusting the final update accordingly. These modifications ensure that each discretization scheme accurately captures the effects of both continuous and jump components in the SDEs.
4.7. Simulation Procedure
1) Initialize paths: Set initial values for
,
,
,
,
, and
. Set T (time to expiration in years and
per path in days is T*252), N (number of paths in simulation), set increment
for daily time series.
2) Simulate paths over time:
- For each time step, generate Brownian increments and jump instances.
- Update the volatility paths using the specified volatility model and discretization scheme.
- Update the correlation paths using the specified correlation model and discretization scheme.
- Update the FX rate paths using the specified model and discretization scheme.
- Update the underlying asset paths using the simulated volatility and correlation paths according to the Euler-Maruyama discretization scheme.
3) Calculate payoffs: Use the final asset prices and FX rate values to calculate the payoff for each simulated path. For Case 1, the payoff for each path is calculated as:
(48)
For Case 2, the payoff for each path is calculated as:
(49)
4) Discount and average payoffs: Discount the payoffs to present value and average them to get the option price. The discounted option price is given by:
(50)
where
is the number of simulated paths,
is the domestic risk-free rate, and
is the time to maturity.
4.8. Variance Reduction - Antithetic Variates
Variance reduction methods seek to improve the efficiency of an estimator, enabling more accurate results for a given number of simulations, N. Some choices of variance reduction that could be applied to our Monte Carlo simulations include control variates, which leverage the known expectation of auxiliary variables to reduce variance, and randomized Quasi-Monte Carlo (QMC), which replaces purely random sampling with low-discrepancy sequences (e.g. Halton, Sobol, or Fauer) to enhance convergence properties. In this paper, we implement antithetic variates as a form of variance reduction to improve our Monte Carlo simulations. Antithetic variates aim to reduce the variance of the estimator by introducing a negative correlation between paired simulations, thereby improving the convergence rate of the Monte Carlo estimator without increasing the number of simulations. In our context of Monte Carlo simulations involving stochastic differential equations (SDEs), this involves generating antithetic paths by using the negative of the original Brownian motion increments. Then, by averaging the results from the original and antithetic paths, the variance of the estimator is reduced. Specifically, let
be an estimator of the option price based on the original simulations, and
be the estimator based on the antithetic simulations. The combined estimator is then:
(51)
Now, since
and
are negatively correlated, the variance of
is reduced compared to the variance of either
or
alone. The variance of our estimator can be expressed as follows:
(52)
and our X’s both have equal variance so this expression can be simplified further as follows:
(53)
which means that the variance of our antithetic estimator is less than that of the original estimator by
if the covariance between the antithetic and original versions of the random variables is negative:
(54)
Hence, the effectiveness of the antithetic variates method depends on the extent of the negative correlation between the estimators
and
. The more negatively correlated they are, the greater the variance reduction. Since
, we can rewrite this reduction in variance as follows:
(55)
4.8.1. Antithetic Brownian Increments
To incorporate antithetic variates into our simulation process, for each path in the simulation, we generate a corresponding antithetic path by using the negative of the samples of a standard normal random variable in the Brownian increments used in the original path. This approach is applied to all the Brownian motions involved in the simulation. Specifically, if we let
be the Brownian increments generated for the original path at time step
. Then, by writing
, we get that the antithetic increments
are given by:
4.8.2. Modification of the Simulation Procedure
We modify the steps of our simulation process outlined in Section 4.7 as described below to incorporate antithetic variates. Step 1 is the same as in the original version, but Steps 2 and 3 change slightly as we have to include the generation of the antithetic versions for all Brownian increments and updating all paths (including the antithetic versions), respectively. Step 4 also mainly stays the same but we need to compute the payoffs for both the antithetic paths and the normal paths. Then, the key adjustment is in changing step 5 as follows:
New Step 5 discount and average combined payoffs: Compute the combined payoff for each pair of original and antithetic sets of paths as:
(56)
As before, discount the combined payoffs and average them to obtain the option price estimate:
(57)
In both Case 1 and Case 2 of our options, since our payoff functions are monotonic in the asset prices (& FX rates) and all the SDEs we use are linear in the Brownian increments, the negative correlation introduced at the level of the Brownian increments propagates through to the terminal asset prices, and the option payoffs. Also, the symmetry of the Gaussian distribution guarantees that when we negate a (zero-mean) Normal random variable it is still distributed according to the same distribution. Because of this, for any of our SDEs, the solution of the version modified with antithetic Brownian increments has the same distribution as the original SDE’s solution. Indeed, with this, it can be shown that the antithetic and original option price estimates are identically distributed and have a negative covariance, as required for variance reduction via antithetic variates.
5. Results Comparison & Discussion
We use the number of simulation paths with N = 500,000 for all of the simulations in this study. This number of realizations performed well in the preliminary testing of the code. The lists of parameter values used for the simulations of 2021 and 2022 starting dates, respectively, are shown in the last two figures. To compare the performance of the discretization schemes and model variants, we compute the standard deviation of the option value estimates and 95% confidence intervals (using z* = 1.96). Then, all 180 model variants are ordered from lowest standard deviation to highest, and we compute the average price estimate of the 40 best variants to be the ‘true’ value of the option. Then, all the models are re-ordered by the lowest to highest percentage error from this value, and the top 30 are displayed in bold in the results tables. Figure S2 and Figure S3 display the top 30 models by percentage error (PE) for cases 1 and 2 respectively and for both start dates we test (2021 on the left and 2022 on the right subplots). Figures S4-19 (inclusive) show the outputs for all model variants and discretization schemes ordered by PE. From our simulations, the target values (average of 40 best models by standard deviation) for case 1 of the option with 2021 and 2022 start dates were $ 2653.73 and $ 2837.10, respectively (2 d.p.). For Case 2 of the option, our simulations yield the values $ 2667.76 and $ 2854.90 for 2021 and 2022 start dates, respectively (2 d.p.). It makes sense that these options were slightly more expensive with the 2022 starting date since all the assets were significantly more expensive on 2022-01-04 than on 2021-01-4, as shown in Figure S1. These are call options, so the strike is subtracted from the respective underlying (weighted by exchange rate) in each component of the payoff. This means the option price should be greater if the prices of the underlying assets are all higher ceteris paribus. Comparing models by percentage error relative to this calculated value is an attempt to perform model selection even though we do not have real observed prices, another pricing method, or a closed-form/series solution to compare our MC simulation prices and performance to. As shown in Figure S2 and Figure S3, the best-performing combination of SV, SC, and SER models for Case 1 is (GARCH-Jump, Weibull, OU) for both start dates of the option. For Case 2, the best-performing combination is also (GARCH-Jump, Weibull, OU). The 3/2 SV model was a close runner-up in our simulations but a higher number of the top 10 models across both cases and start dates of the options include GARCH-Jump SV. The combination of the Weibull SC and OU SER models performs better than all the alternatives we tested for SC and SER choices. Figures S33-41 show plots of some sample paths from each of the (9) SDEs in our best model over 252 time steps.
As a benchmark, we also test the performance of all model variants with constant correlation. Figures S20-23 (inclusive) show the outputs of the MC simulations with constant correlation with the tables of the model variants ordered by percentage error. The best-performing model variants with constant correlation achieve 0.436 (case 1 with 2021 start), 0.154 (case 2 with 2021 start), 0.147 (case 1 with 2022 start), and 0.431 (case 2 with 2022 start) percentage error. These best-performing model variants have the following combinations of SV, SER, and discretization scheme: (3/2, GBM, Runge-Kutta) for Case 1 with 2021 start, (3/2, GBM, Euler) for Case 2 with 2021 start, (GARCH-Jump, GBM, Runge-Kutta) for case 1 with 2022 start, and (GARCH-Jump, GBM, Milstein) for Case 2 with 2022 start. For Case 1 of the option with a 2021 start date, this performance places the best constant correlation model 13th overall by percentage error when compared to the results of all models with stochastic correlation. This indicates that the inclusion of stochastic correlation significantly improves pricing via MC simulation. However, for Case 2 with the same start date and both cases of the option with the 2022 start date, the best performing constant correlation models place 4th, 4th, and 5th, respectively, when compared to the 30 best model variants with SC. As highlighted by Ma, market data analysis reveals that implied correlation often deviates from realized correlation, indicating a non-zero correlation risk premium [15] (Buraschi, Porchia, and Trojani, 2006 [12]; Driessen, Maenhout, and Vilkov, 2006 [25]). Moreover, Figure S20 shows that correlations seem to fluctuate randomly over time and are far from constant. This evidence supports the inclusion of random correlation structures in derivative pricing models.
The execution times of the simulations of all of the model variants are also recorded in the code and displayed in seconds. The Euler scheme generally exhibits faster execution times and performs well. The Milstein and Runge-Kutta schemes show lower stdevs, indicating higher accuracy compared to the Euler scheme. The Runge-Kutta scheme often shows the lowest stdevs, indicating the best accuracy, albeit with longer execution times. Based on our simulations, the Milstein discretization scheme offers the best balance between execution times and lower standard deviations. However, the Runge-Kutta scheme should be used when the derivative of the coefficient of the diffusion term is not available or hard to compute. In addition, the simulations for the 2022-2023 period generally showed slightly increased execution times and wider confidence intervals compared to the 2021-2022 period, reflecting the increased volatility and market uncertainty during that time frame. This is also reflected in the increased rolling correlations. Also, the simulations took slightly longer on average for Case 2 of the option than for Case 1 as it involves more processes to be simulated. For the same reason, price estimates for Case 2 of the option generally have wider CIs on average than those for case 1.
6. Hedging Correlation Risks
In the context of our foreign equity quanto call options, Cora and Gora are metrics used to quantify and manage the correlation risks associated with the inclusion of multiple correlated underlying assets in the payoffs of these complex derivatives. Cora (correlation delta) measures the sensitivity of the option’s value to changes in the correlation between the underlying assets and Gora (correlation gamma) measures the second-order sensitivity. It is a crucial parameter for understanding how variations in correlation impact the option’s price, especially when the correlation itself is stochastic. By using Cora and Gora, traders and risk managers can better understand and manage the impact of correlation changes on the option prices, ensuring more effective hedging strategies for portfolios that include basket foreign equity quanto call options such as those explored in our study [37]. Cora is defined as the first-order partial derivative of the option price (discounted expectation of payoff) with respect to correlation:
(58)
Gora is defined as the second-order partial derivative of the option price with respect to correlation:
(59)
The discount factor can be taken outside of these derivatives, so the main task of deriving Cora and Gora becomes taking derivatives of the option’s payoff and applying the chain rule. Also, in our models the SC only applies to the BMs driving the stochastic processes of the underlying assets and the BMs driving the FX rate processes are independent of this stochastic correlation process. This allows us to simplify the formulas for Cora and Gora as all partial derivatives of the FX rate processes with respect to this SC are 0.
6.1. Case 1: Single FX Rate
For the single FX rate case, where the option payoff is given by:
(60)
The Cora and Gora metrics can be derived as follows: Cora measures how sensitive the option’s value is to changes in the correlation between the GBP asset price and the USD asset price. Mathematically, this sensitivity is expressed as:
(61)
Gora measures the rate of change of the sensitivity (Cora) with respect to the correlation between the GBP and USD asset prices. It captures the curvature of the option price with respect to correlation changes, indicating how Cora itself responds as the correlation changes:
(62)
Using the chain rule, the first partial derivative of
with respect to
is:
(63)
then, since the partial derivative of the FX rate process with respect to the correlation is 0, we get:
(64)
For the second partial derivative, we get:
(65)
Simplifying fully, we get:
(66)
6.2. Case 2: Two FX Rates
For the two FX rates case, the option payoff is given by:
(67)
Here, Cora and Gora metrics need to account for multiple correlations: between GBP and USD asset prices and between EUR and USD asset prices. The Cora for each of these correlations would be defined as:
(68)
(69)
Similarly, the Gora metrics would measure the second-order sensitivity for each of these correlations, indicating how each Cora changes with respect to changes in the corresponding correlations:
(70)
(71)
Using the chain rule, the first partial derivative of
with respect to
is:
(72)
Similarly, the first partial derivative of
with respect to
is:
(73)
Simplifying further, we get:
(74)
(75)
For the second partial derivatives:
(76)
Similarly, for
:
(77)
Therefore, we get:
(78)
(79)
7. Conclusions & Future Work
Ultimately, we conclude that to accurately price and hedge multi-asset foreign equity quanto options, it is essential to incorporate stochastic correlations alongside stochastic volatilities and the appropriate modeling of stochastic FX rates. The payoffs of our Quanto call options incorporate multiple correlated underlying assets weighted by the FX rate values (which are uncorrelated in our paper). This makes accurate modeling of the FX rates and stochastic correlation between the BMs of the underlying assets essential for achieving optimal performance in MC simulations. By systematically testing all combinations of choices for a varied selection of SDEs for SC, SV, and SER, we identify the most effective configuration of our model. Overall, the combination of GARCH-Jump SV, OU FX rates, Weibull SC, and the Milstein or Runge-Kutta discretization scheme consistently performs well across both cases of the option and start dates we tested. We also find that incorporating mean reversion into stochastic correlation or stochastic FX rate modeling is beneficial for MC simulation pricing. Specifically, it seems that incorporating mean reversion into stochastic correlation models is beneficial not only to ensure simulated correlation paths stay within the realistic range [−1, 1] but also since the motions of correlations between assets observed in the market demonstrate this property.
Moreover, hedging correlation risks is a crucial aspect of using multi-asset Quanto options effectively in practice. We derive formulas for Cora and Gora of our Quanto options in terms of partial derivatives. Our derived Cora and Gora expressions can be made even more explicit by evaluating the partial derivatives for specific choices of SV and SC of the underlying assets, which allows for efficient hedging of correlation risk for both cases of the options. The next step in our research is to extend our framework to handle payoffs involving three or more assets without necessarily including an asset in the investor’s domestic currency. The primary challenge of this extension is the increased number of stochastic differential equations (SDEs) that must be discretized and simulated, which grows quickly with the addition of more currencies and foreign equity indices. Another potential consideration for future work is introducing jumps into the stochastic correlation processes as we did with volatility (Bates and GARCH-Jump models). Whilst we do not test such models in this paper, sudden moves in correlations are also feasible in some market conditions, so this should be studied. In addition, modeling volatilities as driven by fractional Itô processes (“fractional stochastic” or “rough” volatility models) is worth exploring. Fractional Brownian motion (fBM) with Hurst parameter
can be used to better capture the long-memory property of volatility. The FSV model utilizes fractional Brownian motion
to incorporate long-memory effects, where the increments of
are positively correlated when
. This allows the model to capture the mean-reverting nature of volatility without explicit mean-reversion terms. The significance of the Hurst exponent
lies in its ability to describe the roughness or smoothness of the volatility paths, with
indicating roughness and
indicating smoothness. This approach has the advantage of supposedly aligning better with observed market volatilities compared to traditional models. Additionally, there is potential to extend this framework to model stochastic correlation, allowing for both fractional stochastic volatility and fractional stochastic correlation. As shown in Figure S32, for shorter rolling windows the observed correlations look like they could potentially be modeled more effectively by a fractional stochastic process.
Supplementary: List of Figures
Please see the following links for the figures:
Figure S1. (Table of Observed Values) in the link
http://dersoft.com/All_figures/Table_values.png
Figure S2. (Case 1 Top 30 Model by PE) in the link
http://dersoft.com/All_figures/Case_1_top_30_by_PE.png
Figure S3. (Case 2 Top 30 Model by PE) in the link
http://dersoft.com/All_figures/Case_2_top_30_by_PE.png
Figure S4. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 1, Part 1) in the link http://dersoft.com/All_figures/Case_1_1st_2021.png
Figure S5. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 1, Part 2) in the link
http://dersoft.com/All_figures/Case_1_2nd_2021.png
Figure S6. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 1, Part 3) in the link
http://dersoft.com/All_figures/Case_1_3rd_2021.png
Figure S7. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 1, Part 4) in the link
http://dersoft.com/All_figures/Case_1_4th_2021.png
Figure S8. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 2, Part 1) in the link http://dersoft.com/All_figures/Case_2_1st_2021.png
Figure S9. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 2, Part 2) in the link
http://dersoft.com/All_figures/Case_2_2nd_2021.png
Figure S10. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 2, Part 3) in the link
http://dersoft.com/All_figures/Case_2_3rd_2021.png
Figure S11. (Performance of MC Simulation by Percentage Error, 2021 Start, Case 2, Part 4) in the link
http://dersoft.com/All_figures/Case_2_4th_2021.png
Figure S12. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 1, Part 1) in the link
http://dersoft.com/All_figures/Case_1_1st_2022.png
Figure S13. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 1, Part 2) in the link
http://dersoft.com/All_figures/Case_1_2nd_2022.png
Figure S14. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 1, Part 3) in the link
http://dersoft.com/All_figures/Case_1_3rd_2022.png
Figure S15. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 1, Part 4) in the link
http://dersoft.com/All_figures/Case_1_4th_2022.png
Figure S16. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 2, Part 1) in the link
http://dersoft.com/All_figures/Case_2_1st_2022.png
Figure S17. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 2, Part 2) in the link
http://dersoft.com/All_figures/Case_2_2nd_2022.png
Figure S18. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 2, Part 3) in the link
http://dersoft.com/All_figures/Case_2_3rd_2022.png
Figure S19. (Performance of MC Simulation by Percentage Error, 2022 Start, Case 2, Part 4) in the link
http://dersoft.com/All_figures/Case_2_4th_2022.png
Figure S20. (Performance of MC Simulation with Constant Correlation by Percentage Error, 2021 Start, Case 1) in the link
http://dersoft.com/All_figures/MC_Sim_constant_corr_by_percentage_error_2021_start_case_1.png
Figure S21. (Performance of MC Simulation with Constant Correlation by Percentage Error, 2021 Start, Case 2) in the link
http://dersoft.com/All_figures/MC_Sim_constant_corr_by_percentage_error_2021_start_case_2.png
Figure S22. (Performance of MC Simulation with Constant Correlation by Percentage Error, 2022 Start, Case 1) in the link
http://dersoft.com/All_figures/MC_Sim_constant_corr_by_percentage_error_2022_start_case_1.png
Figure S23. (Performance of MC Simulation with Constant Correlation by Percentage Error, 2022 Start, Case 2) in the link
http://dersoft.com/All_figures/MC_Sim_constant_corr_by_percentage_error_2022_start_case_2.png
Figure S24. (Interest Rates) in the link http://dersoft.com/All_figures/IRs.png
Figure S25. (Plots of Observed Underlying Asset Prices and Exchange Rates 1/2) in the link http://dersoft.com/All_figures/Observed_prices_1.png
Figure S26. (Plots of Observed Underlying Asset Prices and Exchange Rates 2/2) in the link http://dersoft.com/All_figures/Observed_prices_2.png
Figure S27. (Plot of Observed Underlying Asset Prices and Exchange Rates in Multiple Years) in the link
http://dersoft.com/All_figures/Observed_prices_multiple_years.png
Figure S28. (Plot 2 of Observed Underlying Asset Prices and Exchange Rates in Multiple Years) in the link
http://dersoft.com/All_figures/Observed_prices_multiple_years_2.png
Figure S29. (Plot of Observed Underlying Asset Prices and Exchange Rates in Multiple Years w. Aligned Start) in the link
http://dersoft.com/All_figures/Observed_prices_multiple_years_aligned.png
Figure S30. (Plot 2 of Observed Underlying Asset Prices and Exchange Rates in Multiple Years w. Aligned Start) in the link
http://dersoft.com/All_figures/Observed_prices_multiple_years_aligned_2.png
Figure S31. (Observed Volatilities and Correlations 1/2) in the link
http://dersoft.com/All_figures/Observed_vol_and_corr.png
Figure S32. (Observed Volatilities and Correlations 2/2) in the link
http://dersoft.com/All_figures/Observed_vol_and_corr_2.png
Figure S33. (MC Simulation Paths 1/9) in the link
http://dersoft.com/All_figures/Simulation_paths_1.png
Figure S34. (MC Simulation Paths 2/9) in the link
http://dersoft.com/All_figures/Simulation_paths_2.png
Figure S35. (MC Simulation Paths 3/9) in the link
http://dersoft.com/All_figures/Simulation_paths_3.png
Figure S36. (MC Simulation Paths 4/9) in the link
http://dersoft.com/All_figures/Simulation_paths_4.png
Figure S37. (MC Simulation Paths 5/9) in the link
http://dersoft.com/All_figures/Simulation_paths_5.png
Figure S38. (MC Simulation Paths 6/9) in the link
http://dersoft.com/All_figures/Simulation_paths_6.png
Figure S39. (MC Simulation Paths 7/9) in the link
http://dersoft.com/All_figures/Simulation_paths_7.png
Figure S40. (MC Simulation Paths 8/9) in the link
http://dersoft.com/All_figures/Simulation_paths_8.png
Figure S41. (MC Simulation Paths 9/9) in the link
http://dersoft.com/All_figures/Simulation_paths_9.png
Figure S42. (MC Simulation Parameters for 2021-2022) in the link
http://dersoft.com/All_figures/MC_Sim_params_start_2021.png
Figure S43. (MC Simulation Parameters for 2022-2023) in the link
http://dersoft.com/All_figures/MC_Sim_params_start_2022.png