Optimization of the Enhanced Index Model

Qian Yao

doi:10.4236/ojapps.2025.153039

Open Journal of Applied Sciences > Vol.15 No.3, March 2025

Optimization of the Enhanced Index Model

Qian Yao
School of Mathematics and Statistics, Shandong Normal University, Jinan, China.
DOI: 10.4236/ojapps.2025.153039 PDF HTML XML 29 Downloads 141 Views

Abstract

With the development of the domestic economy and the increase in household income, the demand for investment has been growing, and funds are widely favored for their safety and flexibility. Enhanced index funds combine the advantages of both passive and active management, with the potential to outperform the market and reduce tracking errors, attracting the attention of many investors. To address the risk that tracking portfolios may incur significant losses due to market index declines, this paper proposes the introduction of a non-parametric Mean Absolute Deviation (MAD) as a downside risk constraint in the enhanced index model, aiming to effectively control the downside risk of the tracking portfolio. Firstly, the study uses a non-parametric method to estimate the MAD and proves that this estimator is a convex function of the portfolio position. Secondly, an enhanced index model is constructed under the MAD constraint, where the objective function consists of a weighted sum of tracking error and excess return. Specifically, we use downside risk to measure tracking error. Finally, it is proven that the model is a convex optimization problem. Empirical research shows that the enhanced index model proposed in this paper, which considers the non-parametric MAD constraint, effectively controls downside risk.

Keywords

Enhanced Index Model, Mean Absolute Deviation, Downside Risk

Share and Cite:

Yao, Q. (2025) Optimization of the Enhanced Index Model. Open Journal of Applied Sciences, 15, 604-618. doi: 10.4236/ojapps.2025.153039.

1. Introduction

Traditionally, index-based fund management strategies are broadly divided into passive management and active management. Fund managers implementing a passive management strategy aim to replicate the performance of a specific financial market index (the so-called benchmark) as closely as possible, such as the CSI 300 or the CSI 500. This strategy is known as index tracking, and it seeks to mimic the market index by selecting a subset of stocks from the benchmark, thereby minimizing a function that measures how closely the portfolio tracks its benchmark index (tracking error). Fund managers implementing an active management strategy aim to outperform the benchmark. This strategy involves analyzing a company’s financial condition, industry prospects, market trends, and other factors to predict the future performance of a stock, selecting stocks with better future prospects to construct a portfolio with the goal of outperforming the benchmark index. Additionally, [1] have shown that a significant number of actively managed funds fail to outperform their benchmark over the long term. Therefore, fund managers typically prefer to adopt a hybrid strategy, often using a passive strategy to manage the majority of the fund’s investments, while employing an active strategy to manage a limited portion of the investments (see [2]).

Enhanced index tracking is an investment strategy aimed at achieving higher returns than the benchmark index (excess returns) while minimizing tracking errors. Therefore, the Enhanced Index Tracking Problem (EITP) seeks to minimize tracking error while maximizing excess returns above the benchmark. This investment strategy is an effective combination of passive and active management, providing relatively stable returns, which has attracted the attention of many scholars.

In recent years, many scholars have developed models and solved the Enhanced Index Tracking Portfolio Problem. [3] proposed a related mixed-integer linear programming formulation for the enhanced index tracking problem, which includes transaction costs, constraints on the number of stocks that can be purchased, and limits on the total transaction costs incurred. They provided numerical results using a standard solver (Cplex). [4] proposed a large-scale linear optimization model for enhanced index tracking, which selects the optimal portfolio based on a new stochastic dominance criterion and designed an effective constraint generation technique to solve the model. [5] proposed a partial replication strategy to construct a risk-averse enhanced index fund. By defining asset returns and return covariance terms as random variables to account for parameter estimation risk, they developed a stochastic mixed-integer nonlinear model. [6] presented an empirical study analyzing the effectiveness of a portfolio selection model based on second-order stochastic dominance (SSD) in the context of enhanced indexing.

The goal of EITP is to minimize tracking error while maximizing excess returns relative to the benchmark. Therefore, this problem is essentially a multi-objective optimization problem. [7] proposed a multi-objective optimization approach for EITP, providing a framework where the objectives are defined as maximizing the degree of outperformance relative to the benchmark and minimizing the cumulative error of underperformance, with transaction costs restricted in the constraints. The paper introduced a disturbance-resistant multi-objective optimization algorithm to solve the enhanced index tracking problem. [8] proposed a linear bi-objective optimization method, which maximizes the average excess return of the portfolio relative to the benchmark during the learning phase and minimizes the maximum downside deviation of portfolio returns from the market index, solving it efficiently to optimality using standard linear programming techniques. [9] proposed a bi-objective mixed-integer linear programming formulation, provided computational results for a set of benchmark instances, and then designed a heuristic process to approximate the Pareto optimal solution set.

The multi-objective optimization model results in a set of near-optimal solutions, and the specific solution still requires subjective selection by the decision-maker. Therefore, many scholars have considered converting the bi-objective problem into a single-objective problem. [10] made an appropriate trade-off between the objective functions of tracking error and excess return in the enhanced index tracking problem, and then solved the problem in two steps. First, they selected stocks that statistically represent the index and limited the number of stocks in the tracking portfolio by considering a subset of stocks. Second, the allocation of the tracking portfolio sets the weights for each stock. Other scholars have also established single-objective models based on the ratio of tracking error to excess return. [11] applied the Omega ratio for the first time and proposed two optimization models, showing that both models can be converted into linear programming models.

In addition, some models and methods proposed in the literature have considered risk control to some extent. [12] were the first to attempt using the return-to-risk ratio in the context of enhanced indexing. The authors introduced a nonlinear optimization model based on maximizing the modified Sortino ratio and solved it using a genetic algorithm. [13] were the first to apply the theoretical framework of the risk-return ratio model to the enhanced index tracking problem. They proposed a novel bi-criteria optimization model based on Conditional Value-at-Risk (CVaR), using the risk-return ratio as the objective. [14] used the two-tail mixed Conditional Value-at-Risk (TMCVaR) measure for index tracking. [15] pointed out that when using index-based investment strategies for portfolio management, the tracking portfolio also suffers losses when the target index declines. Therefore, it is necessary to incorporate downside risk constraints into the enhanced index model. CVaR was introduced as a constraint in the general index tracking model to control the downside risk of the portfolio composed of the benchmark index component stocks.

This paper considers that in enhanced index tracking investments, investors seek to have portfolio returns exceed the benchmark index returns while avoiding portfolio returns falling below the benchmark index returns. Based on [10], we have constructed an enhanced index model with the weighted sum of tracking error and excess returns as the objective function. In particular, to better meet the needs of investors, we use downside risk to measure tracking error. According to [15], we incorporate the Mean Absolute Deviation (MAD) as a lower bound constraint to effectively control the downside risk of the tracking portfolio. Compared to more complex measurement methods, MAD is simple, robust, and easy to implement, offering significant advantages in controlling downside risk and preventing large losses.

The remainder of this paper is organized as follows. In Section 2, we use a non-parametric method to derive an estimator for the MAD. In Section 3, we prove that the non-parametric MAD estimator is a convex function of portfolio positions. In Section 4, We have developed an enhanced index model with the MAD constraint and proved that the model is a convex optimization problem. In Section 5, we conducted an empirical study that specifically analyzes the model’s ability to control downside risk. Section 6 provides a conclusion.

2. Non-Parametric Estimation of the MAD

Let the asset return be a random variable $X$ , and the target return $α$ is a value set in advance based on the investor’s risk preference or wealth status, typically taken as 0, the risk-free rate, or the expected return. The Mean Absolute Deviation (MAD) can be defined as

$\begin{matrix} {MAD}_{α} (X) = E [| α - X |] \\ = \int_{- \infty}^{\infty} | α - x | f (x) d x \\ = \int_{- \infty}^{α} (α - x) f (x) d x + \int_{α}^{\infty} (x - α) f (x) d x . \end{matrix}$ (2.1)

To obtain the analytical expression of the MAD in Equation (2.1), the density function of asset returns must be defined. However, in practice, the density function is usually unknown and must be estimated from historical return data. Common estimation methods include parametric, semi-parametric, and non-parametric approaches. Parametric and semi-parametric methods assume a specific distribution and estimate its parameters, but they depend on model assumptions, which may introduce biases. In contrast, non-parametric methods avoid assumptions and estimate the distribution directly from historical data. This typically provides more accurate and reliable risk assessments. Let $x_{t}, t = 1, 2, \dots, T$ be the sample of $X$ , then the non-parametric kernel estimate of $f (x)$ is

$\hat{f} (x) = \frac{1}{T h} \sum_{t = 1}^{T} k (\frac{x - x_{t}}{h}) .$ (2.2)

$k (y)$ is the kernel function, and $h$ is the bandwidth, where the Gaussian kernel function is $k (y) = {(2 π)}^{- 1 / 2} e^{- y^{2} / 2}$ , and the bandwidth can be selected according to the algorithm rules.

$h = c_{0} \hat{σ} (X) = c_{0} \sqrt{\frac{1}{T - 1} \sum_{t = 1}^{T} {(x_{t} - \bar{x})}^{2}} .$ (2.3)

where $c_{0} = 1.06 \times T^{- 1 / 5}$ is a constant, and $\bar{x} = \frac{1}{T} \sum_{t = 1}^{T} x_{t}$ . The non-parametric estimator of the MAD is then given by

$\begin{matrix} \hat{{MAD}_{α}} (X) = \int_{- \infty}^{\infty} | α - x | f (x) d x \\ = \int_{- \infty}^{α} (α - x) f (x) d x + \int_{α}^{\infty} (x - α) f (x) d x \end{matrix}$

$\begin{matrix} = \int_{- \infty}^{α} (α - x) \frac{1}{T h} \sum_{t = 1}^{T} k (\frac{x - x_{t}}{h}) d x + \int_{α}^{\infty} (x - α) \frac{1}{T h} \sum_{t = 1}^{T} k (\frac{x - x_{t}}{h}) d x \\ = \frac{1}{T} \sum_{t = 1}^{T} \int_{- \infty}^{\frac{α - x_{t}}{h}} (α - x_{t} - h y) k (y) d y + \frac{1}{T} \sum_{t = 1}^{T} \int_{\frac{α - x_{t}}{h}}^{\infty} (x_{t} + h y - α) k (y) d y . \end{matrix}$ (2.4)

3. Convexity of the Non-Parametric MAD Estimator in Portfolio Positions

Let the return of a stock index in the market be a random variable $r_{t}$ . This index consists of $N$ constituent stocks, and a tracking portfolio is constructed using $n (n \leq N)$ of these constituent stocks. Let $r = {(r_{1}, r_{2}, \dots, r_{n})}^{⊤}$ be the return vector of the $n$ constituent stocks, and $a = {(a_{1}, a_{2}, \dots, a_{n})}^{⊤}$ be the portfolio weights invested in the $n$ constituent stocks. Then, the return of the tracking portfolio is $a^{⊤} r$ . Let ${r_{t}}_{t = 1}^{⊤}$ and ${r_{I, t}}_{t = 1}^{⊤}$ represent the return samples of the $n$ constituent stocks and the index, respectively, where $r_{t} = {(r_{1 t}, r_{2 t}, \dots, r_{n t})}^{⊤}$ . Then, the return sample of the tracking portfolio is $a^{⊤} r_{t}$ , where $t = 1, 2, \dots, T$ .

Let $X = a^{⊤} r$ and $x_{t} = a^{⊤} r_{t}$ . Then, according to Equation (2.4), the non-parametric estimator of the MAD for the tracking portfolio is given by

$\begin{matrix} \hat{{MAD}_{α}} (a^{⊤} r) = \frac{1}{T} \sum_{t = 1}^{T} \int_{- \infty}^{\frac{α - a^{⊤} r_{t}}{h}} (α - a^{⊤} r_{t} - h y) k (y) d y \\ + \frac{1}{T} \sum_{t = 1}^{T} \int_{\frac{α - a^{⊤} r_{t}}{h}}^{\infty} (a^{⊤} r_{t} + h y - α) k (y) d y . \end{matrix}$

Let $ξ_{t} = \frac{α - x_{t}}{h}$ , $Φ_{0} (ξ_{t}) = \int_{- \infty}^{ξ_{t}} k (y) d y$ , $Φ_{1} (ξ_{t}) = \int_{- \infty}^{ξ_{t}} y k (y) d y$ , ${Φ^{'}}_{0} (ξ_{t}) = \int_{ξ_{t}}^{\infty} k (y) d y$ , ${Φ^{'}}_{1} (ξ_{t}) = \int_{ξ_{t}}^{\infty} y k (y) d y$ . The above expression can be simplified as

$\begin{matrix} \hat{{MAD}_{α}} (a^{⊤} r) = - \frac{1}{T} \sum_{t = 1}^{T} [(a^{⊤} r_{t} - α) Φ_{0} (ξ_{t}) + h Φ_{1} (ξ_{t})] \\ + \frac{1}{T} \sum_{t = 1}^{T} [(a^{⊤} r_{t} - α) {Φ^{'}}_{0} (ξ_{t}) + h {Φ^{'}}_{1} (ξ_{t})] . \end{matrix}$ (3.1)

The bandwidth is determined according to Equation (2.3).

$\begin{matrix} h = c_{0} \hat{σ} (a^{⊤} r) = c_{0} \sqrt{\frac{1}{T - 1} \sum_{t = 1}^{T} {(a^{⊤} r_{t} - a^{⊤} \bar{r})}^{2}} \\ = c_{0} \sqrt{\frac{1}{T - 1} \sum_{t = 1}^{T} a^{⊤} (r_{t} - \bar{r}) {(r_{t} - \bar{r})}^{⊤} a} \\ = c_{0} \sqrt{a^{⊤} \hat{Σ} a} . \end{matrix}$ (3.2)

where $\hat{Σ} = \frac{1}{T - 1} \sum_{t = 1}^{T} (r_{t} - \bar{r}) {(r_{t} - \bar{r})}^{⊤}$ , $\bar{r} = \frac{1}{T} \sum_{t = 1}^{T} r_{t}$ .

Lemma 3.1. The bandwidth $h = c_{0} \sqrt{a^{⊤} \hat{Σ} a}$ is a convex function of the portfolio position $a$ .

Proof. The derivative of $h$ with respect to $a$ is

$\frac{\partial h}{\partial a} = C_{0} \frac{\hat{Σ} a}{\sqrt{a^{⊤} \hat{Σ} a}}$

Further, taking the derivative with respect to $a^{⊤}$

$\begin{matrix} \frac{\partial h}{\partial a \partial a^{⊤}} = C_{0} \frac{\hat{Σ} {(a^{⊤} \hat{Σ} a)}^{\frac{1}{2}} - \hat{Σ} a {(a^{⊤} \hat{Σ} a)}^{- \frac{1}{2}} a^{⊤} \hat{Σ}}{a^{⊤} \hat{Σ} a} \\ = C_{0} \frac{\hat{Σ} a \hat{Σ} a - \hat{Σ} a a^{⊤} \hat{Σ}}{{(a^{⊤} \hat{Σ} a)}^{\frac{3}{2}}} . \end{matrix}$

It follows that $a^{⊤} Σ a \geq 0$ . Let $\hat{Σ} = p p^{⊤}$ , and since

$\begin{array}{l} X^{⊤} (\hat{Σ} a \hat{Σ} a - \hat{Σ} a a^{⊤} \hat{Σ}) X \\ = X^{⊤} \hat{Σ} a \hat{Σ} a X - X^{⊤} \hat{Σ} a a^{⊤} \hat{Σ} X \\ = X^{⊤} p p^{⊤} a p p^{⊤} a X - X^{⊤} p p^{⊤} a a^{⊤} p p^{⊤} X \\ = {(p^{⊤} a)}^{⊤} (p^{⊤} a) {(p^{⊤} X)}^{⊤} (p X) - {({(p^{⊤} a)}^{⊤} p^{⊤} X)}^{2} \\ = {‖ p^{⊤} a ‖}^{2} {‖ p^{⊤} X ‖}^{2} - {(p^{⊤} a \cdot p^{⊤} X)}^{2} \geq 0 \end{array}$

Therefore, it follows that $\frac{\partial h}{\partial a \partial a^{⊤}}$ is a positive semi-definite matrix, i.e., the bandwidth $h = c_{0} \sqrt{a^{⊤} \hat{Σ} a}$ is a convex function of the portfolio position $a$ .

Proposition 3.1. The non-parametric estimator of the MAD, $\hat{M A D_{α}} (a^{⊤} r)$ , is a convex function of the portfolio position $a$ .

Proof. According to Equation (3.1), let

$\begin{array}{l} F (a) = {\hat{MAD}}_{α} (a^{⊤} r) = - \frac{1}{T} \sum_{t = 1}^{T} [(a^{⊤} r_{t} - α) Φ_{0} (ξ_{t}) + h Φ_{1} (ξ_{t})] \\ + \frac{1}{T} \sum_{t = 1}^{T} [(a^{⊤} r_{t} - α) {Φ^{'}}_{0} (ξ_{t}) + h {Φ^{'}}_{1} (ξ_{t})] . \end{array}$ (3.3)

Take the derivative of both sides of Equation (3.3) with respect to $ξ_{i}$ , and based on the definition of $ξ_{i}$ , we get

$\begin{matrix} \frac{\partial F (a)}{\partial ξ_{t}} = - \frac{2}{T} \sum_{t = 1}^{T} ((a^{⊤} r_{t} - α) k (ξ_{t}) + h ξ_{t} k (ξ_{t})) \\ = - \frac{2}{T} \sum_{t = 1}^{T} k (ξ_{t}) (a^{⊤} r_{t} - α + h ξ_{t}) = 0 \end{matrix}$ (3.4)

Take the derivative of $ξ_{t}$ with respect to $a$ , and based on the definition of $ξ_{t}$ , we get

$\frac{\partial ξ_{t}}{\partial a} = - \frac{1}{h} r_{t} - \frac{α - a^{⊤} r_{t}}{h^{2}} \frac{\partial h}{\partial a} = - \frac{1}{h} r_{t} - \frac{ξ_{t}}{h} \frac{\partial h}{\partial a}$ (3.5)

Take the derivative of both sides of Equation (3.3) with respect to $a$ , and using Equation (3.4), we get

$\begin{matrix} \frac{\partial F (a)}{\partial a} = - \frac{1}{T} \sum_{t = 1}^{T} (Φ_{0} (ξ_{t}) r_{t} + Φ_{1} (ξ_{t}) \frac{\partial h}{\partial a}) \\ + \frac{1}{T} \sum_{t = 1}^{T} ({Φ^{'}}_{0} (ξ_{t}) r_{t} + {Φ^{'}}_{1} (ξ_{t}) \frac{\partial h}{\partial a}) \end{matrix}$ (3.6)

Furthermore, take the derivative of both sides of Equation (3.6) with respect to $a$ , and using Equation (3.5), we get

$\begin{matrix} \frac{\partial^{2} F (a)}{\partial a \partial a^{⊤}} = - \frac{1}{T} \sum_{t = 1}^{T} (k (ξ_{t}) r_{t} \frac{\partial ξ_{t}}{\partial a^{⊤}} + ξ_{t} k (ξ_{t}) \frac{\partial h}{\partial a} \frac{\partial ξ_{t}}{\partial a^{⊤}} + Φ_{1} (ξ_{t}) \frac{\partial^{2} h}{\partial a \partial a^{⊤}}) \\ + \frac{1}{T} \sum_{t = 1}^{T} (- k (ξ_{t}) r_{t} \frac{\partial ξ_{t}}{\partial a^{⊤}} - ξ_{t} k (ξ_{t}) \frac{\partial h}{\partial a} \frac{\partial ξ_{t}}{\partial a^{⊤}} + {Φ^{'}}_{1} (ξ_{t}) \frac{\partial^{2} h}{\partial a \partial a^{⊤}}) \\ = - \frac{2}{T} \sum_{t = 1}^{T} k (ξ_{t}) (r_{t} + ξ_{t} \frac{\partial h}{\partial a}) {\frac{\partial ξ_{t}}{\partial a}}^{⊤} \\ - \frac{1}{T} \frac{\partial^{2} h}{\partial a \partial a^{⊤}} \sum_{t = 1}^{T} Φ_{1} (ξ_{t}) + \frac{1}{T} \frac{\partial^{2} h}{\partial a \partial a^{⊤}} \sum_{t = 1}^{T} {Φ^{'}}_{1} (ξ_{t}) \\ = \frac{2}{T} \sum_{t = 1}^{T} k (ξ_{t}) \frac{\partial ξ_{t}}{\partial a} {\frac{\partial ξ_{t}}{\partial a}}^{⊤} - \frac{1}{T} \frac{\partial^{2} h}{\partial a \partial a^{⊤}} \sum_{t = 1}^{T} Φ_{1} (ξ_{t}) + \frac{1}{T} \frac{\partial^{2} h}{\partial a \partial a^{⊤}} \sum_{t = 1}^{T} {Φ^{'}}_{1} (ξ_{t}) . \end{matrix}$ (3.7)

Since the kernel function $k (ξ_{t}) \geq 0$ , the sample size $T > 0$ , and the bandwidth $h > 0$ , it follows that $\frac{2 h}{T} \sum_{t = 1}^{T} k (ξ_{t}) \frac{\partial ξ_{t}}{\partial a} {\frac{\partial ξ_{t}}{\partial a}}^{⊤}$ is a positive semi-definite matrix. The function $k (y)$ is the density function of the standard normal distribution, and after simple derivation, we obtain

$Φ_{1} (ξ_{t}) = \int_{- \infty}^{ξ_{t}} y k (y) d y = - k (ξ_{t}) \leq 0$

${Φ^{'}}_{1} (ξ_{t}) = \int_{- \infty}^{ξ_{t}} y k (y) d y = k (ξ_{t}) \geq 0$

Since $- \frac{1}{T} < 0$ , $Φ_{1} (ξ_{t}) \leq 0$ , ${Φ^{'}}_{1} (ξ_{t}) \geq 0$ , and according to Lemma 3.1, $\frac{\partial^{2} h}{\partial a \partial a^{⊤}}$ is a positive semi-definite matrix, it follows that the last two terms are also positive semi-definite matrices. Therefore, combining everything, we conclude that $\frac{\partial^{2} F (a)}{\partial a \partial a^{⊤}}$ is a positive semi-definite matrix, meaning that $F (a)$ is a convex function of $a$ .

4. The Enhanced Index Model under MAD Constraint

In this section, we construct the enhanced index model and discuss the objective function and constraints of the model. This paper emphasizes that traditional enhanced index models do not include a downside risk constraint, which may lead to the risk of the tracking portfolio deviating negatively from the benchmark index, a major concern in the current Chinese market. Therefore, we incorporate a constraint based on the MAD into the model. We also prove that the enhanced index model with the non-parametric MAD constraint is a convex optimization problem.

For the enhanced index tracking problem, our goal is to generate a portfolio that seeks to achieve relatively high excess returns while minimizing tracking error. Tracking error refers to the difference between the actual returns of the portfolio and the returns of the benchmark index. This difference can be adjusted according to specific circumstances and preferences, for example, by using metrics such as mean squared error, root mean squared error, downside risk, or other risk measures. In enhanced index investing, investors expect the portfolio to outperform the benchmark index, rather than merely tracking it. Therefore, we consider using downside risk to measure tracking error, and define excess return as the average difference between the portfolio’s actual return and the benchmark index’s return, which better aligns with the risk perception of enhanced index investors. The objective of the enhanced index model is to minimize the linear combination of tracking error $T E$ and excess return $E R$

$λ T E - (1 - λ) E R = λ {(\sum_{t = 1}^{T} ω_{t} {(max (r_{I, t} - a^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} - (1 - λ) \sum_{t = 1}^{T} ω_{t} (a^{⊤} r_{t} - r_{I, t}) .$ (4.1)

where $ω_{t}$ represents the probability of the t-th outcome, typically taken as equal probability, i.e., $ω_{t} = \frac{1}{T}$ . $γ$ is any positive integer greater than zero, and different values of $γ$ can be set. When $γ = 2$ , the tracking error is the Lower Partial Deviation. The $γ$ -th power of the tracking error is used to eliminate the influence of dimensionality, ensuring that the units of the tracking error and excess return are consistent.

We introduce the MAD of the tracking portfolio to control the downside risk. By embedding the non-parametric estimator of MAD from Equation (3.1) into model (4.1), and assuming that the maximum downside risk the investor can bear is $v$ , we obtain the enhanced index model based on the non-parametric MAD constraint. We require that the portfolio weights $a = {(a_{1}, a_{2}, \dots, a_{n})}^{T}$ invested in $n$ constituent stocks do not involve short positions, i.e., $a_{i} \geq 0, \forall i = 1, \dots, n$ , and that the investment weights in each stock are normalized, i.e., $\sum_{i = 1}^{n} a_{i} = 1$ .

$P (γ, λ) = {\begin{array}{l} min_{a \in ℝ^{n}} λ {(\sum_{t = 1}^{T} ω_{t} {(max (r_{I, t} - a^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} - (1 - λ) \sum_{t = 1}^{T} ω_{t} (a^{⊤} r_{t} - r_{I, t}) . \\ s . t . \\ \begin{array}{l} \hat{{MAD}_{α}} (a^{⊤} r) = - \frac{1}{T} \sum_{t = 1}^{T} [(a^{⊤} r_{t} - α) Φ_{0} (ξ_{t}) + h Φ_{1} (ξ_{t})] \\ + \frac{1}{T} \sum_{t = 1}^{T} [(a^{⊤} r_{t} - α) {Φ^{'}}_{0} (ξ_{t}) + h {Φ^{'}}_{1} (ξ_{t})] \leq v . \end{array} \end{array}$

Theorem 4.1. For any positive integer $γ \geq 1$ , if the feasible set $Ω$ is non-empty, the enhanced index model $P (γ, λ)$ based on the non-parametric MAD is a convex optimization problem.

Proof. In the model $P (γ, λ)$ , besides the non-parametric Mean Absolute Deviation (MAD) constraint, all other constraints are linear, and the set of linear constraints is necessarily a convex set. According to Theorem 3.1, $\hat{{MAD}_{α}} (a^{⊤} r)$ is a convex function of the portfolio position $a$ . According to optimization theory, the lower level set of a convex function is a convex set. Therefore, the constraint set of the non-parametric MAD, ${MAD}_{α} (a^{⊤} r) \leq ν$ , is a convex set, and thus the feasible set $Ω$ of model $P (γ, λ)$ is a convex set. The objective function consists of two parts, with the second part being a linear function of the decision variable $a$ , and thus also a convex function of $a$ . Therefore, the following key result is to prove that $f (a) = {(\sum_{t = 1}^{T} ω_{t} {(max (r_{I, t} - a^{⊤} r_{t}, 0))}^{γ})}^{1 / γ}$ is a convex function of $a$ . To this end, we first present Lemma 4.1 and Lemma 4.2.

Lemma 4.1 (Minkowski Inequality). If $x_{t}, y_{t} > 0, t = 1, 2, \dots, T$ and $γ \geq 1$ , then the following holds

${(\sum_{t = 1}^{T} {(x_{t} + y_{t})}^{γ})}^{1 / γ} \leq {(\sum_{t = 1}^{T} x_{t}^{γ})}^{1 / γ} + {(\sum_{t = 1}^{T} y_{t}^{γ})}^{1 / γ} .$

Lemma 4.2 (Triangle Inequality). If $x, y \in ℝ$ , then the following holds

$| x + y | \leq | x | + | y | .$

Proof. For any two decision vectors $a_{1}$ and $a_{2}$ , and any real number $κ \in [0, 1]$ , based on Lemma 4.1 and Lemma 4.2, we have

$\begin{array}{l} f (κ a_{1} + (1 - κ) a_{2}) \\ = {(\sum_{t = 1}^{T} ω_{t} {(max (r_{I, t} - {(κ a_{1} + (1 - κ) a_{2})}^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} \\ = {(\sum_{t = 1}^{T} ω_{t} {(\frac{r_{I, t} - {(κ a_{1} + (1 - κ) a_{2})}^{⊤} r_{t}}{2} + | \frac{r_{I, t} - {(κ a_{1} + (1 - κ) a_{2})}^{⊤} r_{t}}{2} |)}^{γ})}^{1 / γ} \\ = (\sum_{t = 1}^{T} ω_{t} (\frac{κ (r_{I, t} - a_{1}^{⊤} r_{t})}{2} + \frac{| κ (r_{I, t} - a_{1}^{⊤} r_{t}) |}{2} \\ + {{\frac{(1 - κ) (r_{I, t} - a_{2}^{⊤} r_{t})}{2} + \frac{| (1 - κ) (r_{I, t} - a_{2}^{⊤} r_{t}) |}{2})}^{γ})}^{1 / γ} \\ \leq {(\sum_{t = 1}^{T} ω_{t} {(κ \max (r_{I, t} - a_{1}^{⊤} r_{t}, 0) + (1 - κ) max (r_{I, t} - a_{2}^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} \\ = {(\sum_{t = 1}^{T} {(\sqrt[γ]{ω_{t}} κ \max (r_{I, t} - a_{1}^{⊤} r_{t}, 0) + \sqrt[γ]{ω_{t}} (1 - κ) max (r_{I, t} - a_{2}^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} \\ \leq {(\sum_{t = 1}^{T} {(\sqrt[γ]{ω_{t}} κ \max (r_{I, t} - a_{1}^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} + {(\sum_{t = 1}^{T} {(\sqrt[γ]{ω_{t}} (1 - κ) max (r_{I, t} - a_{2}^{⊤} r_{t}, 0))}^{γ})}^{1 / γ} \\ = κ f (a_{1}) + (1 - κ) f (a_{2}) . \end{array}$

Therefore, $f (a)$ and $λ f (a)$ are convex functions of the portfolio position $a$ .

5. Empirical Analysis

To further evaluate the performance of the enhanced index model proposed in this paper in real financial markets, we conducted an empirical analysis using the CSI 300 Index and its constituent stocks. The data is sourced from the baostock economic and financial database, specifically collecting daily closing price data of the CSI 300 Index and its constituent stocks from January 1, 2015, to December 31, 2024. The time trend of the CSI 300 Index is shown in Figure 1. To test the model’s risk control ability during the downtrend of the index, this paper focuses on three downtrend periods of the index: February 20, 2017, to April 3, 2019 (Period 1), June 3, 2021, to May 5, 2023 (Period 2), and July 23, 2020, to February 4, 2021 (Period 3).

Figure 1. CSI 300 index trend and long-term downturn periods.

After performing the logarithmic difference, the CSI 300 index returns data (in %) are obtained. Table 1 presents the descriptive statistics of the index returns data for the three sample periods. From the mean and median values, the overall performance in all three periods is poor, indicating a downtrend. From the maximum, minimum, and standard deviation values, Period 1 and Period 3 show high volatility and risk, indicating significant market weakness. From the skewness, kurtosis, and JB statistic, the return distributions in all three periods deviate from normal distribution, indicating frequent extreme fluctuations in these periods. From the lower partial moment, Period 1 has the highest downside risk, while Periods 2 and 3 have relatively lower downside risks.

Table 1. Descriptive statistics table.

Indicator	Period Length	Mean	Median	Standard Deviation	Minimum	Maximum	Skewness	Kurtosis	JB Statistic	Lower Partial Moment
Period 1	519	−0.0744	−0.0775	1.4614	−17.9352	7.9413	−3.2446	4.4193	430.6320	1.2700
Period 2	466	−0.0594	0.0000	0.9606	−6.0865	2.7895	−0.8387	4.3937	42.8552	0.1350
Period 3	134	−0.0398	−0.1930	1.1212	−3.9515	4.7067	0.6954	2.6010	48.2127	−0.0940

Next, we use the subgradient descent (SGD) to assess the model’s ability to control risk during the long-term downtrend of the index. Specifically, the data is divided into a training set (Period 1), a validation set (Period 3), and a test set (Period 2). This division ensures the temporal order of the data and allows the training, validation, and testing processes to more reasonably reflect the model’s effectiveness and stability.

In addition, the core of the enhanced index model is to track the index and generate excess returns using a small number of constituent stocks. Therefore, before optimizing the model, it is necessary to determine the set of constituent stocks that will be included in the tracking portfolio. To reduce the complexity of model solving, this paper focuses on the fundamental issues of the model itself, using the Beta values of the constituent stocks for stock selection, and tracking the index and generating excess returns using the selected stocks. Specifically, the Beta values of the constituent stocks in the CSI 300 index were calculated, and the 30 constituent stocks with Beta values closest to 1 were selected to track the index.

Since the enhanced index model can better achieve the goal of generating excess returns while tracking the index, the enhanced index model is used in the empirical analysis with $λ = 0.5$ . Additionally, to evaluate performance under different MAD constraints, three different risk constraint values are selected: $v = 1.1$ , $v = 1.2$ , and $v = 1.3$ . Based on the earlier definition of symbols, the return of the tracking portfolio is $a_{r} r_{t}$ , and the return of the index is $r_{I_{t}}$ , where $t = 1, 2, \dots, T$ . The following indicators are defined to compare the performance of the tracking portfolio: Excess Return ( $E R$ ), Information Ratio $I R = \frac{E R}{T E}$ , Standard Deviation $\hat{σ} = \sqrt{a^{⊤} \hat{Σ} a}$ , Sharpe Ratio $S R = \frac{E R}{\hat{σ}}$ , and Downside Risk based on the risk-free rate.

Table 2. Sample period indicators.

Sample	Indicator	Excess Return	Information Ratio	Standard Deviation	Sharpe Ratio	Downside Risk
Period 1	v = 0.05	0.0067	0.0967	0.0117	0.1371	0.0122
	v = 0.10	0.0088	0.1001	0.0083	0.1446	0.0075
	v = 0.15	0.0068	0.0997	0.0120	0.1343	0.0173
	Index	0.0000	0.0000	0.0123	0.0000	0.0324
Period 3	v = 0.05	0.0054	0.0209	0.0327	0.0290	0.0133
	v = 0.10	0.0083	0.0490	0.0130	0.0461	0.0127
	v = 0.15	0.0054	0.0202	0.0428	0.0278	0.0135
	Index	0.0000	0.0000	0.0246	0.0000	0.0168
Period 2	v = 0.05	0.0061	0.0209	0.0115	0.0760	0.0079
	v = 0.10	0.0090	0.0709	0.0112	0.0920	0.0076
	v = 0.15	0.0070	0.0675	0.0113	0.0870	0.0077
	Index	0.0000	0.0000	0.0189	0.0000	0.0084

Table 2 presents a comparison of the returns and risks of the tracking portfolio under different parameters $v$ for each sample period. From Table can be observed that in all sample periods, the investment strategy based on the enhanced index model consistently generates positive excess returns, information ratio, and positive Sharpe ratio. This indicates that, from the perspective of returns and risk-adjusted returns, the investment strategy developed in this paper outperforms the traditional index strategy. Further analysis of the standard deviation and downside risk shows that the standard deviation and downside risk of the investment strategy are lower than those of the benchmark index (except in Period 3). Specifically, in Period 3, when $v = 0.10$ , both the standard deviation and downside risk of the investment strategy are smaller than those of the benchmark index, suggesting that the investment strategy developed in this paper performs better in terms of risk control compared to the benchmark index.

Additionally, comparing the results for different $v$ values reveals that when $v = 0.10$ , the excess return is maximized, the standard deviation is minimized, and consequently, the information ratio and Sharpe ratio also reach their maximum values. Under this condition, the downside risk is minimized. Meanwhile, $v = 0.05$ and $v = 0.15$ yield similar results, both outperforming the benchmark index and showing an ability to control risk. This indicates that by introducing an appropriate risk constraint level, risk can be effectively controlled while generating excess returns. Therefore, the enhanced index model developed in this paper proves to be effective in investment management within real financial markets.

Figure 2 visually demonstrates the cumulative returns of the investment strategy when $v = 0.10$ across different sample periods. As can be seen from the figure, the cumulative returns of the tracking portfolio constructed in this paper

Figure 2. Comparison of portfolio cumulative returns and benchmark index cumulative returns.

closely follow the cumulative returns of the index, while exceeding the index returns. This indicates that the investment strategy developed in this paper achieves the investor’s objective by both closely tracking the index’s trend and generating excess returns. Specifically, in Periods 1 and 3, the downside risk is well controlled, as the cumulative return of the portfolio does not decline along with the index’s cumulative return. This suggests that our model is better at controlling downside risk.

6. Conclusion

In this paper, we consider the risk of significant losses in the tracking portfolio when the market index experiences a downward jump, as the tracking portfolio tends to follow the index trend. Therefore, we introduce a downside risk constraint into the traditional enhanced index model and construct an enhanced index model under the constraint of MAD, aiming to effectively control the downside risk of the tracking portfolio and achieve excess returns. We prove that the model is a convex optimization problem and optimize it using SGD. Empirical research shows that the enhanced index model proposed in this paper, which considers the non-parametric MAD constraint, can effectively control downside risk.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1]	Borch, K. (1960) An Attempt to Determine the Optimum Amount of Stop Loss Reinsurance. Transactions of the 16th International Congress of Actuaries, 1, 597-610.
[2]	Scowcroft, A. and Sefton, J. (2005) Understanding Momentum. Financial Analysts Journal, 61, 64-82. https://doi.org/10.2469/faj.v61.n2.2717
[3]	Canakgoz, N.A. and Beasley, J.E. (2009) Mixed-integer Programming Approaches for Index Tracking and Enhanced Indexation. European Journal of Operational Research, 196, 384-399. https://doi.org/10.1016/j.ejor.2008.03.015
[4]	Bruni, R., Cesarone, F., Scozzari, A. and Tardella, F. (2012) A New Stochastic Dominance Approach to Enhanced Index Tracking Problems. Economics Bulletin, 32, 3460-3470.
[5]	Lejeune, M.A. and Samatlı-Paç, G. (2013) Construction of Risk-Averse Enhanced Index Funds. INFORMS Journal on Computing, 25, 701-719. https://doi.org/10.1287/ijoc.1120.0533
[6]	Roman, D., Mitra, G. and Zverovich, V. (2013) Enhanced Indexation Based on Second-Order Stochastic Dominance. European Journal of Operational Research, 228, 273-281. https://doi.org/10.1016/j.ejor.2013.01.035
[7]	Li, Q., Sun, L. and Bao, L. (2011) Enhanced Index Tracking Based on Multi-Objective Immune Algorithm. Expert Systems with Applications, 38, 6101-6106. https://doi.org/10.1016/j.eswa.2010.11.001
[8]	Bruni, R., Cesarone, F., Scozzari, A. and Tardella, F. (2014) A Linear Risk-Return Model for Enhanced Indexation in Portfolio Optimization. OR Spectrum, 37, 735-759. https://doi.org/10.1007/s00291-014-0383-6
[9]	Filippi, C., Guastaroba, G. and Speranza, M.G. (2016) A Heuristic Framework for the Bi-Objective Enhanced Index Tracking Problem. Omega, 65, 122-137. https://doi.org/10.1016/j.omega.2016.01.004
[10]	Dose, C. and Cincotti, S. (2005) Clustering of Financial Time Series with Application to Index and Enhanced Index Tracking Portfolio. Physica A: Statistical Mechanics and its Applications, 355, 145-151. https://doi.org/10.1016/j.physa.2005.02.078
[11]	Guastaroba, G., Mansini, R., Ogryczak, W. and Speranza, M.G. (2016) Linear Programming Models Based on Omega Ratio for the Enhanced Index Tracking Problem. European Journal of Operational Research, 251, 938-956. https://doi.org/10.1016/j.ejor.2015.11.037
[12]	Meade, N. and Beasley, J.E. (2011) Detection of Momentum Effects Using an Index Out-Performance Strategy. Quantitative Finance, 11, 313-326. https://doi.org/10.1080/14697680903460135
[13]	Guastaroba, G., Mansini, R., Ogryczak, W. and Speranza, M.G. (2020) Enhanced Index Tracking with Cvar-Based Ratio Measures. Annals of Operations Research, 292, 883-931. https://doi.org/10.1007/s10479-020-03518-7
[14]	Goel, A., Sharma, A. and Mehra, A. (2018) Index Tracking and Enhanced Indexing Using Mixed Conditional Value-at-Risk. Journal of Computational and Applied Mathematics, 335, 361-380. https://doi.org/10.1016/j.cam.2017.12.015
[15]	Wang, M., Xu, C., Xu, F. and Xue, H. (2011) A Mixed 0-1 LP for Index Tracking Problem with Cvar Risk Constraints. Annals of Operations Research, 196, 591-609. https://doi.org/10.1007/s10479-011-1042-9

Journals Menu

Follow SCIRP

	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies