A Perturbation Analysis of Low-Rank Matrix Recovery by Schatten p-Minimization
1. Introduction
Low-rank matrix recovery (LMR) is a fast-growing field, attracting much attention in numerous applications such as quantum state tomography [1], deep learning [2], nonlinear system identification [3], computer vision [4], and medical imaging [5]. The low-rank matrix recovery problem is described mathematically as

$b = \mathcal{A}(X)$, (1)

where $X \in \mathbb{R}^{m \times n}$ is an unknown low-rank (or approximately low-rank) matrix to be recovered, $b \in \mathbb{R}^{M}$ is a known observation vector, and $\mathcal{A}: \mathbb{R}^{m \times n} \to \mathbb{R}^{M}$ is a given measurement operator (linear mapping) defined by

$\mathcal{A}(X) = \big(\mathrm{tr}(A_1^T X), \mathrm{tr}(A_2^T X), \ldots, \mathrm{tr}(A_M^T X)\big)^T$, (2)

where the $A_i \in \mathbb{R}^{m \times n}$ are named the measurement matrices, the superscript $T$ denotes the matrix transpose, and $\mathrm{tr}(\cdot)$ is the trace function. The main goal of LMR is to recover the low-rank matrix X on the basis of the observation vector b and the operator $\mathcal{A}$.
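As a concrete illustration, the measurement operator in (2) can be sketched in a few lines of Python; NumPy and the Gaussian measurement matrices below are illustrative assumptions, not part of the original model:

```python
import numpy as np

def measurement_operator(As, X):
    """Apply A(X) = (tr(A_1^T X), ..., tr(A_M^T X))^T."""
    return np.array([np.trace(A.T @ X) for A in As])

rng = np.random.default_rng(0)
m, n, M = 6, 5, 20
As = [rng.standard_normal((m, n)) for _ in range(M)]           # measurement matrices
X = rng.standard_normal((m, 2)) @ rng.standard_normal((2, n))  # a rank-2 matrix
b = measurement_operator(As, X)                                # observation vector in R^M

# tr(A^T X) equals the entrywise inner product <A, X>
assert np.allclose(b, [np.sum(A * X) for A in As])
```

Since the operator is linear, doubling X doubles the measurements, which is the property every result below relies on.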
In practice, the linear measurement b is affected by a noise vector y. The noisy LMR model is given by

$b = \mathcal{A}(X) + y$, (3)

where b is an observed measurement disturbed by the noise vector y, and y is additive noise that does not depend on X.
Moreover, many LMR models involve cases in which the observed vector b is perturbed by the noise vector y while, at the same time, the linear mapping $\mathcal{A}$ is hampered by a perturbation Φ, i.e., $\mathcal{A}$ is replaced by $\hat{\mathcal{A}} = \mathcal{A} + \Phi$, which brings about multiplicative noise Φ(X) associated with X. Such completely perturbed problems usually arise in a large number of applications, such as remote sensing [6], source separation [7], and telecommunication [8]. In order to obtain the optimal solution of this fully perturbed problem, a common approach is to solve a nuclear norm minimization (NNM) problem of the following form:

$\min_X \|X\|_*$ subject to $\|\hat{\mathcal{A}}(X) - b\|_2 \le \varepsilon$, (4)

where $\varepsilon$ represents the overall noise and $\|X\|_*$ is the trace (nuclear) norm of the matrix X, namely the sum of its singular values. When X and the measurement matrices are diagonal, problem (4) reduces to a compressed sensing problem

$\min_x \|x\|_1$ subject to $\|\hat{A}x - b\|_2 \le \varepsilon$, (5)

where $\|x\|_1$ is the $\ell_1$ norm of the vector x, in other words, the sum of the absolute values of the elements of x.
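To see this reduction concretely, here is a small numerical sketch (Python/NumPy; all matrices are illustrative): when X = diag(x) and every measurement matrix $A_i$ is diagonal, $\mathrm{tr}(A_i^T X)$ equals the inner product of the diagonal of $A_i$ with x, so the matrix measurements coincide with ordinary vector measurements.

```python
import numpy as np

rng = np.random.default_rng(6)
n, M = 5, 12
x = rng.standard_normal(n)
D = [np.diag(rng.standard_normal(n)) for _ in range(M)]  # diagonal measurement matrices
X = np.diag(x)                                           # diagonal unknown

b_matrix = np.array([np.trace(A.T @ X) for A in D])      # matrix-form measurements
Phi = np.array([np.diag(A) for A in D])                  # rows: diagonals of the A_i
b_vector = Phi @ x                                       # ordinary CS measurements
assert np.allclose(b_matrix, b_vector)
```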
Chartrand's study [9] revealed that nonconvex variants of (5) can produce accurate reconstruction with fewer measurements. Specifically, the $\ell_1$ norm minimization is substituted with the $\ell_p$ norm minimization:

$\min_x \|x\|_p^p$ subject to $\|\hat{A}x - b\|_2 \le \varepsilon$, (6)

in which $\|x\|_p^p = \sum_i |x_i|^p$ is the p-th power of the $\ell_p$-quasi-norm of the vector x ($0 < p \le 1$). Even though the $\ell_p$-quasi-norm is not a norm, its p-th power $\|\cdot\|_p^p$ satisfies the triangle inequality.
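A quick numerical check of this subadditivity, $\|x+y\|_p^p \le \|x\|_p^p + \|y\|_p^p$, on random inputs (a Python/NumPy sketch; p = 0.5 is an arbitrary choice):

```python
import numpy as np

def lp_p(x, p):
    """p-th power of the l_p quasi-norm: sum_i |x_i|^p."""
    return np.sum(np.abs(x) ** p)

rng = np.random.default_rng(1)
p = 0.5
for _ in range(200):
    x, y = rng.standard_normal(8), rng.standard_normal(8)
    # subadditivity of ||.||_p^p for 0 < p <= 1
    assert lp_p(x + y, p) <= lp_p(x, p) + lp_p(y, p) + 1e-12
```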
Numerous studies have focused on the recovery of a vector x by $\ell_p$ minimization ($0 < p < 1$) [10] [11] [12]. Chartrand [9] conducted numerical experiments using random and non-random Fourier measurements, which showed that fewer measurements are required for accurate restoration than when p = 1. Chartrand [13] extended the Restricted Isometry Property (RIP) result of Candès and Tao [14] to the case of $\ell_p$ minimization. Kong and Xiu [15] explored nonconvex relaxation methods for recovering the vector x. In summary, the case of (6) in which x is free of noise and perturbation ($0 < p \le 1$) extends to matrices, referred to as Schatten p-minimization:

$\min_X \|X\|_{S_p}^p$ subject to $\mathcal{A}(X) = b$. (7)
The related work [13] considers the scenario of matrix recovery with noise but without perturbation, i.e., where the linear mapping
is not interfered with by Φ. From an applied and practical perspective, it is crucial to investigate the problem of rank-r matrix recovery in the fully perturbed case.
Therefore, we present the fully perturbed model of LMR through nonconvex Schatten p-norm minimization ($0 < p \le 1$):

$\min_X \|X\|_{S_p}^p$ subject to $\|\hat{\mathcal{A}}(X) - b\|_2 \le \varepsilon_{\hat{\mathcal{A}}}$, (8)

where $\|X\|_{S_p} = \big(\sum_i \sigma_i^p(X)\big)^{1/p}$ is the Schatten p-norm of the matrix X; for $X \in \mathbb{R}^{m \times n}$ with singular value decomposition (SVD) $X = U \Sigma V^T$, $\sigma_i(X)$ denotes the i-th singular value of X. This model characterizes the problem of rank-r matrix recovery in a fully perturbed scenario, with the noise level $\varepsilon_{\hat{\mathcal{A}}}$ and the relative perturbation bounds introduced in Section 2 as parameters of the model.
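For concreteness, the Schatten p-(quasi-)norm in (8) can be computed directly from the singular values; the following Python sketch (NumPy assumed) also checks the familiar special cases p = 1 (nuclear norm) and p = 2 (Frobenius norm):

```python
import numpy as np

def schatten_p(X, p):
    """Schatten p-(quasi-)norm: (sum_i sigma_i(X)^p)^(1/p)."""
    s = np.linalg.svd(X, compute_uv=False)
    return np.sum(s ** p) ** (1.0 / p)

rng = np.random.default_rng(2)
X = rng.standard_normal((5, 4))
# p = 1 gives the nuclear (trace) norm, p = 2 the Frobenius norm
assert np.isclose(schatten_p(X, 1), np.linalg.svd(X, compute_uv=False).sum())
assert np.isclose(schatten_p(X, 2), np.linalg.norm(X, 'fro'))
```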
Unlike the earlier notion of restricted isometry constant, this paper follows the restricted p-isometry property (p-RIP) given by Zhang in [13], stated as follows:

Definition 1.1 (p-RIP of the measurement operator $\mathcal{A}$). For the measurement operator $\mathcal{A}$, a positive integer r and $0 < p \le 1$, the restricted p-isometry constant (p-RIC) of order r, denoted by $\delta_r$, is the smallest number such that for any matrix X of rank at most r,

$(1 - \delta_r)\|X\|_F^p \le \|\mathcal{A}(X)\|_p^p \le (1 + \delta_r)\|X\|_F^p$. (9)

If $0 < \delta_r < 1$, then $\mathcal{A}$ meets the restricted p-isometry property of order r.
The restricted isometry property (RIP) is an essential tool in the theoretical analysis of LMR. For exact LMR (model (1)) or noisy/partly perturbed LMR (model (7), i.e., Φ = 0), several sufficient conditions based on the RIC are known; see [16] [17].
Moreover, another crucial tool for analyzing low-rank matrix recovery is the null space property (NSP) of the linear mapping $\mathcal{A}$. Gao and Peng et al. [18] extended the general null space property to the p-null space property (p-NSP) for sparse vectors. We further extend it from sparse vector recovery to low-rank matrices; the notion is defined as follows:

Definition 1.2 (p-NSP of the measurement operator $\mathcal{A}$). The measurement operator $\mathcal{A}$ satisfies the p-null space property of order r with constants $0 < s < 1$ and $\tau > 0$ if for any matrix $X \in \mathbb{R}^{m \times n}$,

$\|X_r\|_{S_p}^p \le s\|X - X_r\|_{S_p}^p + \tau\|\mathcal{A}(X)\|_2^p$, (10)

where $X_r$ denotes the best rank-r approximation of X (see Section 2).
2. Symbols and Main Results
Before presenting the key results of this paper, we introduce some notation, similar to that of [19], which quantifies the disturbances Φ and y by relative upper bounds:

$\dfrac{\|\Phi\|}{\|\mathcal{A}\|} \le \varepsilon_{\mathcal{A}}, \qquad \dfrac{\|\Phi\|^{(r)}}{\|\mathcal{A}\|^{(r)}} \le \varepsilon_{\mathcal{A}}^{(r)}, \qquad \dfrac{\|y\|_2}{\|b\|_2} \le \varepsilon_b.$ (11)

Here $\|\mathcal{A}\|$ is the operator norm of the linear mapping $\mathcal{A}$ induced by the Schatten p-norm, and $\|\mathcal{A}\|^{(r)}$ is the corresponding norm of $\mathcal{A}$ restricted to the set of nonzero matrices of rank at most r. Furthermore, we quantify the relative rank-r approximation error of X by

(12)

where $X_r$ stands for the best rank-r approximation of X, whose singular values consist of the r largest singular values of X.
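The best rank-r approximation $X_r$ is obtained by truncating the SVD. A minimal Python sketch (NumPy assumed), together with a check of the Eckart–Young identity for the Frobenius error:

```python
import numpy as np

def best_rank_r(X, r):
    """Best rank-r approximation X_r: keep the r largest singular values."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U[:, :r] * s[:r]) @ Vt[:r, :]

rng = np.random.default_rng(3)
X = rng.standard_normal((6, 5))
Xr = best_rank_r(X, 2)
assert np.linalg.matrix_rank(Xr) == 2
# Eckart-Young: the residual's Frobenius norm is the tail of the singular values
s = np.linalg.svd(X, compute_uv=False)
assert np.isclose(np.linalg.norm(X - Xr, 'fro'), np.sqrt(np.sum(s[2:] ** 2)))
```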
Next, two theorems are obtained based on the restricted p-isometry property (p-RIP, Definition 1.1) and the p-null space property (p-NSP, Definition 1.2) defined in Section 1. The two theorems derive sufficient conditions guaranteeing stable and accurate recovery of rank-r matrices and offer upper bounds on the recovery error. They are described as follows:
Theorem 2.1: Let $\mathcal{A}$ be a linear operator and $0 < p \le 1$. Suppose the restricted p-isometry constant of the linear operator $\mathcal{A}$ fulfills

(13)

a general rank-r matrix X meets

(14)

and the overall noise satisfies

(15)

Let $X^*$ be the feasible solution of the fully perturbed Schatten p-norm minimization problem

(16)

Then the error estimate between X and $X^*$ fulfills

(17)

where the constants depend on p, r, and the perturbation bounds in (11).
Theorem 2.2: For a given $0 < p \le 1$, suppose that the linear measurement operator $\hat{\mathcal{A}}$ fulfills the p-null space property (p-NSP) with constants s and τ, and that conditions (13) and (14) hold. Let $X^*$ be the feasible solution of the completely perturbed Schatten p-norm minimization problem

(18)

Then the error estimate between X and $X^*$ satisfies

(19)

where the constants depend on s, τ, and the perturbation bounds in (11).
3. Proof of Key Results
Proofs of our key results are presented in this section. To prove Theorem 2.1 and Theorem 2.2 we need the following five lemmas and their proofs. We start with a lemma concerning the Schatten p-norm.
Lemma 3.1: Let $0 < p \le 1$ and presume that B and D are matrices of compatible dimensions. Then

1) $\|B + D\|_{S_p}^p \le \|B\|_{S_p}^p + \|D\|_{S_p}^p$; 2) $\big|\,\|B\|_{S_p}^p - \|D\|_{S_p}^p\big| \le \|B - D\|_{S_p}^p$.

When $p = 1$, $\|D\|_{S_1}$ is the trace (or nuclear) norm of the matrix D.
Lemma 3.2: For any matrix X as above, there is

(20)

Proof of Lemma 3.2: Suppose X is the matrix to be recovered and $X^*$ is the optimal solution to problem (8), which yields

(21)

Applying the inverse triangle inequality to (21), we get

(22)

Again, by Lemma 3.1 and inequality (22), we obtain

(23)

Combining (21), (22) and (23) and rearranging terms, it is easy to show that

(24)

which finishes the proof of the lemma.
Lemma 3.3: For any vector x, there is

(25)

Proof of Lemma 3.3: By the definition of the $\ell_p$ norm and Hölder's inequality, the stated bound follows. Therefore, Lemma 3.3 is proved.
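Although the exact statement of (25) is not reproduced here, Hölder's inequality yields comparisons of this type; for instance, for $x \in \mathbb{R}^n$ and $0 < p \le 1$ it gives $\|x\|_p^p \le n^{1-p/2}\|x\|_2^p$. A numerical sanity check of this particular bound (Python/NumPy, illustrative only):

```python
import numpy as np

rng = np.random.default_rng(4)
p = 0.7
for _ in range(200):
    x = rng.standard_normal(10)
    lhs = np.sum(np.abs(x) ** p)                          # ||x||_p^p
    rhs = len(x) ** (1 - p / 2) * np.linalg.norm(x) ** p  # n^{1-p/2} ||x||_2^p
    assert lhs <= rhs + 1e-12

# equality holds when all entries have equal magnitude
z = np.ones(4)
assert np.isclose(np.sum(np.abs(z) ** p), 4 ** (1 - p / 2) * np.linalg.norm(z) ** p)
```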
Next, suppose the restricted isometry constant (RIC) $\delta_r$ and the relative perturbation upper bound $\varepsilon_{\mathcal{A}}^{(r)}$ of the unperturbed measurement operator $\mathcal{A}$ are given; Lemma 3.4 then gives a p-RIP condition for the perturbed measurement operator $\hat{\mathcal{A}} = \mathcal{A} + \Phi$.

Lemma 3.4 (p-RIP of the perturbed measurement operator $\hat{\mathcal{A}}$): Assume that the order-r RIC of $\mathcal{A}$ is denoted $\delta_r$, and the upper bound on the relative perturbation corresponding to the operator Φ is $\varepsilon_{\mathcal{A}}^{(r)}$. Then the RIC $\hat{\delta}_r$ of $\hat{\mathcal{A}}$ is the smallest nonnegative number that obeys

(26)

for any matrix X of rank at most r.
Proof of Lemma 3.4: Inspired by [20], we first define two constants as the smallest nonnegative numbers that satisfy

(27)

for any matrix X with rank at most r. Using the triangle inequality, (9) and (11), we acquire

(28)

By the definition of the p-RIC, it means that

(29)

and applying inequality (29), we obtain a minimum upper bound

(30)

Similarly, taking advantage of the inverse triangle inequality, combined with the definition of the RIC and (11), yields

(31)

Based on the two bounds above, we choose $\hat{\delta}_r$ as the smallest nonnegative constant that makes (27) symmetric. Clearly, the true RIC of $\hat{\mathcal{A}}$ satisfies the claimed bound. The proof of Lemma 3.4 is completed.
Finally, the following Lemma 3.5 clarifies that the perturbed measurement operator $\hat{\mathcal{A}}$ also complies with the p-NSP, provided that the constants s and τ satisfy specific conditions and the measurement operator $\mathcal{A}$ satisfies the p-NSP.

Lemma 3.5: For a given $0 < p \le 1$, suppose the measurement operator $\mathcal{A}$ satisfies the p-null space property with constants s and τ, i.e., condition (10) holds for any X. Then, under suitable conditions on s and τ, there exist constants $\hat{s}$ and $\hat{\tau}$ such that the perturbed measurement operator $\hat{\mathcal{A}}$ satisfies the p-NSP.
Proof of Lemma 3.5: Utilizing (11) and the triangle inequality, there is

(32)

Since $\mathcal{A}$ satisfies the p-null space property, we achieve

(33)

We then rearrange inequality (33) to conclude the desired bound with constants $\hat{s}$ and $\hat{\tau}$. To ensure $\hat{s} < 1$, we need to solve an inequality relating s and the perturbation level, which restricts the admissible perturbation. In view of (11), and to keep $\hat{\tau}$ finite and positive, we must solve a second inequality; after sorting out the above, we obtain the stated conditions. Combining the above, when the two constants satisfy these conditions, we achieve $\hat{s} < 1$ and $\hat{\tau} > 0$; hence the perturbed measurement operator $\hat{\mathcal{A}}$ obeys the p-NSP. In summary, we prove Lemma 3.5.
After the previous preparations, we now prove the two theorems. The upper-bound estimate of the error between the real matrix X to be recovered and the optimal solution $X^*$ of problem (8) is derived from the restricted isometry property, i.e., Theorem 2.1.
Proof of Theorem 2.1: Let $Z = X^* - X$, where X is the real matrix that we expect to recover and $X^*$ is the optimal solution to problem (8). We apply a block decomposition of the SVD of the matrix Z: let the SVD of Z be described by $Z = U \operatorname{diag}(\sigma) V^T$, where the matrices U and V are orthogonal and σ denotes the vector consisting of the singular values of Z. We divide σ into the sum of vectors $\sigma_{(1)}, \sigma_{(2)}, \ldots$, each of which has sparsity 2r (except possibly the last): $\sigma_{(1)}$ is the portion of σ that corresponds to the largest singular values, $\sigma_{(2)}$ is the part that corresponds to the next largest, and so forth. Obviously, the supports of the $\sigma_{(i)}$ are disjoint and together partition the support of σ.
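The partition of the singular-value vector into blocks of sparsity 2r can be sanity-checked numerically; the helper below (Python/NumPy, illustrative only) splits a descending vector into consecutive blocks and confirms that the blocks are disjoint and exhaustive:

```python
import numpy as np

def split_blocks(sigma, k):
    """Split a vector (sorted in descending order) into consecutive blocks of size k."""
    return [sigma[i:i + k] for i in range(0, len(sigma), k)]

rng = np.random.default_rng(5)
sigma = np.sort(np.abs(rng.standard_normal(11)))[::-1]  # descending "singular values"
blocks = split_blocks(sigma, 4)  # block size k = 2r with r = 2 (illustrative)
# every block except possibly the last has exactly k entries,
# and concatenating the blocks recovers the original vector
assert all(len(B) == 4 for B in blocks[:-1])
assert np.allclose(np.concatenate(blocks), sigma)
```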
The following results can be derived from [17]
(34)
(35)
According to (34), we get
(36)
Applying Lemma 3.2 to (36), we can have
(37)
Using the definition of Z and the triangle inequality, it follows that
Therefore, by combining the above inequality with Lemma 3.3, we have
(38)
Moreover, by Lemma 3.4 and (37), we get
(39)
in which case
(40)
From (40) and Lemma 3.2, we have that
(41)
which finishes the proof of Theorem 2.1.
When the constants take particular values, a new RIP condition that can robustly and accurately recover low-rank matrices is obtained, i.e., condition (13) in Theorem 2.1, and it is slightly weaker than the sufficient condition of Zhang M [16].
Theorem 2.2 is based on the null space property of matrices and provides an error-bound estimate between the actual rank-r matrix X to be recovered and the optimal solution $X^*$ of problem (8). Its proof proceeds as follows:
Proof: According to Lemma 3.5, since $\hat{\mathcal{A}}$ meets the p-null space property (p-NSP), there is

(42)

Let $Z = X^* - X$; by Lemma 3.2 and (42), we obtain that

(43)

holds. After simplifying, we further obtain

(44)

Finally, we conclude from (38), (43) and (44) that the error bound (19) holds. This finishes the proof of Theorem 2.2.

In brief, the above completes all proofs of the two theorems.
4. Conclusion
We investigate the fully perturbed problem of reconstructing a low-rank matrix through nonconvex Schatten p-norm minimization and give sufficient conditions for recovery together with corresponding upper-bound error estimates. These results show that nonconvex Schatten p-minimization provides a stable and accurate guarantee for reconstructing low-rank matrices in the presence of overall noise. The obtained results have two implications: first, they guide the selection of measurement operators for low-rank matrix reconstruction, i.e., operators that satisfy weaker RIC sufficient conditions can still promote the recovery capability; second, they provide theoretical support for the error bounds by exploiting two properties, namely the p-RIP and the p-NSP.