A Comparison of CNN and PLSR for Glucose Monitoring Using Mid-Infrared Absorption Spectroscopy

Baorong Fu; Yongji Meng; Xianwen Zhang; Zhushanying Zhang

doi:10.4236/ojapps.2023.133031

Open Journal of Applied Sciences > Vol.13 No.3, March 2023

A Comparison of CNN and PLSR for Glucose Monitoring Using Mid-Infrared Absorption Spectroscopy

Baorong Fu¹, Yongji Meng², Xianwen Zhang^3*, Zhushanying Zhang²
¹Wuhan Great Sea Hi-Tech Co., Ltd., Wuhan, China.
²College of Biomedical Engineering, South-Central Minzu University, Wuhan, China.
³Linyi Grepo Garden Machinery Co., Ltd., Linyi, China.
DOI: 10.4236/ojapps.2023.133031 PDF HTML XML 73 Downloads 436 Views

Abstract

With the development of mid-infrared (MIR) photoelectric devices, mid-infrared spectroscopy has become one of the important methods for non-invasive detection of blood glucose. The mid-infrared region (4000 - 400 cm^-1) has the well-known fingerprint region (1200 - 800 cm^-1) of glucose, which has clearer characteristic absorption peaks and better specificity. There is a lot of molecular information about glucose in the MIR. The non-invasive detection of blood glucose by mid-infrared spectroscopy needs to achieve certain accuracy, and the quantitative model is an important factor affecting the accuracy of glucose detection. In this paper, the samples of imitation solution containing only glucose and the samples of imitation mixed solution are taken as the research objects, and the mid-infrared spectral data of the samples are collected. The full spectrum partial least squares Regression (PLSR) model, SNV + Ctr-PLSR model, MSC + Ctr-PLSR model, and convolutional neural networks (CNN) model of 3000 - 900 cm^-1 band were constructed. Full spectrum PLS model and CNN model of 1200 - 900 cm^-1 band were constructed. The experimental results show that the optimal model of the two bands is CNN, then the correlation coefficient of prediction set (Rp) of 3000 - 900 cm^-1 band is 0.95, and the root mean square error of pre-diction set (RMSEP) value is 22.10. The Rp of 1200 - 900 cm^-1 band is 0.95, and the RMSEP value is 22.54. The research results show that CNN is a promising method, which has higher accuracy than PLSR, and is especially suitable for modeling human complex environment. In addition, the study provides a theoretical and practical basis for CNN in feature selection and model interpretation.

Keywords

Mid-Infrared, Convolutional Neural Networks (CNN), Partial Least Square Regression (PLSR), Glucose

Share and Cite:

Fu, B. , Meng, Y. , Zhang, X. and Zhang, Z. (2023) A Comparison of CNN and PLSR for Glucose Monitoring Using Mid-Infrared Absorption Spectroscopy. Open Journal of Applied Sciences, 13, 383-395. doi: 10.4236/ojapps.2023.133031.

1. Introduction

Normal people’s blood glucose concentration ranges from 70 to 110 mg/dL. Under normal condition, human blood glucose concentration is relatively stable, only slightly increasing after eating, and then returning to normal. Sugar metabolism disorders can cause hypoglycemia or hyperglycemia, and severe cases can lead to diabetes. Diabetes can bring a variety of complications. At present, there is no complete cure for diabetes [1]. It can only be controlled by constantly monitoring the blood sugar levels of diabetics and adjusting their diet and intake of related drugs. Accordingly, the detection of blood glucose has important guiding significance for the prevention and diagnosis of diseases [2].

Traditional blood glucose measurement is done by dripping blood, which increases the risk of infection and the pain of patient. Recently, continuous glucose monitoring (CGM) devices appeared in the market, although it provides a continuous detection method. It uses an enzyme sugar sensor that needs to be replaced regularly, and it needs to detect glucose in the interstitial fluid [3] [4]. Both of these measurements are intrusive. Optical methods provided a great solution, because it has the advantages of high sensitivity, noninvasive, etc. At present, the Non-invasive measure of glucose is generally studied in the near-infrared (NIR) region, but the absorption of glucose in the near-infrared region is weak and the scattering is high, so the prediction accuracy is always low [5]. The mid-infrared (MIR) region (4000 - 400 cm⁻¹) has the well-known fingerprint region (1200 - 800 cm⁻¹) of glucose [6], which has clearer characteristic absorption peaks and better specificity than NIR, and its basic molecular vibration is the most useful. There is a lot of molecular information about glucose in the MIR.

In recent years, some progress has been made in the study of glucose based on the MIR spectroscopy. The measurement of blood glucose based on three wavenumbers using multiple linear regression (MLR) model was developed by Kasahara R et al. in 2018 [7]. In order to compare the prediction accuracy of glucose in two fiber-coupled measurement setups based on attenuated total reflection spectroscopy and direct transmission spectroscopy, partial least squares regression (PLSR) model was used for glucose prediction by Jernelv I L et al., and both of the root-mean-square error of cross-validation (RMSECV) of less than 20 mg/dL [8]. Traditionally, chemometrics has been used to process spectral data, such as MLR, principal component analysis (PCA), PLSR, and so on. However, In the MLR model, the variables participating in the regression cannot exceed the number of samples in the training set, and there is a problem of collinearity in the spectral matrix, which makes it impossible to solve the inverse matrix. Although both the PCA and the PLSR models solve the problem of collinearity, the absorption band of the interfering substance overlaps the absorption band of glucose in the mixed solution, which makes it more difficult to predict grapes and cannot solve the nonlinear problem. These difficulties can be sloved by Artificial Neural Networks (ANN). For example, Miguel Sanchez-Brito et al. used ANN to estimate the glucose value in saliva correctly [9]. However, the traditional ANN is a fully connected neural network, which makes the network learn too many weight parameters, resulting in slow training and over-fitting. Therefore, it is necessary to establish a new quantitative analysis model. Convolutional neural network (CNN) provides a better solution, and it has the characteristics of local conjunctions, shared weights, and translation invariance, etc [10]. It has been widely used in processing image recognition [11], speech detection, etc [12]. At the same time, it was reviewed that advances in one-dimensional spectral data analysis, for example, Qianqian Li et al. used the CNN model combined with MIR spectrum to analyze the corn syrup adulteration in acacia honey, and proposed that the convolution kernel could amplify signals in important spectral interval [13]. Xu Yan et al. used CNN combined with online Raman spectroscopy to monitoring of Cornu Caprae Hircus (GH) hydrolysis, and proposed that the convolutional kernel has a function similar to spectral pretreatment [14]. These examples provide a good direction. But how to make the combination of CNN and MIR predict glucose in mixed solution requires further study.

In views of the above, this paper reports the quantitative analysis of glucose in pure solution and mixed solution in the MIR. The accuracy of their predictions for glucose of CNN was compared to PLSR model. The selecting features automatically of CNN become an example of modeling under nonlinear factors.

2. Materials and Methods

2.1. Experimental Setup

The study was conducted at Wuhan (China) using FTIR spectrometer (Brucker IN-VENIO-R, Germany) with liquid nitrogen cooled-mercury cadmium telluride (LN-MCT) detector and attenuated total reflection (ATR) equipment. Data acquisition was done with an dual channel 24-bit dynamic range analog-to-digital converter card (ADC), Each sample is detected in the range of 4000 cm⁻¹ to 800 cm⁻¹ with a resolution of 4 cm⁻¹ by 16 scans, After each measurement, the ATR is cleaned with distilled water and dried with mirror paper, the output of spectrometer is obtained by calculating the absorbance (A = −lgT(v)), Where T(v) is the light transmittance at wavenumber (v) after absorption by the sample. Finally, OPUS software (Brucker, Germany) was used for exporting spectral data.

2.2. Sample Overview

20% Fat Emulsion Injection diluted (C14-24, FRESENIUS KABI SSPC, Jiang su, China) to 10% was used as the background solution, which was used to simulate the human environment and maintain a constant pH. 32 samples with only glucose in 10% Fat Emulsion Injection were prepared with a concentration range of 40 - 280 mg/dL and interval of 10 mg/dL, which was used to test the stability of system.

The mixed solution consists of different concentrations of glucose, albumin, lactate, urea, cholesterol and fructose, the concentration ranges of which are shown in Table 1. We aim at exploring whether the presence of interferers affects glucose prediction, the 64 mixed solution samples were prepared using DOE algorithm, in order to maintain the robustness of the experiment when using fewer experimental samples.

2.3. Data Analysis Methods

2.3.1. PLSR and CNN

PLSR is a common multiple linear regression model. Preprocessing is needed before PLSR modeling to reduce the impact of noise on the model, and the preprocessing used in this paper includes multiplicative signal correction (MSC), standard normal variate (SNV), scaling (SCA), normalization, centering (Ctr), first derivative, second derivative, moving window average smoothing (MWA) and Savitzky-Golay smoothing (SG).

The CNN model is a feed forward neural network. The structure of CNN used in the article is shown in Figure 1 and the algorithm can be divided into the following three steps: Initialize parameters, Forward propagation and Loss function and back propagation. Forward propagation means that information spreads in a single direction in CNN without any feedback. This paper uses 8 layers of network structure, including the input layer, convolution layer, nonlinear layer, BN layer, pooling layer, dropout layer, FC layer and output layer. The input layer and the output layer respectively refer to the input of 1D spectral data and the output of predicted concentration. In order to speed up the training speed of the network, the spectral data and the corresponding concentration are respectively standardized before inputting them. This paper sets the number of convolution kernels number, the size of convolution kernels, the step size and the filling method to 20, 15, 1, and same Padding, respectively. But putting the BN layer behind the ReLU layer has a better effect [15], so this paper uses the same method. This paper sets the pooling size, the step size and the Filling method to 2, 1, and Same Padding, respectively. Loss function is used to measure the accuracy of the network. The back propagation algorithm calculates the gradient of each weight by taking the chain derivative of the loss function, and then updates the weight. The number of training is set as 1200 in this paper. This function is realized by creating an Adam optimize. The learning rate is an important hyper parameter, and it is 0.001 in this paper.

2.3.2. Evaluation Method

In the paper, the leave one out cross validation (LOOCV), the correlation coefficient of training set (Rc), the correlation coefficient of prediction set (Rp), the root mean square error of training set (RMSEC), the root mean square error of prediction set (RMSEP), and the Root Mean Square Error of cross validation (RMSECV) were used to evaluate the results. The CNN model was built in python 3.7, Tensor Flow 2.1.0 and keras 2.3.1.

Table 1. The concentration ranges of the mixed solution.

Figure 1. The structure of CNN model.

3. Results and Discussion

3.1. Spectrum of Pure Solution

After the two samples with concentrations of 100 mg/dL and 320 mg/dL were lost duing to technical errors, the spectra of 30 glucose samples were shown in Figure 2. There is a lot of machine noise below 900 wave number and too much water noise above 3000 wave number. These noises are needed to be manually removed. Figure 3 shows four examples of noise removal, and other than that without any spectral pretreatment. The main chemical bond absorption is as follows: the stretching vibration of C-O is about in the range of 1200 - 900 cm⁻¹, the stretching vibration of C-H is about in the range of 3000 - 2800 cm⁻¹.

Figure 4 shows the prediction results of glucose in the pure solution using PLSR and LOOCV. The PLSR model uses 14 LVs with RMSECV and R values of 24.70 and 0.94, respectively. From the results, it can be concluded that there is a strong correlation between glucose concentration and absorbance, and it is easy to predict glucose in the pure solution.

3.2. Spectrum of the Mixed Solution

The absorbance spectra of the 64 samples in the mixed solution are shown in Figure 5, which have a large amount of machine noise and water noise just like

Figure 2. Absorbance spectra of different glucose concentrations.

Figure 3. Absorbance spectra of four pure glucose solutions after removing machine noise and water noise.

pure glucose. The same treatment is applied to these samples, and the remaining spectra after removing the noise are shown in Figure 6.

During the configuration of the mixed solution, duing to the addition of too many substances, personal operationl or machine errors can interfere with the actual concentration of glucose. In the analysis of infrared Spectroscopy, there is a certain correlation between the absorbance spectra and the chemical value of the measured object. The existence of abnormal samples weakens the correlation

Figure 4. Glucose in the pure solutions was predicted using PLSR model and LOOCV.

Figure 5. Absorbance spectra of mixed solutions with different concentrations.

between the two, thus affecting the performance of the algorithm. Therefore, Principal Component Analysis-Mahalanobis Distance (PCA-MD) was used to eliminate abnormal samples, in which PCA extracted 1 principal component number and played a role in dimensionality reduction of data. The result is shown in Figure 7, where the red circle represents abnormal samples 49, 53 and 54 were eliminated; the corresponding glucose concentrations were 260 mg/dL, 260 mg/dL and 280 mg/dL, respectively.

Figure 6. Absorbance spectrum of mixed solution after removing machine noise and water noise.

Figure 7. Abnormal samples were removed by PCA-MD.

Figure 8 shows the prediction results of glucose in mixed solution using PLSR and LOOCV. The PLSR model uses 14 LVs with RMSECV and R values of 27.82 and 0.89, respectively. Compared with the results in Figure 4, it is much more difficult to predict glucose in mixed solutions.

Then the Kennard-Stone (KS) algorithm was used to divide the remaining 61 samples, of which 40 were used for the training set and 21 were used for the prediction set.

Figure 8. Glucose in the mixed solution was predicted using PLSR model and LOOCV.

The prediction results of mixed solution using PLSR and the CNN models in the wavenumber range 3000 to 900 cm⁻¹ are illustrated in Table 2. Among them, the pretreatments used in this paper include single pretreatments or the combination of two different pretreatments. And only the better predicted results after preprocessing were listed in the table. The best result was the combination of MSC + Center, then the PLSR model extracted 11 LVs, and the RMSEP and R of the prediction set are 27.17 mg/dL and 0.92, respectively. For CNN, the model parameters of CNN are model1 in Table 4, the RMSEP and R of the prediction set are 22.10 mg/dL and 0.95, respectively. The result shows that in this wave-number region, PLS needs to be combined with many pretreatments to achieve a better modeling effect. Compared with PLS, the nonlinear nature of CNN makes it have better prediction ability.

Many studies have found that there is a fingerprint region of glucose in the mid-infrared [16] [17], and there are obvious characteristic absorption peaks in the wave-number range from 1200 to 900 cm⁻¹. In order to explore their correlation, this wave number range is used as a way of wavelength selection to build the model.

The results are shown in Table 3. For the PLSR model, the results of PLSR model without pretreatment were the best, then PLSR model extracted 6 LVs, and the RMSEP and R of the prediction set were 30.94 mg/dL and 0.90, respectively. For the CNN model, the model parameters of CNN are model 2 in Table 4, the RMSEP and R of prediction set are 22.54 mg/dL and 0.95, respectively. The result shows that, compared with the previous selected wavenumber range, the prediction result of the PLSR model without pretreatment is better, indicating that the region is highly correlated with glucose absorption, and excessive

Table 2. In the 3000 to 900 wavenumber range, PLSR and CNN were used to predict the glucose in the mixed solution.

Table 3. In the 1200 to 900 wavenumber range, PLSR and CNN were used to predict the glucose in the mixed solution.

Table 4. Optimized hyperparameters for each CNN model.

pretreatments may cause the loss of important spectral information. At the same time, it can be found that CNN only changes the regularizer and rate hyperparameters, which can get the prediction result similar to the wavenumber range from 3000 to 900 cm⁻¹. It indicates that increasing regularization and discarding rate can automatically reduce the interference of noise, and reducing regularization and discarding rate can reduce the loss of spectral information.

The Clarke error grid Analysis was used to evaluate the accuracy of glucose prediction, where regions A and B are clinically accepted [18] [19]. Figure 9 shows the results of different models of Clarke error grid. In the two selected wavenumber region, the PLSR and CNN models both reported that the prediction result of a sample with a concentration of 60 mg/dL was in region D, indicating that the sample deviated greatly from the fitting model and was regarded as an abnormal sample. Although the predictions of the PLSR and the CNN models are in regions A, B and D, the number of the CNN predictions in region A is significantly more than in the PLSR model.

4. Conclusion

This article uses PLSR and CNN to quantitatively analyze glucose in imitation solution. The Rp of the PLSR model is lower than that of the CNN model, and the RMSEP value of the PLSR model is higher than that of the CNN model, no matter which band and what pretreatment methods were used. In addition, the PLSR model of two bands without any pretreatment, the Rp of 1200 - 900 cm⁻¹

Figure 9. The Clarke error grid analysis plots for PLS model and CNN model was used in different wave-length ranges: In the 3000 to 900 wavenumber range: (a) CNN (Model 1); (b) PLSR (MSC + Ctr), In the 1200 to 900 wavenumber range: (c) CNN (Model 2); (d) PLSR (none).

band is 0.9 which is higher than 0.84 of 3000 - 900 cm⁻¹ band. The RMSEP value of 1200 - 900 cm⁻¹ band is 30.94 which is less than 38.47 of 3000 - 900 cm⁻¹ band. It indicates that 3000 - 900 cm⁻¹ band will introduce more noise, and more bands and spectral data will not help improve accuracy. However, the detection accuracy of CNN model in two bands is almost the same, indicating that CNN model has better adaptability. With the nonlinearity and generalization capabilities of CNN, it is suitable for processing spectral data under multiple interferences. In the future, our goal is to study how interferences affect glucose prediction and obtain more spectral data to validate our model.

Acknowledgements

This research was funded by the National Natural Science Foundation of China (Grant Nos. 61501526 and 61178087). This research was also funded by the Fundamental Research Funds for the Central Universities, South-Central Minzu University (Grant Number: CZQ22006). This research was also funded by the horizontal projects (Grant Nos. HZY21099 and HZY22132).

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1]	Harding, J.L., Pavkov, M.E., Magliano, D.J., Shaw, J.E. and Gregg, E.W. (2018) Global Trends in Diabetes Complications: A Review of Current Evidence. Diabetologia, 62, 3-16. https://doi.org/10.1007/s00125-018-4711-2
[2]	Vashist, S.K., Zheng, D., Al-Rubeaan, K., Luong, J.H. and Sheu, F.-S. (2011) Technology behind Commercial Devices for Blood Glucose Monitoring in Diabetes Management: A Review. Analytica Chimica Acta, 703, 124-136. https://doi.org/10.1016/j.aca.2011.07.024
[3]	Enter, B.J. and Hauff, E. (2018) Challenges and Perspectives in Continuous Glucose Monitoring, Chemical Communications, 54, 5032-5045. https://doi.org/10.1039/C8CC01678J
[4]	Wagner, J., Tennen, H. and Wolpert, H. (2012) Continuous Glucose Monitoring: A Review for Behavioral Researchers. Psychosomatic Medicine, 74, 356-365. https://doi.org/10.1097/PSY.0b013e31825769ac
[5]	Yadav, J., Rani, A., Singh, V. and Murari, B.M. (2015) Prospects and Limitations of Non-Invasive Blood Glucose Monitoring Using Near-Infrared Spectroscopy. Biomedical Signal Processing and Control, 18, 214-227. https://doi.org/10.1016/j.bspc.2015.01.005
[6]	Pleitez, M.A., Lieblein, T., Bauer, A., Hertzberg, O., von Lilienfeld-Toal, H. and Mäntele, W. (2013) In Vivo Noninvasive Monitoring of Glucose Concentration in Human Epidermis by Mid-Infrared Pulsed Photoacoustic Spectroscopy. Analytical Chemistry, 85, 1013-1020. https://doi.org/10.1021/ac302841f
[7]	Kasahara, R., Kino, S., Soyama, S. and Matsuura, Y. (2018) Noninvasive Glucose Monitoring Using Mid-Infrared Absorption Spectroscopy Based on a Few Wavenumbers. Biomedical Optics Express, 9, 289-302. https://doi.org/10.1364/BOE.9.000289
[8]	Jernelv, I.L., Strom, K., Hjelme, D.R. and Aksnes, A. (2019) Infrared Spectroscopy with a Fiber-Coupled Quantum Cascade Laser for Attenuated Total Reflection Measurements Towards Biomedical Applications. Sensors, 19, Article 5130. https://doi.org/10.3390/s19235130
[9]	Sánchez-Brito, M., Luna-Rosas, F.J., Mendoza-González, R., Mata-Miranda, M.M., Martínez-Romo, J.C. and Vázquez-Zapién, G.J. (2021) A Machine-Learning Strategy to Evaluate the Use of FTIR Spectra of Saliva for a Good Control of Type 2 Diabetes. Talanta, 221, Article ID: 121650. https://doi.org/10.1016/j.talanta.2020.121650
[10]	Lecun, Y., Bengio, Y. and Hinton, G. (2015) Deep Learning. Nature, 521, 436-444. https://doi.org/10.1038/nature14539
[11]	Wu, H., Bie, R.F., Guo J.Q., Meng, X. and Wang, S.L. (2018) Optimized CNN Based Image Recognition through Target Region Selection. Optik, 156, 772-777. https://doi.org/10.1016/j.ijleo.2017.11.153
[12]	Yousefi, M. and Hansen, J.H.L. (2021) Block-Based High Performance CNN Architectures for Frame-Level Overlapping Speech Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 28-40. https://doi.org/10.1109/TASLP.2020.3036237
[13]	Li, Q.Q., Zeng, J.Q., Lin L., Zhang, J., Zhu, J.Y., Yao, L., Wang, S.Y., Du, J. and Wu, Z.S. (2021) Mid-Infrared Spectra Feature Extraction and Visualization by Convolutional Neural Network for Sugar Adulteration Identification of Honey and Real-World Application. LWT, 140, Article ID: 110856. https://doi.org/10.1016/j.lwt.2021.110856
[14]	Yan, X., Zhang, S., Fu, H. and Qu, H. (2020) Combining Convolutional Neural Networks and on-Line Raman Spectroscopy for Monitoring the Cornu Caprae Hircus hydrolysis Process. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 226, Article ID: 117589. https://doi.org/10.1016/j.saa.2019.117589
[15]	Mishkin, D., Sergievskiy, N. and Matas, J. (2017) Systematic Evaluation of Convolution Neural Network Advances on the Imagenet. Computer Vision and Image Understanding, 161, 11-19. https://doi.org/10.1016/j.cviu.2017.05.007
[16]	Werth, A., Liakat, S., Dong, A., Woods, C.M. and Gmachl, C.F. (2018) Implementation of an Integrating Sphere for the Enhancement of Noninvasive Glucose Detection Using Quantum Cascade Laser Spectroscopy. Applied Physics B, 124, Article No. 75. https://doi.org/10.1007/s00340-018-6946-5
[17]	Brandstetter, M., Sumalowitsch, T., Genner, A., Posch, A.E., Herwig, C., Drolz, A., Fuhrmann, V., Perkmann, T. and Lendl, B. (2013) Reagent-Free Monitoring of Multiple Clinically Relevant Parameters in Human Blood Plasma Using a Mid-Infrared Quantum Cascade Laser Based Sensor System. Analyst, 138, 4022-4028. https://doi.org/10.1039/c3an00300k
[18]	Gani, A., Gribok, A.V., Lu, Y.H., Ward, W.K., Vigersky, R.A. and Reifman, J. (2010) Universal Glucose Models for Predicting Subcutaneous Glucose Concentration in Humans. IEEE Transactions on Information Technology in Biomedicine, 14, 157-165. https://doi.org/10.1109/TITB.2009.2034141
[19]	Clarke, W.L. (2005) The Original Clarke Error Grid Analysis (EGA). Diabetes Technology & Therapeutics, 7, 776-779. https://doi.org/10.1089/dia.2005.7.776

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies