Band Target Entropy Minimization and Target Partial Least Squares for Spectral Recovery and Calibration

sdb@udel.edu. Preprint submitted to ArXiv March 28, 2018. arXiv:1802.03839v2 [stat.ML] 27 Mar 2018.

The resolution and calibration of pure spectra of minority components in measurements of chemical mixtures without prior knowledge of the mixture is a challenging problem. In this work, a combination of band target entropy minimization (BTEM) and target partial least squares (T-PLS) was used to obtain estimates for single pure component spectra and to calibrate those estimates in a true, one-at-a-time fashion. This approach allows for minor components to be targeted and their relative amounts estimated in the presence of other varying components in spectral data. The use of T-PLS estimation is an improvement to the BTEM method because it overcomes the need to identify all of the pure components prior to estimation. Estimated amounts from this combination were found to be similar to those obtained from a standard method, multivariate curve resolution-alternating least squares (MCR-ALS), on a simple, three component mixture dataset. Studies of two experimental datasets demonstrate cases in which the combination of BTEM and T-PLS could model the pure component spectra and obtain concentration profiles of minor components but MCR-ALS could not.

Keywords: band target entropy minimization, recovery, target partial least squares

1. Introduction

Resolving poorly represented, low intensity pure component spectra and concentration profiles from chemical mixtures without a priori knowledge is an open field of research.
Provided that varying amounts of the target compound are present in the data matrix, many methods, such as self modeling curve resolution [1], evolving factor analysis [2, 3], window factor analysis [4], heuristic evolving latent projections [5], iterative target transformation factor analysis [6], simple-to-use interactive self-modeling mixture analysis (SIMPLISMA) [7], and the standard multivariate curve resolution-alternating least squares (MCR-ALS) [8], have been used to resolve the spectral signatures of major contributors from mixture data. However, when the component of interest is present at relatively low levels, the resolution of that component by these methods is problematic because its spectral response can be lost in the contributions of noise [9], or its variation can be small enough relative to those of the other contributions that a given method will fail to resolve it from the larger components [6].

Band target entropy minimization (BTEM) takes a different approach to the problem of spectral signature recovery. Band target entropy minimization estimates a single, pure component spectrum from a linear combination of weighted loadings obtained by principal component analysis [10]. This approach allows orthogonal contributions of variance to be combined to form a spectral estimate. One potential advantage of BTEM is that the technique does not require that all components in a mixture have adequate variation in the samples, as long as the single target for resolution is represented in the PCA loadings [10, 11]. This advantage cannot be attributed to many other methods because most methods attempt to simultaneously discover and resolve the spectral signatures of every component in a mixture [1, 6, 7, 8].
Although BTEM has been used for the recovery of individual components in a mixture, in order to obtain estimates of relative amounts, the resulting pure component spectral estimates have been used as regression coefficients in classical least squares (CLS) models [10, 11]. This approach poses a problem because classical least squares regression requires that all pure component responses embedded in a mixture be included in the regression model. If this condition is not met, the concentration estimate matrix that results from CLS has insufficient rank and cannot be used to discriminate the additive effects of all mixture components from a given target compound. Therefore, previous work using BTEM spectral recovery required either the discovery of all pure component responses for the mixture in order to obtain their approximate amounts, or that a regression be performed on spectra which have one component or on regions where only the available pure component spectra contribute. In many cases, an analyst may only be interested in the relative amount of one component for quantitation due to the presence of an undersampled or poorly represented background. Similarly, to characterize an unknown, minor component of a mixture, the entire spectral range may need to be considered. Thus, the potential advantage of being able to recover single component spectra from BTEM has not been utilized in a way that allows for determining the relative amounts of single components.

It is well known that calibration methods such as partial least squares or principal component regression [12] can overcome issues associated with rank deficiencies that arise when the responses for all components of a mixture are not fully known. However, in curve resolution experiments, prior knowledge of property values is not available. Thus, the usual regression methods cannot be used because only the target spectrum and the spectral data are available.
To avoid the problems associated with conventional regression methods, target partial least squares (T-PLS) [13] can be used for creating regression models built from single, pure component spectral estimates obtained by BTEM from near-infrared and infrared spectra. Target partial least squares has been shown to take advantage of the partial least squares regression framework in a way that uses pure component spectra as a property vector to estimate relative amounts of minor components and avoid contributions from background [13]. The quantitative efficacy of this combination can be compared with that of MCR-ALS, a standard technique for modeling pure component spectra and concentration profiles from infrared spectra [14], hyperspectral images [15], and a variety of other applications [16]. MCR-ALS was selected as a base of comparison for this study because it has been presented for the quantitation of trace components in mixtures [17, 18].

MCR-ALS and other curve resolution methods have not been demonstrated to work well in resolving components with relatively small signatures or components in mixtures that have poorly represented background contributions [6, 18]. Components that have low concentrations can have relatively large signals due to their instrumental responses. Previous MCR-ALS studies related to analytes at low concentrations did not address the case where the low concentration components also had low signals for quantitation [17, 18]. MCR-ALS estimates cannot be obtained when collecting data with sufficient independent variation of each component in a given mixture is not possible due to a lack of experimental control [19]. Components that suffer from these conditions are said to have poor representation because they cannot be adequately sampled for resolution.
A primary goal of this study is to investigate the efficacy of BTEM and T-PLS for resolving spectra and obtaining semiquantitative estimates under conditions where MCR-ALS estimates cannot be readily obtained.

This study first shows that T-PLS can be used to construct models that are more accurate than individual CLS estimates obtained from BTEM recovered signals, and that these models can be competitive with those from MCR-ALS on a dataset where the responses of the components were each relatively large and where there were minimal background effects. Then, the utility of BTEM and T-PLS for obtaining pure component estimates is investigated for situations where the spectral signatures of the pure components are low in magnitude and where sufficient representations of every analyte are difficult to attain. The efficacy of the hybrid method under these more challenging conditions is demonstrated with two experimental datasets in which BTEM and T-PLS succeeded at modeling the pure components known to be present in those mixtures, but MCR-ALS failed.

2. Theory

2.1. Multivariate Curve Resolution Alternating Least Squares

The goal of multivariate curve resolution (MCR) is to obtain estimates for the pure component spectra ($S$) and the respective relative concentration profiles ($C$) for the components of an unknown mixture. Because this study focuses on spectroscopic responses obtained from chemical mixtures, the relationship between the matrix of measured absorbances ($A$) and the pure components can be considered as an application of the Beer-Lambert-Bouguer law, where $A = CS$. Given some absorbance matrix $A$, MCR can be used to model candidate solutions for $C$ and $S$. Due to inherent ambiguities in posing this inverse problem, however, there is often not an exact solution for $C$ or $S$ [20]. Instead, practitioners seek chemically plausible solutions.
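The ambiguity of the bilinear model can be illustrated numerically. The following is a minimal sketch, not part of the original study: the matrices `C`, `S`, and `T` are hypothetical random values, and the convention assumed here is that rows of `S` are pure spectra so that `A = C @ S`. It shows that any invertible matrix `T` yields a second pair of factors that reproduces the same data exactly.

```python
import numpy as np

rng = np.random.default_rng(0)
C = rng.uniform(0.1, 1.0, size=(6, 2))    # hypothetical concentrations: 6 samples x 2 components
S = rng.uniform(0.1, 1.0, size=(2, 40))   # hypothetical pure spectra: 2 components x 40 channels
A = C @ S                                 # noise-free bilinear mixture data

# Any invertible 2 x 2 matrix T gives another factorization of the same A.
T = np.array([[1.0, 0.3],
              [0.2, 1.0]])
C_alt = C @ T
S_alt = np.linalg.inv(T) @ S

same_data = np.allclose(A, C_alt @ S_alt)        # both factor pairs reproduce A
different_factors = not np.allclose(C, C_alt)    # yet the factors themselves differ
print(same_data, different_factors)
```

Because the data alone cannot distinguish between such factor pairs, constraints or prior knowledge are needed to select a chemically plausible one.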
Multivariate curve resolution alternating least squares has been shown to provide, in many instances, satisfactory empirical estimates for both the spectra of pure components and their respective concentration profiles [16, 8]. Many variations of MCR-ALS have been presented in the literature. One commonality shared across all of them is that all estimated pure components present in a mixture are modeled simultaneously.

Alternating least squares solves the inverse Beer-Lambert-Bouguer problem through iterative least squares projections of both undetermined matrices onto one another,

$C = A(S^{T})^{-1}$ (1)

$S = C^{-1}A$ (2)

The matrix inverses of both $S$ and $C$ are often singular due to rank deficiencies in the spectral data. As a way to avoid rank-related difficulties, the Moore-Penrose pseudoinverse is commonly employed [8].

Because MCR-ALS decomposes the spectral responses in the $A$ matrix based on all of the plausible information in $C$ and $S$, an important factor in building an effective MCR-ALS model is the selection of an appropriate number of pure components. Without an adequate estimate of the number of components in the mixture, the pure component spectra and concentration profiles often become unreliable [21]. Domain knowledge can be used to provide reasonable estimates for the number of pure components. When domain knowledge is not available, the number of components for an MCR-ALS model can be selected heuristically from a Scree plot of the eigenvalues that result from the singular value decomposition of the mixture matrix $A$ [22].

Even if an appropriate number of pure component estimates has been selected, ambiguous solutions may still be obtained from the nonconvex optimization of $C$ and $S$ [23]. Many schemes have been employed to constrain the alternating least squares solutions to chemically plausible regions. The most commonly used constraints are those based on physical laws such as nonnegative concentrations and mass balance.
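The alternating updates and the two physical constraints named above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the implementation used in this work: rows of `S` are taken to be pure spectra (so `A ≈ C @ S`), non-negativity is imposed by clipping rather than by a non-negative least squares solver, closure is imposed by renormalizing the rows of `C`, and the function name `mcr_als` and the synthetic data are hypothetical.

```python
import numpy as np

def mcr_als(A, S_init, n_iter=50):
    """Alternate pseudoinverse updates of C and S for the model A ~ C @ S."""
    S = S_init.copy()
    for _ in range(n_iter):
        C = np.clip(A @ np.linalg.pinv(S), 0.0, None)   # update C, clip to non-negative
        C /= C.sum(axis=1, keepdims=True) + 1e-12       # closure: rows of C sum to one
        S = np.clip(np.linalg.pinv(C) @ A, 0.0, None)   # update S, clip to non-negative
    return C, S

# Synthetic two-component mixture used only to exercise the function.
x = np.linspace(0.0, 1.0, 120)
S_true = np.vstack([np.exp(-0.5 * ((x - 0.3) / 0.07) ** 2),
                    np.exp(-0.5 * ((x - 0.6) / 0.07) ** 2)])
rng = np.random.default_rng(3)
C_true = rng.dirichlet(np.ones(2), size=12)             # rows already sum to one
A = C_true @ S_true

S_start = S_true + 0.05 * np.abs(rng.standard_normal(S_true.shape))  # perturbed initial estimate
C_fit, S_fit = mcr_als(A, S_start)
```

In practice the quality of the result depends on the initial estimate and on how the constraints are enforced, which is one reason dedicated initialization methods are often used.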
Constraints based on domain knowledge such as unimodality have also been applied [8]. Perhaps the simplest way to reduce ambiguity in Equations 1 and 2 is by selecting regions of interest in the mixture spectra, a method sometimes referred to as band targeting [10].

Starting from suitable initial estimates for $C$ and $S$ is another common way to obtain useful MCR-ALS models. Evolving factor analysis and SIMPLISMA are two curve resolution methods that have been used to provide reasonable initial estimates of the pure component spectra in matrix $S$ [24]. Many of the other curve resolution methods are now commonly used as starting estimates for iterative algorithms like MCR-ALS rather than as stand-alone techniques [19, 25, 24].

2.2. Band Target Entropy Minimization

Band target entropy minimization is another spectral recovery technique that may be used to model estimated pure spectral components, but unlike MCR-ALS, it does not simultaneously estimate concentration profiles. BTEM is used to resolve pure component spectra from mixtures through the use of singular value decomposition (SVD) of the $A$ matrix. Singular value decomposition changes the basis of a matrix such that the mutually orthogonal axes that contain the most variance are contained in the loadings matrix ($V$), the square magnitude of variance explained by the loadings is contained in the diagonal entries of $S$, and the scores matrix ($U$) is populated with the projections of each observation vector in $A$ onto the loading axes.
$A = USV$ (3)

BTEM utilizes an optimization technique, typically simulated annealing, to find an estimated spectrum vector, $\hat{a}$, whose normalized first (or higher order) derivative with respect to wavelength ($\lambda$) has minimal Shannon entropy ($H$), which is defined for any probability value ($p$) as follows,

$H(p) = -p \log(p)$ (4)

The use of Shannon entropy in the calculation of the BTEM objective function is not rigorously founded because derivatives of spectra are not stochastic vectors following the laws of probability. However, by normalizing the entries $|d\hat{a}/d\lambda|$ by their maximum value, the spectral derivatives are constrained between zero and one, similar to a probability vector for a stochastic process. The core idea behind the use of the entropy argument in BTEM is that it allows for the recovery of a spectrum that is minimal in differential information through a projection of the loadings. Those spectra are attained by minimizing the objective function ($O$), where $O = \arg\min H(|d\hat{a}/d\lambda|)$, and where $\hat{a}$ is defined as

$\hat{a} = t S_{trunc} V_{trunc}$ (5)

In Equation 5, $t$ is the vector obtained from the optimization, and $S_{trunc}$ and $V_{trunc}$ are the singular values and corresponding loadings that contain a component of interest. The singular values included in the BTEM model are usually decided upon by band-targeting spectral regions from loading plots obtained by SVD that appear to contain a pure component [10]. In our experiments, however, using the entire spectrum rather than band targeting select regions was often equally effective ($S_{trunc} = S$ and $V_{trunc} = V$). Typically, the estimated pure spectral response $\hat{a}$ is constrained to be non-negative based on heuristic rules. The computational aspects and suggestions for implementation of the nonnegativity constraints used in BTEM can be found in greater detail in [10].

In essence, BTEM is used to find a linear combination of loadings obtained from SVD that gives a smooth, simple representation.
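The objective that the optimizer evaluates for each candidate vector $t$ can be sketched as follows. This is an illustrative sketch only: the derivative is approximated by a first finite difference, `V_trunc` is assumed to hold the loadings as rows (the `Vt` array returned by NumPy's SVD), zero entries are skipped by the usual convention that $0 \log 0 = 0$, and the simulated annealing search and the non-negativity heuristics of [10] are not shown. The function name and the synthetic two-band data are hypothetical.

```python
import numpy as np

def btem_objective(t, S_trunc, V_trunc):
    """Entropy of the max-normalized first-difference magnitudes of t @ S_trunc @ V_trunc."""
    a_hat = t @ S_trunc @ V_trunc              # candidate spectrum, Equation 5
    d = np.abs(np.diff(a_hat))                 # finite-difference stand-in for |d a_hat / d lambda|
    p = d / d.max()                            # scale entries into [0, 1]
    p = p[p > 0]                               # skip zeros, treating 0 * log(0) as 0
    return float(-(p * np.log(p)).sum())       # summed Equation 4 terms

# Synthetic two-band mixture, used only to exercise the objective.
x = np.linspace(0.0, 1.0, 150)
s1 = np.exp(-0.5 * ((x - 0.35) / 0.05) ** 2)
s2 = np.exp(-0.5 * ((x - 0.65) / 0.05) ** 2)
rng = np.random.default_rng(4)
A = rng.uniform(0.1, 1.0, size=(10, 2)) @ np.vstack([s1, s2])

U, s, Vt = np.linalg.svd(A, full_matrices=False)
S_trunc, V_trunc = np.diag(s[:2]), Vt[:2]

# A vector t that reproduces the first pure band from the two retained loadings.
t_pure = s1 @ np.linalg.pinv(S_trunc @ V_trunc)
obj_pure = btem_objective(t_pure, S_trunc, V_trunc)
obj_mixed = btem_objective(np.array([1.0, 1.0]), S_trunc, V_trunc)
print(obj_pure, obj_mixed)
```

An optimizer would search over `t` for the lowest objective value subject to the constraints; the two values printed here only show how a candidate is scored.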
Such simple representations have been previously shown to be effective estimates of the spectral responses of pure components in mixtures [10, 11]. The most notable limitation of BTEM curve resolution is that the algorithm requires each pure analyte to vary independently enough relative to the other components in the mixture for it to be reasonably represented by linear combinations of the loadings matrix obtained from the singular value decomposition [10].

Garland et al. quantified the pure component spectral estimates obtained from BTEM by using the classical least squares regression framework [10]. The equation for the relative quantification of BTEM recovered spectra is,

$C = A\hat{a}^{T}(\hat{a}\hat{a}^{T})^{-1}$ (6)

Equation 6 is equivalent to replacing the regression weights in a typical classical least squares model with a pure component spectral estimate. Typically, classical least squares modeling is used to relate multivariate instrumental responses to linear changes in the amount of the pure component spectra, but, for CLS to give rank sufficient solutions, the spectra of all pure components present in the mixture must be included in the model. Equation 6 has been used in earlier work [10] on a single recovered spectrum, or a vector, which implies that the equation's intended use is for pure spectral regions. However, if the response of a component in a mixture does not have a pure region, as is common in near infrared spectroscopy, there is no direct extension of Equation 6 for use on multiple recovered spectra. Users of BTEM are limited to the quantification of each component one-at-a-time, knowing that rank arguments are not satisfied. This restriction imposes significant limitations on BTEM quantification.

2.3. Target Partial Least Squares Regression

Calibration of one component in a mixture in the absence of property information is a challenge. An alternative calibration method, target partial least squares regression (T-PLS), first introduced by Feudale et al.,
allows for the semiquantitation of a single pure analyte spectrum in a mixture of components [13]. Target partial least squares regression employs the same algorithm as conventional partial least squares regression, with a few changes. Partial least squares regression describes the relation between a vector of property values, $y$, and a matrix $X$ through a reduced rank linear model that maximizes the covariance of $X$ and $y$ [26]. The major difference between the conventional PLS and T-PLS algorithms is that the T-PLS algorithm is used to project $y$ into the observation space of $X$, rather than seeking latent features as a result of the projection of $y$ onto the variable space of $X$. The $y$ vector for T-PLS is a pure spectrum of a target component that is hypothesized to be present in a mixture rather than a latent property. In this work, we refer to these projections as latent projections rather than latent variables, although the mathematics are similar. The only adjustment required to allow the nonlinear iterative partial least squares (NIPALS) algorithm to be used for the creation of T-PLS models is that the calculation of the weight matrix ($W$) and all subsequent calculations must be transposed accordingly. The NIPALS algorithm for T-PLS, which follows the same nomenclature as that of traditional PLS [27], is provided in Algorithm 1.

Although T-PLS and PLS models are created from similar algorithms, the interpretations of these models and their validation processes are different [13]. This is because a pure spectrum is used in place of the property vector and the regression coefficient ($m$) may be used to provide relative abundances of the target. The optimal number of latent projections for T-PLS models cannot be obtained from cross-validated results, as is common for most PLS models, because T-PLS is designed for use when property values, such as concentrations, are not known.
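One way to read the transposition described above is as an ordinary single-response NIPALS PLS fit applied to the transposed data matrix, with the target spectrum as the response. The sketch below follows that reading and is an assumption rather than a reproduction of Algorithm 1 or of the method in [13]: the function name `tpls_coefficients`, the absence of mean centering, and the synthetic dilution data are all hypothetical choices made for illustration.

```python
import numpy as np

def tpls_coefficients(A, target, n_proj):
    """PLS1-style NIPALS on A.T with the target spectrum as y; returns one weight per sample."""
    X = A.T.astype(float)                 # rows: spectral channels, columns: samples
    y = target.astype(float)              # astype returns a copy, so the input is untouched
    W, P, q = [], [], []
    for _ in range(n_proj):
        w = X.T @ y
        w /= np.linalg.norm(w)            # weight vector over samples
        t = X @ w                         # latent projection over channels
        tt = t @ t
        p = X.T @ t / tt                  # loading vector over samples
        b = y @ t / tt                    # inner regression coefficient
        X = X - np.outer(t, p)            # deflate X
        y = y - b * t                     # deflate y
        W.append(w)
        P.append(p)
        q.append(b)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    return W @ np.linalg.solve(P.T @ W, q)   # regression vector m, length = number of samples

# Background-free check: spectra that are pure multiples of one band.
x = np.linspace(0.0, 1.0, 100)
band = np.exp(-0.5 * ((x - 0.5) / 0.06) ** 2)
amounts = np.array([0.05, 0.1, 0.5, 1.0, 2.0])    # hypothetical relative amounts
A = np.outer(amounts, band)
m = tpls_coefficients(A, band, n_proj=1)
print(m / m.max())
```

In this background-free case a single latent projection returns weights proportional to the amounts; the interest of the method, as described in the text, lies in using additional projections when other components are present.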
A plot of the percent variance explained in $X$ or $y$, calculated from the residual sum of squares (SSQ), versus the number of latent projections may be used to decide on the number of latent projections used in the model. This number can be selected by the point of diminishing return [13], as is done in K-means clustering, or by setting a threshold on the number of projections required to account for a set percentage (e.g. 95%) of the explained variance. This approach is most useful if the noise of the measurements is either known or estimable.

Algorithm 1 Target Partial Least Squares
1: procedure
2: $u = y$
3: for ($L$ in 1 to LatentProjections)
4: while ($\lVert t_{new} - t_{old} \rVert_{2} >$ tolerance)
5: $w = uX / \lVert uX \rVert$
6: $t_{old} = t_{new}$
7: $t_{new} = Xw$
8: $q = 1$
9: $p = t_{new}X / (t_{new}t_{new})$
10: end while
11: $b = ut / (tt)$
12: $X_{L} = X_{L-1} - pt$
13: $Y_{L} = Y_{L-1} - Bt$
14: $SSQ_{X}$ = sum over $j$ and $i$ of the squared residuals of $X$
15: $SSQ_{y}$ = sum over $j$ and $i$ of the squared residuals of $y$
16: Append $w$, $t$, $q$, and $p$ vectors to matrices $W$, $T$, $Q$, and $P$
17: Append $b$ to a vector $B$
18: end for // Where $\delta_{i,i}$ is the Kronecker delta
19: $m = W (P^{T} W)^{-1} (b_{i}\,\delta_{i,i})$

3. Data

3.1. Triliquid Data

The triliquid dataset is a designed, near-infrared dataset collected at wavelengths of 1,100-2,500 nm using a FOSS 6500 instrument in transmission mode. The dataset is composed of spectra of acetic acid, methanol, and water at volume fractions of 0, 25, 50, 75, and 100%. Every sample in the design was prepared and measured in duplicate [28].

For the curve resolution comparison experiments, the samples that contained 100% concentration of a single liquid were not included in the training data. These pure samples were used only for comparisons with the pure component spectral estimates and calibration methods.

3.2. Milk Adulteration Data

The milk adulteration dataset was collected with a diffuse reflectance microPHAZIR™ near infrared spectrometer over the wavelength range of 1,595.7 - 2,396.3 nm.
The purpose of this dataset is to determine whether a sample is milk powder (48 samples) or milk powder adulterated with melamine (29 samples).

3.3. Vapor Release Data

The vapor release dataset consists of infrared hyperspectral measurements of dimethyl methylphosphonate (DMMP) vapors released from a gas stack at 170 °C. The release was measured against a fixed background at a distance of 1.5 km using a hyperspectral infrared spectrometer created by Physical Sciences Inc. The infrared spectrometer measured spectral responses over a range of 1,270 - 920 cm$^{-1}$ at a spectral resolution of 10 cm$^{-1}$. The hyperspectral images collected from the imaging spectrometer formed a data cube of 64 by 64 spatial pixels with 36 spectral measurements obtained at each spatial pixel. A black body radiation correction was applied because the excitation source used here was ambient sunlight [13].

4. Results and Discussion

4.1. Comparison with Known Methods

The triliquid data is a designed set of three polar liquids with volume fractions that varied in 25 v/v % increments. Because this dataset had known chemical composition and minimal contributions from background, it was used for assessing the BTEM estimation of pure spectral components and the accuracy of quantification for each known component in the mixture.

The pure component spectra for water, acetic acid, and methanol were estimated from the mixture spectra using band targeting and entropy minimization. The pure component spectral estimates obtained from BTEM were subjected to a second-order Savitzky-Golay smoothing filter to remove noise artifacts, as can be seen in Figure 1. The parameters of the Savitzky-Golay filter were manually tuned until the spectra resembled smooth line shapes. Procrustes distance analysis was employed to compare the shape similarities of the resolved pure component spectra and the experimentally obtained pure component spectra after rescaling the smoothed spectral estimates by their maximum value.
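The smoothing and rescaling steps described above can be sketched with a small least-squares implementation of a second-order Savitzky-Golay filter. This is a minimal sketch under stated assumptions: the window length of 11 points is hypothetical (the text states only that the parameters were tuned manually), the ends of the spectrum are handled by edge padding, and the noisy band stands in for a BTEM estimate. The Procrustes comparison itself is not reproduced here.

```python
import numpy as np

def savgol_smooth(y, window=11, order=2):
    """Savitzky-Golay smoothing: local polynomial least-squares fit evaluated at each window center."""
    half = window // 2
    offsets = np.arange(-half, half + 1)
    J = np.vander(offsets, order + 1, increasing=True)   # columns: 1, x, x^2, ...
    coeffs = np.linalg.pinv(J)[0]                        # weights that give the fitted value at x = 0
    padded = np.pad(y, half, mode="edge")                # repeat end values so output length matches
    return np.convolve(padded, coeffs[::-1], mode="valid")

# Hypothetical noisy spectral estimate.
x = np.linspace(0.0, 1.0, 200)
rng = np.random.default_rng(5)
noisy = np.exp(-0.5 * ((x - 0.5) / 0.1) ** 2) + 0.02 * rng.standard_normal(x.size)

smoothed = savgol_smooth(noisy)
rescaled = smoothed / smoothed.max()                     # rescale by the maximum value
```

A second-order filter reproduces any locally quadratic signal exactly away from the ends, which is why it removes noise while leaving broad band shapes largely intact.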
The Procrustes distances from the Savitzky-Golay smoothed BTEM estimated pure component spectra to the experimentally obtained pure spectra of methanol, acetic acid, and water were 0.758, 1.32, and 1.66 rescaled absorbance units, respectively.

Figure 1: Rescaled experimental and smoothed BTEM estimates of acetic acid (left), methanol (center), and water (right) spectra.

By the Procrustes distance metric, the smoothed, BTEM-recovered methanol spectrum had a shape that was most similar to its experimental spectrum. The BTEM methanol estimate was roughly two-fold more similar to its experimental spectrum than the acetic acid and water estimates were to theirs. The reason may have been methodological, but it may also have been a result of the chemistry of the mixtures studied. In the resolved components for both acetic acid and water, there were relatively large distances from the experimental spectra at small wavelengths that may have been artifacts of the BTEM curve resolution method. It is known that shifts in the vibrational energy needed to excite the transitions in acetic acid can be brought on by the interaction between polar components of mixtures, by dimerization, and by proton dissociation equilibria [29]. Because of these chemical interactions, the BTEM estimates of acetic acid and those of water are unlikely to match the experimentally obtained pure component spectra.

Some evidence suggesting that different or shifted species existed for acetic acid was the presence of a band at approximately 2300 nm in the BTEM estimated pure component water spectrum. This claim was assessed by performing principal components analysis on the data in the 2250 - 2300 nm range and a second PCA excluding that range, using only 3 principal components and no sample replicates. Ideal simplex experiments form linear simplices in the scores space of a PCA [28].
Figure 2 demonstrates that the PCA formed by using only the spectral information from 2250 - 2300 nm was more nonlinear than that obtained from the remaining wavelengths. The scores of the first three principal components of the band of interest had a coefficient of variation for the nearest neighbor distances of 235%. The coefficient of variation obtained from the wavelengths without the 2250 - 2300 nm band was only 163%. This indicated that the spacing between the points of the simplices in the scores space was less dispersed, and therefore more linear. Because the experimental design should be linear at rank 3, it is likely that the band is representative of an interaction effect, but it could also be explained as an artifact of a peak in acetic acid or methanol, such as a CH combination band, that covaried with the bands attributed to water.

Figure 2: Scores plots of the first two principal components of the triliquid data without the absorbance measurements obtained at 2250 - 2300 nm (left), and of only the absorbance measurements obtained in the region of 2250 - 2300 nm (right).

Due to potential interactions between the component species in the triliquid mixture and possible methodological errors present from BTEM curve resolution, a detailed comparison between experimentally obtained and curve resolved pure component spectra could not be made. Our intended usage of the pure component spectral estimates obtained from BTEM was semiquantitative analysis. This dataset featured known concentration values so that quantitative efficacy could be explored in a way similar to that done with typical calibration models. The predictive error was calculated for each component by range scaling [30] the predictions between 0-100% and comparing them with the known v/v % amounts.
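The dispersion statistic used above to compare the two score plots, the coefficient of variation of nearest neighbor distances, can be sketched as follows. The details are assumptions made for illustration: scores are computed from mean-centered data via the SVD, distances are Euclidean, and the population standard deviation is used; the function names are hypothetical and the text does not specify these choices.

```python
import numpy as np

def pca_scores(A, n_components=3):
    """PCA scores (U scaled by singular values) of mean-centered data."""
    U, s, _ = np.linalg.svd(A - A.mean(axis=0), full_matrices=False)
    return U[:, :n_components] * s[:n_components]

def nn_distance_cv(scores):
    """Coefficient of variation (%) of each point's distance to its nearest neighbor."""
    diff = scores[:, None, :] - scores[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)             # ignore each point's distance to itself
    nearest = dist.min(axis=1)
    return 100.0 * nearest.std() / nearest.mean()

# Evenly spaced design points have identical nearest neighbor distances.
gx, gy = np.meshgrid(np.arange(3.0), np.arange(3.0))
grid = np.column_stack([gx.ravel(), gy.ravel()])
cv_grid = nn_distance_cv(grid)

# Randomly scattered points are more dispersed.
rng = np.random.default_rng(6)
cv_random = nn_distance_cv(pca_scores(rng.standard_normal((20, 50))))
print(cv_grid, cv_random)
```

Under this definition a perfectly regular design gives a value of zero, and larger values indicate less even spacing of the points in score space.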
The root mean squared errors of estimation (RMSE) obtained from CLS and T-PLS regression models built from experimental, BTEM resolved, and second-order Savitzky-Golay smoothed BTEM resolved pure component spectra were compared with results attained from MCR-ALS quantification with three pure components, as shown in Table 1.

Curve Resolution Method | Calibration Method | Acetic Acid (% RMSE) | Methanol (% RMSE) | Water (% RMSE)
None (Experimental) | CLS | 36.43 | 35.31 | 30.28
None (Experimental) | T-PLS | 12.72 | 4.89 | 5.66
BTEM (raw) | CLS | 44.89 | 44.27 | 49.55
BTEM (raw) | T-PLS | 18.24 | 6.19 | 9.78
BTEM (smoothed) | CLS | 47.27 | 45.96 | 48.53
BTEM (smoothed) | T-PLS | 11.73 | 4.69 | 9.75
MCR | ALS | 7.19 | 16.51 | 17.25

Table 1: Percent root mean squared errors of range scaled relative concentration estimates obtained from classical least squares, target partial least squares, and MCR-ALS for the known pure components in the triliquid dataset. Entries in bold text represent the lowest errors of the curve resolved calibrations, and those in italics are the entries which had the second lowest curve resolved errors.

As expected, the CLS regression models of single pure components all produced the largest predictive errors. The failure of CLS to adequately model single components is in agreement with the theoretical understanding that CLS requires full rank regression weights and that individually regressing each component of a three-component mixture gives rise to a rank deficient solution. The T-PLS models that were built from the smoothed BTEM pure component spectra tended to yield the lowest errors of estimation. Although the calibration of curve resolved spectra was the focus of this study, it was interesting that the errors that resulted from T-PLS models that were calibrated using the smoothed, BTEM-resolved spectra for acetic acid and methanol were lower in magnitude than those obtained from the experimentally collected spectra.
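The rank argument for the poor single-component CLS results can be illustrated on synthetic data. The sketch below is not a reproduction of the triliquid experiment: the three overlapping Gaussian bands, the random volume fractions, and the helper names `range_scale` and `rmse_percent` are hypothetical. It compares Equation 6 applied to one spectrum with a CLS fit that includes all three spectra, scoring both with a range-scaled percent RMSE.

```python
import numpy as np

def gaussian_band(x, center, width=0.08):
    return np.exp(-0.5 * ((x - center) / width) ** 2)

def range_scale(v):
    """Scale a vector linearly onto the interval [0, 1]."""
    return (v - v.min()) / (v.max() - v.min())

def rmse_percent(estimate, truth):
    """Percent RMSE between range-scaled estimates and range-scaled known amounts."""
    return 100.0 * np.sqrt(np.mean((range_scale(estimate) - range_scale(truth)) ** 2))

x = np.linspace(0.0, 1.0, 200)
S = np.vstack([gaussian_band(x, 0.35), gaussian_band(x, 0.50), gaussian_band(x, 0.80)])
rng = np.random.default_rng(1)
C = rng.dirichlet(np.ones(3), size=15)      # hypothetical volume fractions, rows sum to one
A = C @ S                                   # noise-free mixture spectra

a_hat = S[1]                                # spectrum of the single targeted component
c_single = A @ a_hat / (a_hat @ a_hat)      # Equation 6 with one spectrum only
c_full = (A @ np.linalg.pinv(S))[:, 1]      # CLS with all three spectra in the model

rmse_single = rmse_percent(c_single, C[:, 1])
rmse_full = rmse_percent(c_full, C[:, 1])
print(rmse_single, rmse_full)
```

The single-spectrum estimate absorbs the overlapping contributions of the other two bands, whereas the full model separates them; this is the situation in which a method that tolerates unmodeled components becomes useful.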
The finding that these curve resolved components had lower errors of estimation than those from experimentally obtained pure spectra supports our claim of chemical interactions in the triliquid mixture.

In this experiment, the target partial least squares models built from the three smoothed BTEM pure component spectra had errors that were lower than or nearly commensurate with those from the three-component MCR-ALS model. These MCR-ALS models were built using spectral non-negativity and closure constraints. Many more varieties of MCR-ALS models were investigated; details of these are omitted from this report, but in our experiments, no three component MCR-ALS model had lower predictive errors for methanol and water than those from BTEM and T-PLS.

MCR-ALS models with 1, 2, 4, and 5 pure components were also built using spectral non-negativity and closure constraints so that an assessment of the variability of the MCR-ALS models could be undertaken. The errors of estimation for these experiments are displayed in Table 2. The one component MCR-ALS model only resolved the water spectrum, likely because the NIR water bands are of greatest intensity in these spectra. It is common for curve resolution methods to resolve the component which has the greatest intensity first [6]. On the triliquid dataset, it was possible to manually recover individual spectra of pure components with BTEM regardless of a component's relative intensity, provided the experimental spectra were band targeted appropriately. The two component MCR-ALS model featured a pure component spectrum with bands that were present in both the experimentally obtained acetic acid and methanol spectra; thus, the errors of prediction for both analytes were assessed using that one spectrum. The two component MCR-ALS model was unable to exclusively recover the spectra of either methanol or acetic acid, although a 3 component MCR-ALS model was able to do so.
For this dataset, the Scree plot indicated that there were three or four pure components in the mixture. In the case of three components, BTEM and T-PLS calibration resulted in lower predictive errors for water and methanol, but not for acetic acid, as stated previously. The four component MCR-ALS model, however, had lower errors of estimation for water and methanol, but not for acetic acid. The error of estimation for methanol from the four component MCR-ALS model differed by only 10% from that of the T-PLS calibrated BTEM estimate. The variability between estimation errors obtained with MCR-ALS by choosing 3 or 4 components in the mixture resulted in a difference of 10.06% RMSE for acetic acid. The difference in quantitative efficacy between both models with the selection of a single parameter is noteworthy. Overall, the MCR-ALS models with three or four components resulted in errors that were similar to those obtained from the BTEM and T-PLS modeling.

Pure Components (#) | Acetic Acid (% RMSE) | Methanol (% RMSE) | Water (% RMSE)
1 | N/A | N/A | 20.79
2 | 43.66 | 44.97 | 8.50
4 | 15.82 | 4.20 | 7.68
5 | 17.24 | 6.41 | 9.55

Table 2: Percent root mean squared errors of range scaled relative concentration estimates obtained from MCR-ALS with 1, 2, 4, and 5 estimated components in the triliquid dataset. Entries denoted by N/A indicate that concentration estimates could not be obtained using MCR-ALS.

The similarities between the errors obtained from the best MCR-ALS models (3 or 4 components) and those obtained from T-PLS calibrated BTEM estimates were interesting because these two approaches define very different curve resolution experiments and have different methods of operation. Modeling with BTEM and T-PLS involves resolving a single pure component spectrum followed by the quantification of only that component. This analysis is performed independently of other components, while MCR-ALS simultaneously decomposes all components.
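The Scree-plot heuristic for choosing the number of MCR-ALS components can be sketched by inspecting the singular values of the mixture matrix. The example below uses hypothetical synthetic data with three components and a small amount of noise; the cutoff of 1% of the largest singular value is an arbitrary illustrative threshold, not a rule taken from this work.

```python
import numpy as np

rng = np.random.default_rng(2)
C = rng.uniform(0.0, 1.0, size=(30, 3))                      # hypothetical concentrations
S = rng.uniform(0.0, 1.0, size=(3, 100))                     # hypothetical pure spectra
A = C @ S + 1e-4 * rng.standard_normal((30, 100))            # three-component data with small noise

s = np.linalg.svd(A, compute_uv=False)                       # singular values, largest first
explained = s ** 2 / np.sum(s ** 2)                          # fraction of variance per component
n_components = int(np.sum(s > 1e-2 * s[0]))                  # count values above the assumed cutoff

print(np.round(explained[:5], 6), n_components)
```

With real spectra the drop after the last chemically meaningful component is rarely this sharp, which is consistent with the Scree plot for the triliquid data suggesting either three or four components.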
For experiments where a training set with known pure components and concentrations is available, such as the triliquid data, finding the suitable number of components in an MCR-ALS model is usually a simple task. However, there are many experiments where the recovery and quantification of only a specific pure component, if it exists, in a mixture is the goal. Two studies where BTEM and T-PLS modeling can be used in conjunction to estimate relative amounts of one unknown chemical component in a mixture are presented below.

4.2. Analysis of Contaminant in Milk Powder

The first study concerns a quality control process, where the goal of the analysis is to assess the purity of a raw material and to rapidly identify contamination present in a sample. The approach is simple; it relies on variation induced by serial dilution. The idea is that newly acquired material of unknown quality can be diluted with pure stock material and analyzed after each dilution. If any components other than the pure stock material are present, band target entropy minimization can be used to estimate their pure component spectra from the changes in their signal intensity across the diluted samples, because the primary sources of variation should be limited to the concentration of the interferent and any artifacts of the measurement itself. To ensure that the contaminant spectrum obtained by BTEM is actually present in the sample matrix and not an artifact of the recovery method, T-PLS may then be used to construct a calibration model for the isolated, pure component spectrum and to reduce background effects present in the dilution measurements [13].

The serial dilution experiment was assessed on a milk powder dataset. Five spectra that had melamine mass fractions of 0.05, 0.1, 0.5, 1, and 2% were sampled.
Band target entropy minimization of the entire spectral range using the first two singular values of the normalized spectral data resulted in the extraction of a pure component spectrum that was visually comparable to that of melamine [31] (Figure 3). The entire spectral region was targeted in this analysis because the nature of the contaminant and its spectral signature were presumed to be unknown, and there were no visible adulterant bands in the NIR spectra.

A T-PLS semi-quantitative calibration using two latent projections created from the second-order Savitzky-Golay smoothed BTEM pure component spectral estimate yielded a 6.59% relative RMSE (0.10 w/w % absolute RMSE) over the same five-sample range. This analysis was performed twice more to assess variability, using different samples that had the same diluted concentrations. A mean relative RMSE of 11% was calculated. The R-squared values obtained from linear fits for the actual versus predicted concentration plots generated from the T-PLS calibrations were all >0.90, which indicated a linear calibration. The linearity of the T-PLS calibrations was lower than what might be expected from an ordinary partial least squares regression, but still strongly supported the claim that the component recovered by BTEM was not a spectral artifact, and was in fact a component that varied proportionally with the dilution process.

MCR-ALS modeling with SIMPLISMA-estimated initial pure component spectra was also applied to the same samples used in the BTEM T-PLS modeling in an attempt to discover the adulterant. In our one-component MCR-ALS analysis, non-negativity constraints were applied, but the only pure component spectral estimate that was obtained was visually similar to the unadulterated milk samples, and not to melamine. Even though the SVD Scree plot indicated only one major component, we investigated a second model using two pure components.
This resulted in two highly similar spectra, both of which were again representative of the unadulterated milk powder. For this dataset, MCR-ALS modeling was unable to find the melamine contaminant or to quantify it. We attribute the failure of the MCR-ALS models to recover the pure components associated with melamine to the relatively low intensity of the contaminant response, and to the scatter present in the reflectance measurements.

Figure 3: The raw and Savitzky-Golay smoothed BTEM estimated pure component spectra obtained from the serially diluted adulterated milk samples (left). An actual versus predicted plot of melamine concentrations obtained from the target partial least squares model using the Savitzky-Golay smoothed spectral estimate (right). The average predicted values are shown with 1 sigma error bars (N = 3) for their scaled predictions.

BTEM pure component spectral estimates obtained from samples that were reported to contain only pure milk powder were employed as a control. The spectra recovered from these experiments were most similar to the unadulterated milk powder spectra themselves, as can be observed in Figure 4. Calibration by T-PLS was not attempted because no other components appeared in the loading plots obtained from the singular values, and the majority of the variance across samples was attributable to noise and scatter effects. This experiment was repeated two more times, and each time no components were found using BTEM other than the raw milk powder spectra. In this case, MCR-ALS modeling provided the same result as BTEM; only spectra resembling the raw milk powder were recovered.

Figure 4: Overlay of five unadulterated milk powder spectra (left). The raw spectra and smoothed band target entropy minimization estimate obtained from the first two singular values of the milk powder spectra (right).

4.3. Analysis of Hyperspectral Images

The second study concerns the spectral recovery and calibration of chemical components obtained from concentration gradients that occur in hyperspectral images. The remote monitoring or imaging of chemicals in the environment is a challenging experimental problem. However, concentration gradients occur naturally from physical processes such as diffusion, advection, and convection. It was hypothesized that band target entropy minimization could resolve pure component spectra from natural gradients in the concentration of a target species in chemical images and that target partial least squares could be used to perform semiquantitative targeted calibrations despite a dynamic background.

The dataset we investigated for the concentration gradient study was the vapor release dataset first reported by Feudale et al. [13]. The vapor release dataset was collected to monitor the release of dimethyl methylphosphonate (DMMP) from a gas stack via mid-infrared frequencies that are associated with phosphonate functional groups. A pure component spectral estimate was resolved from seven of the pixels near the gas stack by applying entropy minimization with three singular values using the entire spectral region. It can be seen from Figure 5 that the extracted pure component spectral response is visually similar to that of a NIST reference spectrum of DMMP [32], which had more than an eight-fold greater spectral resolution than that used to collect the hyperspectral images.

Figure 5: A spectral overlay of the band target entropy minimization pure component spectral estimate from selected pixels in the vapor release data and a NIST reference spectrum for DMMP (top). A z-score normalized hyperspectral image of the vapor release data (bottom left). This normalization set values that were beyond ±2σ of the mean intensity to the mean. This normalization was used only for this display and not for any of the reported analyses. The raw heat map generated from T-PLS calibration of the BTEM pure spectral estimate of DMMP on the hyperspectral image (bottom right).

In a previous study, target partial least squares modeling was shown to be an effective method for calibrating DMMP on this dataset from a separate, experimentally obtained spectrum of pure DMMP [13]. In this study, the pure component spectral estimate obtained via BTEM from the hyperspectral data was utilized as the target spectrum in T-PLS. A T-PLS calibration model was made with six latent projections (100% variance explained in y, and >98% in X) on the entire hyperspectral image using the spectral estimate obtained from BTEM. The results from the calibration are shown in Figure 6. The greatest estimated intensity of the recovered DMMP spectrum was found near the gas stack, similar to what was found in the previous study [13]. A small fraction (14/4096) of pixels appeared to have relatively large calibrated amounts of the estimated DMMP signature but were not located within a 10 pixel radius of the gas stack from which the vapor was released. It was hypothesized that those high intensity pixels were artifacts of random variation because they represented only 0.34% of the hyperspectral image and were not part of the release experiment. To further assess the variability of the methods and those pixels, a resampling study was performed.

Jackknife mean and variance estimations [33] were performed to assess how variable BTEM and T-PLS estimates of DMMP concentration were for the vapor release data. Both the BTEM spectral estimates and the intensity maps obtained by T-PLS were individually range scaled between zero and one [30] so that the semiquantitative T-PLS predictions could be pooled. It was found that the same pixel (X = 35, Y = 34) contained the greatest intensity for the jackknifed DMMP target estimate across all leave-one-out trials, which indicated uniform quantitation across the hold-out trials.
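The leave-one-out jackknife mean and variance can be sketched as below. This is a generic illustration using the standard jackknife variance formula; the `estimator` argument is a hypothetical placeholder for the full BTEM and T-PLS pipeline, which is not reproduced here, and the random "pixel maps" are stand-in data.

```python
import numpy as np


def jackknife(samples, estimator):
    """Leave-one-out jackknife mean and variance of estimator(samples)."""
    samples = np.asarray(samples, dtype=float)
    n = samples.shape[0]
    # Recompute the estimate n times, each time holding out one sample (row).
    held_out = np.array(
        [estimator(np.delete(samples, i, axis=0)) for i in range(n)]
    )
    mean = held_out.mean(axis=0)
    # Standard jackknife variance: (n - 1) / n * sum of squared deviations.
    variance = (n - 1) / n * ((held_out - mean) ** 2).sum(axis=0)
    return mean, variance


# Hypothetical stand-in: seven "pixels", each with a 16-value intensity map.
rng = np.random.default_rng(0)
pixel_maps = rng.random((7, 16))
mean_map, var_map = jackknife(pixel_maps, lambda rows: rows.mean(axis=0))
print(mean_map.shape, var_map.shape)
```

When the estimator returns an array, such as an intensity map, the mean and variance are computed element by element, which gives one mean map and one variance map of the kind discussed in the text.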
More evidence supporting the uniformity of the calibration was found in the fact that the estimated standard deviation heat map showed the least variation in the predictions nearest the gas stack chimney and the most variation in the background.

Interestingly, in the jackknife mean estimate, the fourteen pixels of relatively large intensity believed to be artifacts of random variations were observed to still be relatively high in intensity (Figure 7). However, those pixels also tended to have large jackknife variance estimates (relative to those pixels nearest the chimney), a result which further supported the notion that those pixels were artifacts in the data and did not contain the target signature. This finding was congruent with an earlier T-PLS analysis on the same dataset that used an experimentally obtained target DMMP spectrum. That analysis demonstrated, using a Kolmogorov-Smirnov test, that it was statistically unlikely that the pixels located away from the chimney were from the same probability distribution as the pixels near the chimney [13].

Although MCR-ALS has been shown to be a valuable tool for the analysis of hyperspectral images when the components of interest are well represented and relatively high in intensity [15], this dataset posed a challenge. This dataset features many contributions to variance because the spectral background depended upon the spatial location and because DMMP is a relatively minor component. MCR-ALS with SIMPLISMA initial estimates failed to recover the target DMMP spectrum using both the seven pixels selected for BTEM and the entire hyperspectral image.

Figure 6: The leave-one-out jackknife estimated mean (left) and standard deviation (right) intensity map estimates obtained by T-PLS and BTEM on the vapor release data.

5. Conclusion

The results from this study demonstrated that the errors of estimation obtained from target partial least squares models built by using band target entropy minimization to extract pure component spectra were lower than those from classical least squares and were similar to those from multivariate curve resolution alternating least squares for at least one dataset. It was also shown that BTEM and T-PLS can be used together to identify minor contaminant species and calibrate their presence by a simple serial dilution experiment despite scatter and noise in the measurements. Because the only major requirement for BTEM is that the signal of the species of interest varies across samples, it was also demonstrated that it is possible to extract a spectral estimate of a minor component present in the advection of vapor released from a gas stack. It was also shown that T-PLS could be used to quantify the estimated pure component spectra of the vapors present across a hyperspectral image despite the presence of spatially varying spectral fluctuations in an unknown background.

We believe that MCR-ALS failed to estimate pure spectral components in the two mixtures due to insufficient representation of all components. The milk powder spectra appeared to be affected by scattered light and instrumental noise. Although MCR-ALS was capable of obtaining an estimate for the reference milk powder spectra, we were unable to estimate the melamine contaminant, despite the variation in its concentration present in the samples. The vapor release dataset had unknown background effects, arising from the size and emissivity of the physical area that was measured and from differing temperatures, that persisted even after attempting to band target the vapor plume. In these cases, it was advantageous to obtain estimates for pure component spectra in a way that did not depend on successfully finding the other components present in the spectra or their respective concentration profiles.
Even though the mixture components were too poorly represented for MCR-ALS to obtain estimates for them, BTEM and T-PLS could be used to successfully model pure spectral estimates and to calibrate their presence.

The combination of BTEM and T-PLS appears to be a useful one. Very few tools and strategies exist for both qualitatively and quantitatively extracting information about minor components without pure component spectra or property values. We hope that more studies which demonstrate the utility of these tools, and their variations, for hyperspectral imaging and quality control applications are performed.

6. Acknowledgments

This work was supported by the United States National Science Foundation grant 1506853.

7. Conflict of Interest

The authors declare no conflict of interest.

8. References

[1] Lawton, W. H. Sylvestre, E. A. Self modeling curve resolution. Technometrics. 13, 3, (1971), 617-633.
[2] Maeder, M. Evolving factor analysis for the resolution of overlapping chromatographic peaks. Anal. Chem. 59, 3, (1987), 527-530.
[3] Keller, H. R. Massart, D. L. Evolving factor analysis. Chemometrics and Intelligent Laboratory Systems. 12, 3, (1991), 209-224.
[4] Malinowski, E. R. Window factor analysis: theoretical derivation and application to flow injection analysis data. Journal of Chemometrics. 6, 1, (1992), 29-40.
[5] Zeng, Y. Liang, Kvalheim, O. M. Keller, H. R. Massart, D. L. Kiechle, P. Erni, F. Heuristic evolving latent projections: resolving two-way multicomponent data 2. Detection and resolution of minor constituents. Analytical Chemistry. 64, 8, (1992), 946-953.
[6] Vandeginste, B. G. M. Derks, W. Kateman, G. Multicomponent self-modelling curve resolution in high-performance liquid chromatography by iterative target transformation analysis. Analytica Chimica Acta. 173, (1985), 253-264.
[7] Windig, W. Antalek, B. Lippert, J. L. Batonneau, Y. Brémard, C.
Combined Use of Conventional and Second-Derivative Data in the SIMPLISMA Self-Modeling Mixture Analysis Approach. Analytical Chemistry. 74, 6, (2002), 1371-1379.
[8] Tauler, R. Izquierdo-Ridorsa, A. Casassas, E. Simultaneous analysis of several spectroscopic titrations with self-modelling curve resolution. Chemometrics and Intelligent Laboratory Systems. 18, 3, (1993), 293-
[9] Shen, H. Stordrange, L. Manne, R. Kvalheim, O. M. Liang, Y. The morphological score and its application to chemical rank determination. Chemometrics and Intelligent Laboratory Systems. 51, 1, (2000), 37-47.
[10] Chew, W. Widjaja, E. Garland, M. Band-target entropy minimization (BTEM): an advanced method for recovering unknown pure component spectra. Application to the FTIR Spectra of Unstable Organometallic Mixtures. Organometallics. 21, 9, (2002), 1982-1990.
[11] Widjaja, E. Li, C. Chew, W. Garland, M. Band-Target Entropy Minimization. A robust algorithm for pure component spectral recovery. Application to Complex Randomized Mixtures of Six Components. Analytical Chemistry. 75, 17, (2003), 4499-4507.
[12] Geladi, P. Kowalski, B. R. Partial least-squares regression: a tutorial. Analytica Chimica Acta. 185, 1, (1986), 1-17.
[13] Feudale, R. N. Brown, S. D. An inverse model for target detection. Chemometrics and Intelligent Laboratory Systems. 77, 1-2, (2005), 75-
[14] Shariati-Rad, M. Hasani, M. Application of multivariate curve resolution-alternating least squares (MCR-ALS) for secondary structure resolving of proteins. Biochimie. 91, 7, (2009), 850-856.
[15] Colares, C. J. G. Pastore, T. C. M. Coradin, V. T. R. Marques, L. F. Moreira, A. C. O. Alexandrino, G. L. Poppi, R. J. Braga, J. W. B. Near infrared hyperspectral imaging and MCR-ALS applied for mapping chemical composition of the wood specie Swietenia Macrophylla King (Mahogany) at microscopic level. Microchemical Journal. 124 (2016), 356-
[16] Juan, A. D. Tauler, R.
Multivariate curve resolution (MCR) from 2000: Progress in concepts and applications. Critical Reviews in Analytical Chemistry. 36, 3, (2006), 163-176.
[17] Tauler, R. Lacorte, S. Barceló, D. Application of multivariate self-modeling curve resolution to the quantitation of trace levels of organophosphorus pesticides in natural waters from interlaboratory studies. Journal of Chromatography A. 730, 1-2, (1996), 177-183.
[18] Li, Q. Tang, Y. Yan, Z. Zhang, P. Identification of trace additives in polymer materials by attenuated total reflection Fourier transform infrared mapping coupled with multivariate curve resolution. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy. 180, (2017), 1386-1425.
[19] Richards, S. E. Becker, E. Tauler, R. Walmsley, A. A novel approach to the quantification of industrial mixtures from the Vinyl Acetate Monomer (VAM) process using Near Infrared spectroscopic data and a Quantitative Self Modeling Curve Resolution (SMCR) methodology. Chemometrics and Intelligent Laboratory Systems. 94, 1, (2008), 9-18.
[20] Tauler, R. Smilde, A. Kowalski, B. Selectivity, local rank, three-way data analysis and ambiguity in multivariate curve resolution. Journal of Chemometrics. 9, 1, (1995), 31-58.
[21] Motegi, H. Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): A Data-Driven Approach across Metabolic Processes. Scientific Reports. 5 (2015), 1-12.
[22] Cattell, R. B. The Scree test for the number of factors. Multivariate Behavioral Research. 1, 2, (1966), 245-276.
[23] Abdollahi, H. Tauler, R. Uniqueness and rotation ambiguities in Multivariate Curve Resolution methods. Chemometrics and Intelligent Laboratory Systems. 108, 2, (2011), 100-111.
[24] Jaumot, J. Gargallo, R. Juan, A. D. Tauler, R. A graphical user-friendly interface for MCR-ALS: a new tool for multivariate curve resolution in MATLAB. Chemometrics and Intelligent Laboratory Systems. 76, 1, (2005), 101-110.
[25] Richards, S. E. Walmsley, A. Quantitative iterative target transformation factor analysis. Journal of Chemometrics. 22, 1, (2008), 63-80.
[26] Wold, S. Sjöström, M. Eriksson, L. PLS-regression: a basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems. 58, 2, (2001), 109-130.
[27] Haaland, D. M. Thomas, E. V. Partial least-squares methods for spectral analyses. 1. Relation to other quantitative calibration methods and the extraction of qualitative information. Analytical Chemistry. 60, 11, (1988), 1193-1202.
[28] Mark, H. Chemometrics in near-infrared spectroscopy. Analytica Chimica Acta. 223, (1989), 75-93.
[29] Nakabayashi, T. Nishi, N. States of Molecular Associates in Binary Mixtures of Acetic Acid with Protic and Aprotic Polar Solvents: A Raman Spectroscopic Study. The Journal of Physical Chemistry A. 106, 14, (2002), 3491-3500.
[30] Sharaf, M. A. Illman, D. L. Kowalski, B. R. Chemometrics. Wiley-Interscience: New York, NY. (1986), 193.
[31] Baeten, V. Dardenne, P. NIR-based detection of contaminants in food and feed. Broadening Horizons N35, November 2016. URL: https://www.feedipedia.org/content/nir-based-detection-contaminants-food-and-feed (accessed September 10, 2017)
[32] Stein, S. E. NIST Mass Spec Data Center, "Infrared Spectra" in NIST Chemistry WebBook, NIST Standard Reference Database Number 69, Eds. P. J. Linstrom and W. G. Mallard, National Institute of Standards and Technology, Gaithersburg MD, 1970, (retrieved January 5, 2018).
[33] Efron, B. Stein, C. The jackknife estimate of variance. The Annals of Statistics. 9, 3, (1981), 586-596.

Statistics arXiv (Cornell University)

Band Target Entropy Minimization and Target Partial Least Squares for Spectral Recovery and Calibration

Statistics, Volume 2018 (1802) – Feb 11, 2018

ISSN
0003-2670
eISSN
ARCH-3347
DOI
10.1016/j.aca.2018.07.054

Abstract

The resolution and calibration of pure spectra of minority components in measurements of chemical mixtures without prior knowledge of the mixture is a challenging problem. In this work, a combination of band target entropy minimization (BTEM) and target partial least squares (T-PLS) was used to obtain estimates for single pure component spectra and to calibrate those estimates in a true, one-at-a-time fashion. This approach allows for minor components to be targeted and their relative amounts estimated in the presence of other varying components in spectral data. The use of T-PLS estimation is an improvement to the BTEM method because it overcomes the need to identify all of the pure components prior to estimation. Estimated amounts from this combination were found to be similar to those obtained from a standard method, multivariate curve resolution-alternating least squares (MCR-ALS), on a simple, three component mixture dataset. Studies from two experimental datasets demonstrate where the combination of BTEM and T-PLS could model the pure component spectra and obtain concentration profiles of minor components but MCR-ALS could not.

Keywords: band target entropy minimization, recovery, target partial least squares

sdb@udel.edu Preprint submitted to ArXiv March 28, 2018 arXiv:1802.03839v2 [stat.ML] 27 Mar 2018

1. Introduction

Resolving poorly represented low intensity pure component spectra and concentration profiles from chemical mixtures without a priori knowledge is an open field of research.
Provided that varying amounts of the target compound are present in the data matrix, many methods, such as self modeling curve resolution [1], evolving factor analysis [2, 3], window factor analysis [4], heuristic evolving latent projections [5], iterative target transformation factor analysis [6], simple-to-use interactive self-modeling mixture analysis (SIMPLISMA) [7], and the standard multivariate curve resolution-alternating least squares (MCR-ALS) [8] have been used to resolve the spectral signatures of major contributors from mixture data. However, when the component of interest is present at relatively low levels, the resolution of that component by these methods is problematic because its spectral response can be lost in the contributions of noise [9] or its variation can be small enough relative to those of the other contributions that a given method will fail to resolve it from the larger components [6].

Band target entropy minimization (BTEM) takes a different approach to the problem of spectral signature recovery. Band target entropy minimization estimates a single, pure component spectrum from a linear combination of weighted loadings obtained by principal component analysis [10]. This approach allows orthogonal contributions of variance to be combined to form a spectral estimate. One potential advantage of BTEM is that the technique does not require that all components in a mixture have adequate variation in the samples, as long as the single target for resolution is represented in the PCA loadings [10, 11]. This advantage cannot be attributed to many other methods because most methods attempt to simultaneously discover and resolve the spectral signatures of every component in a mixture [1, 6, 7, 8].
Although BTEM has been used for the recovery of individual components in a mixture, in order to obtain estimates of relative amounts, the resulting pure component spectral estimates have been used as regression coefficients in classical least squares (CLS) models [10, 11]. This approach poses a problem because classical least squares regression requires that all pure component responses embedded in a mixture be included in the regression model. If this condition is not met, the concentration estimate matrix that results from CLS has insufficient rank and cannot be used to discriminate the additive effects of all mixture components from a given target compound. Therefore, previous work using BTEM spectral recovery required either the discovery of all pure component responses for the mixture in order to obtain their approximate amounts, or that a regression be performed on spectra which have one component or on regions where only the available pure component spectra contribute. In many cases, an analyst may only be interested in the relative amount of one component for quantitation due to the presence of an undersampled or poorly represented background. Similarly, to characterize an unknown, minor component of a mixture, the entire spectral range may need to be considered. Thus, the potential advantage of being able to recover single component spectra from BTEM has not been utilized in a way that allows for determining the relative amounts of single components.

It is well known that calibration methods such as partial least squares or principal component regression [12] can overcome issues associated with rank deficiencies that arise when the responses for all components of a mixture are not fully known. However, in curve resolution experiments, prior knowledge of property values is not available. Thus, the usual regression methods cannot be used because only the target spectrum and the spectral data are available.
To avoid the problems associated with conventional regression methods, target partial least squares (T-PLS) [13] can be used for creating regression models built from single, pure component spectral estimates obtained by BTEM from near-infrared and infrared spectra. Target partial least squares has been shown to take advantage of the partial least squares regression framework in a way that uses pure component spectra as a property vector to estimate relative amounts of minor components and avoid contributions from background [13]. The quantitative efficacy of this combination can be compared with that of MCR-ALS, a standard technique for modeling pure component spectra and concentration profiles from infrared spectra [14], hyperspectral images [15], and a variety of other applications [16]. MCR-ALS was selected as a base of comparison for this study because it has been presented for the quantitation of trace components in mixtures [17, 18].

MCR-ALS and other curve resolution methods have not been demonstrated to work well in resolving components with relatively small signatures or components in mixtures that have poorly represented background contributions [6, 18]. Components that have low concentrations can have relatively large signals due to their instrumental responses. Previous MCR-ALS studies related to analytes at low concentrations did not address the case where the low concentration components also had low signals for quantitation [17, 18]. In some cases, MCR-ALS estimates cannot be obtained because collecting data with sufficient independent variation of each component in a given mixture is not possible due to a lack of experimental control [19]. Components that suffer from these conditions are said to have poor representation because they cannot be adequately sampled for resolution.
A primary goal of this study is to investigate the efficacy of BTEM and T-PLS for resolving spectra and obtaining semiquantitative estimates under conditions where MCR-ALS estimates cannot be readily obtained.

This study first shows that T-PLS can be used to construct models that are more accurate than individual CLS estimates obtained from BTEM recovered signals, and that these models can be competitive with those from MCR-ALS on a dataset where the responses of the components were each relatively large and where there were minimal background effects. Then, the utility of BTEM and T-PLS for obtaining pure component estimates is investigated for situations where the spectral signatures of the pure components are low in magnitude and where sufficient representations of every analyte are difficult to attain. The efficacy of the hybrid method under these more challenging conditions is demonstrated with two experimental datasets in which BTEM and T-PLS succeeded at modeling the pure components known to be present in those mixtures, but MCR-ALS failed.

2. Theory

2.1. Multivariate Curve Resolution Alternating Least Squares

The goal of multivariate curve resolution (MCR) is to obtain estimates for the pure component spectra ($S$) and the respective relative concentration profiles ($C$) for the components of an unknown mixture. Because this study focuses on spectroscopic responses obtained from chemical mixtures, the relationship between the matrix of measured absorbances ($A$) and the pure components can be considered as an application of the Beer-Lambert-Bouguer law, where $A = CS^{T}$ and the superscript $T$ denotes the transpose. Given some absorbance matrix $A$, MCR can be used to model candidate solutions for $C$ and $S$. Due to inherent ambiguities in posing this inverse problem, however, there is often not an exact solution for $C$ or $S$ [20]. Instead, practitioners seek chemically plausible solutions.

Multivariate curve resolution alternating least squares has been shown to provide, in many instances, satisfactory empirical estimates for both the spectra of pure components and their respective concentration profiles [8, 16]. Many variations of MCR-ALS have been presented in the literature. One commonality shared across all of them is that all estimated pure components present in a mixture are modeled simultaneously.

Alternating least squares solves the inverse Beer-Lambert-Bouguer problem through iterative least squares projections of both undetermined matrices onto one another,

$$C = A(S^{T})^{-1} \qquad (1)$$

$$S^{T} = C^{-1}A \qquad (2)$$

Ordinary matrix inverses of $S$ and $C$ often do not exist because of rank deficiencies in the spectral data. As a way to avoid rank-related difficulties, the Moore-Penrose pseudoinverse is commonly employed [8].

Because MCR-ALS decomposes the spectral responses in the $A$ matrix based on all of the plausible information in $C$ and $S$, an important factor in building an effective MCR-ALS model is the selection of an appropriate number of pure components. Without an adequate estimate of the number of components in the mixture, the pure component spectra and concentration profiles often become unreliable [21]. Domain knowledge can be used to provide reasonable estimates for the number of pure components. When domain knowledge is not available, the number of components for an MCR-ALS model can be selected heuristically by a Scree plot of the eigenvalues that result from the singular value decomposition of the mixture matrix $A$ [22].

Even if an appropriate number of pure component estimates has been selected, ambiguous solutions may still be obtained from the nonconvex optimization of $C$ and $S$ [23]. Many schemes have been employed to constrain the alternating least squares solutions to chemically plausible regions. The most commonly used constraints are those based on physical laws such as nonnegative concentrations and mass balance.
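The alternating updates of equations 1 and 2 can be illustrated with a minimal NumPy sketch. It assumes the convention that the rows of the absorbance matrix are samples, uses the Moore-Penrose pseudoinverse in place of the inverses, and imposes non-negativity by simply clipping negative values after each update, which is a crude stand-in for a proper non-negative least squares step. The function name, the synthetic Gaussian bands, and the choice of initial estimates are all hypothetical; closure and other constraints are omitted.

```python
import numpy as np


def mcr_als(A, St_init, n_iter=100):
    """Alternate least squares updates of C and S^T for the model A = C S^T.

    A       : (samples x channels) mixture spectra
    St_init : (components x channels) initial pure spectra estimate
    """
    St = np.clip(np.asarray(St_init, dtype=float), 0.0, None)
    for _ in range(n_iter):
        # Equation 1 with a pseudoinverse, then a crude non-negativity clip.
        C = np.clip(A @ np.linalg.pinv(St), 0.0, None)
        # Equation 2 with a pseudoinverse, then the same clip on the spectra.
        St = np.clip(np.linalg.pinv(C) @ A, 0.0, None)
    return C, St


# Synthetic two-component mixture (hypothetical data, not from the paper).
x = np.linspace(0.0, 1.0, 50)
St_true = np.vstack([np.exp(-((x - 0.3) / 0.05) ** 2),
                     np.exp(-((x - 0.7) / 0.08) ** 2)])
C_true = np.array([[1.0, 0.1], [0.7, 0.4], [0.5, 0.5], [0.2, 0.9], [0.0, 1.0]])
A = C_true @ St_true

# Start from two of the mixture spectra as rough initial estimates.
C_est, St_est = mcr_als(A, A[[0, 4]])
print("relative residual:", np.linalg.norm(A - C_est @ St_est) / np.linalg.norm(A))
```

Note that the number of rows in the initial estimate fixes the number of components for the whole model, which is why the choice of that number matters so much in practice.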
Constraints based on domain knowledge such as unimodality have also been applied [8]. Perhaps the simplest way to reduce ambiguity in Equations 1 and 2 is by selecting regions of interest in the mixture spectra, a method sometimes referred to as band targeting [10].

Starting from suitable initial estimates for $C$ and $S$ is another common way to obtain useful MCR-ALS models. Evolving factor analysis and SIMPLISMA are two curve resolution methods that have been used to provide reasonable initial estimates of the pure component spectra in matrix $S$ [24]. Many of the other curve resolution methods are now commonly used to provide starting estimates for iterative algorithms like MCR-ALS rather than as stand-alone techniques [19, 25, 24].

2.2. Band Target Entropy Minimization

Band target entropy minimization is another spectral recovery technique that may be used to model estimated pure spectral components, but unlike MCR-ALS, it does not simultaneously estimate concentration profiles. BTEM is used to resolve pure component spectra from mixtures through the use of singular value decomposition (SVD) of the $A$ matrix. Singular value decomposition changes the basis of a matrix such that the mutually orthogonal axes that contain the most variance are contained in the loadings matrix ($V$), the square magnitude of the variance explained by the loadings is contained in the diagonal entries of $S$, and the scores matrix ($U$) is populated with the projections of each observation vector in $A$ onto the loading axes.
$$A = USV \quad (3)$$

BTEM utilizes an optimization technique, typically simulated annealing, to find an estimated spectrum vector ($\hat{a}$) whose normalized first (or higher order) derivative with respect to wavelength ($\lambda$) has minimal Shannon entropy ($H$), which is defined for any probability value ($p$) as follows,

$$H(p) = -p \log(p) \quad (4)$$

The use of Shannon entropy in the calculation of the BTEM objective function is not rigorously founded because derivatives of spectra are not stochastic vectors following the laws of probability. However, by normalizing the entries $|d\hat{a}/d\lambda|$ by their maximum value, the spectral derivatives are constrained between zero and one, similar to a probability vector for a stochastic process. The core idea behind the use of the entropy argument in BTEM is that it allows for the recovery of a spectrum that is minimal in differential information through a projection of the loadings. Those spectra are attained by minimizing the objective function ($O$), where $O = \arg\min H(|d\hat{a}/d\lambda|)$, and where $\hat{a}$ is defined as

$$\hat{a} = t S_{trunc} V_{trunc} \quad (5)$$

In Equation 5, $t$ is the vector obtained from the optimization, and $S_{trunc}$ and $V_{trunc}$ are the singular values and their respective loadings that contain a component of interest. The singular values included in the BTEM model are usually decided upon by band-targeting spectral regions from loading plots obtained by SVD that appear to contain a pure component [10]. In our experiments, however, using the entire spectrum rather than band targeting select regions was often equally effective ($S = S_{trunc}$ and $V = V_{trunc}$). Typically, the estimated pure spectral response $\hat{a}$ is constrained to be non-negative based on heuristic rules. The computational aspects and suggestions for implementation of the nonnegativity constraints used in BTEM can be found in greater detail in [10].

In essence, BTEM is used to find a linear combination of loadings obtained from SVD that gives a smooth, simple representation.
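As an illustration of Equations 4 and 5, the sketch below evaluates an entropy objective for one candidate weight vector $t$. It rests on several assumptions that are not stated in the text: the derivative is approximated by first differences, the per-channel entropies of Equation 4 are summed into a single score, and the non-negativity penalty and the simulated annealing search are omitted. The name `btem_objective` is hypothetical.

```python
import numpy as np

def btem_objective(t, S_trunc, V_trunc):
    """Entropy score of a candidate spectrum a_hat = t S_trunc V_trunc.

    t       : (k,) weights proposed by the optimizer
    S_trunc : (k, k) diagonal matrix of retained singular values
    V_trunc : (k, n_wavelengths) retained loadings, stored as rows
    """
    a_hat = t @ S_trunc @ V_trunc          # Equation 5
    d = np.abs(np.diff(a_hat))             # finite-difference |d a_hat / d lambda|
    if d.max() == 0:
        return 0.0                         # flat spectrum: no derivative information
    p = d / d.max()                        # entries scaled into [0, 1]
    p = p[p > 0]                           # treat 0 * log(0) as 0
    return float(-(p * np.log(p)).sum())   # Equation 4, summed over channels (assumed)
```

With NumPy, the inputs could be prepared as `U, s, V = np.linalg.svd(A, full_matrices=False)`, followed by `S_trunc = np.diag(s[:k])` and `V_trunc = V[:k]`, after which any general-purpose optimizer can search over `t`.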
Such simple representations have been previously shown to be effective estimates of the spectral responses of pure components in mixtures [10, 11]. The most notable limitation to BTEM curve resolution is that the algorithm requires that each pure analyte must vary independently enough relative to the other components in the mixture for it to be reasonably represented by linear combinations of the loadings matrix obtained from the singular value decomposition [10].

Garland et al. quantified the pure component spectral estimates obtained from BTEM by using the classical least squares regression framework [10]. The equation for the relative quantification of BTEM recovered spectra is

$$C = A\hat{a}^T(\hat{a}\hat{a}^T)^{-1} \quad (6)$$

Equation 6 is equivalent to replacing the regression weights in a typical classical least squares model with a pure component spectral estimate. Typically, classical least squares modeling is used to relate multivariate instrumental responses to linear changes in the amount of the pure component spectra, but, for CLS to give rank sufficient solutions, the spectra of all pure components present in the mixture must be included in the model. Equation 6 has been used in earlier work [10] on a single recovered spectrum, or a vector, which implies that the equation's intended use is for pure spectral regions. However, if the response of a component in a mixture does not have a pure region, as is common in near infrared spectroscopy, there is no direct extension of Equation 6 to use on multiple recovered spectra. Users of BTEM are limited to the quantification of each component one-at-a-time, knowing that rank arguments are not satisfied. This requirement imposes significant limitations on BTEM quantification.

2.3. Target Partial Least Squares Regression

Calibration of one component in a mixture in the absence of property information is a challenge. An alternative calibration method, target partial least squares regression (T-PLS), first introduced by Feudale et al.,
allows for the semiquantitation of a single pure analyte spectrum in a mixture of components [13]. Target partial least squares regression employs the same algorithm as conventional partial least squares regression, with a few changes. Partial least squares regression describes the relation between a vector of property values, $y$, and a matrix $X$ through a reduced rank linear model that maximizes the covariance of $X$ and $y$ [26]. The major difference between the conventional PLS and T-PLS algorithms is that the T-PLS algorithm is used to project $y$ into the observation space of $X$, rather than seeking latent features as a result of the projection of $y$ onto the variable space of $X$. The $y$ vector for T-PLS is a pure spectrum of a target component that is hypothesized to be present in a mixture rather than a latent property. In this work, we refer to these projections as latent projections rather than latent variables, although the mathematics are similar. The only adjustment required to allow the nonlinear iterative partial least squares (NIPALS) algorithm to be used for the creation of T-PLS models is that the calculation of the weight matrix ($W$) and all subsequent calculations must be transposed accordingly. The NIPALS algorithm for T-PLS, which follows the same nomenclature as that of traditional PLS [27], is provided in Algorithm 1.

Although T-PLS and PLS models are created from similar algorithms, the interpretations of these models and their validation processes are different [13]. This is because a pure spectrum is used in place of the property vector and the regression coefficient ($m$) may be used to provide relative abundances of the target. The optimal number of latent projections for T-PLS models cannot be obtained from cross-validated results, as is common for most PLS models, because T-PLS is designed for use when property values, such as concentrations, are not known.
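The transposition described above can be sketched by running a standard single-response PLS (PLS1) NIPALS recursion on the transposed data matrix, with the target spectrum taking the place of the property vector. This is a sketch under stated assumptions rather than a transcription of Algorithm 1: no mean-centering or scaling is applied, the inner convergence loop is dropped because a single response needs only one pass per projection, and the name `tpls` is hypothetical.

```python
import numpy as np

def tpls(X, y, n_latent):
    """Illustrative T-PLS: PLS1 NIPALS applied to X.T (hypothetical helper).

    X : (n_samples, n_wavelengths) mixture spectra
    y : (n_wavelengths,) target pure component spectrum
    Returns m : (n_samples,) regression coefficients, one per sample.
    """
    Z = np.array(X, dtype=float).T         # wavelengths act as the observations
    r = np.array(y, dtype=float)           # residual of the target spectrum
    W, P, b = [], [], []
    for _ in range(n_latent):
        w = Z.T @ r
        w = w / np.linalg.norm(w)          # normalized weights
        t = Z @ w                          # latent projection (length n_wavelengths)
        tt = t @ t
        p = Z.T @ t / tt                   # loadings
        q = r @ t / tt                     # inner regression coefficient
        Z = Z - np.outer(t, p)             # deflate the data
        r = r - q * t                      # deflate the target
        W.append(w); P.append(p); b.append(q)
    W, P, b = np.column_stack(W), np.column_stack(P), np.array(b)
    return W @ np.linalg.solve(P.T @ W, b) # m = W (P^T W)^{-1} b
```

When the number of latent projections equals the rank of the data, this reduces to an ordinary least squares fit of the target spectrum by the sample spectra; fewer projections give the reduced-rank model discussed in the text.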
A plot of the percent variance explained in $X$ or $y$, calculated from the residual sum of squares (SSQ), versus the number of latent projections may be used to decide on the number of latent projections used in the model. This number can be selected by the point of diminishing return [13], similar to K-means clustering, or by setting a threshold on the number of projections required to account for a set percentage (e.g. 95%) of the explained variance. This approach is most useful if the noise of the measurements is either known or estimable.

Algorithm 1 Target Partial Least Squares
procedure
  $u = y$
  for ($L$ in 1 to LatentProjections)
    while ($\|t_{new} - t_{old}\|_2 >$ tolerance)
      $w = uX / \|uX\|$
      $t_{old} = t_{new}$
      $t_{new} = Xw$
      $q = 1$
      $p = t_{new}X / (t_{new}t_{new})$
    end while
    $b = ut / (tt)$
    $X_L = X_{L-1} - pt$
    $y_L = y_{L-1} - bt$
    $SSQ_X = \sum_j \sum_i (X_L)_{ij}^2$
    $SSQ_y = \sum_j \sum_i (y_L)_{ij}^2$
    Append the $w$, $t$, $q$, and $p$ vectors to matrices $W$, $T$, $Q$, and $P$
    Append $b$ to a vector $B$
  end for
  $m = W(P^T W)^{-1}(b\,\delta_{i,i})$ // where $\delta_{i,i}$ is the Kronecker delta

3. Data

3.1. Triliquid Data

The triliquid dataset is a designed, near-infrared dataset collected at wavelengths of 1,100-2,500 nm using a FOSS 6500 instrument in transmission mode. The dataset is composed of spectra of acetic acid, methanol, and water at volume fractions of 0, 25, 50, 75, and 100%. Every sample in the design was prepared and measured in duplicate [28].

For the curve resolution comparison experiments, the samples that contained 100% concentration of a single liquid were not included in the training data. These pure samples were used only for comparisons with the pure component spectral estimates and calibration methods.

3.2. Milk Adulteration Data

The milk adulteration dataset was collected with a diffuse reflectance microPHAZIR™ near infrared spectrometer over the wavelength range of 1,595.7 - 2,396.3 nm.
The purpose of this dataset is to determine whether a sample is milk powder (48 samples) or milk powder adulterated with melamine (29 samples).

3.3. Vapor Release Data

The vapor release dataset consists of infrared hyperspectral measurements of dimethyl methylphosphonate (DMMP) vapors released from a gas stack at 170 °C. The release was measured against a fixed background at a distance of 1.5 km using a hyperspectral infrared spectrometer created by Physical Sciences Inc. The infrared spectrometer measured spectral responses over a range of 1,270 - 920 cm$^{-1}$ at a spectral resolution of 10 cm$^{-1}$. The hyperspectral images collected from the imaging spectrometer formed a data cube of 64 by 64 spatial pixels with 36 spectral measurements obtained at each spatial pixel. A black body radiation correction was applied because the excitation source used here was ambient sunlight [13].

4. Results and Discussion

4.1. Comparison with Known Methods

The triliquid data is a designed set of three polar liquids with volume fractions that varied in 25 v/v % increments. Because this dataset had known chemical composition and minimal contributions from background, it was used for assessing the BTEM estimation of pure spectral components and the accuracy of quantification for each known component in the mixture.

The pure component spectra for water, acetic acid, and methanol were estimated from the mixture spectra using band targeting and entropy minimization. The pure component spectral estimates obtained from BTEM were subjected to a second-order Savitzky-Golay smoothing filter to remove noise artifacts, as can be seen in Figure 1. The parameters of the Savitzky-Golay filter were manually tuned until the spectra resembled smooth line shapes. Procrustes distance analysis was employed to compare the shape similarities of the resolved pure component spectra and the experimentally obtained pure component spectra after rescaling the smoothed spectral estimates by their maximum value.
The Procrustes distances from the Savitzky-Golay smoothed BTEM estimated pure component spectra to the experimentally obtained pure spectra of methanol, acetic acid, and water were 0.758, 1.32, and 1.66 rescaled absorbance units, respectively.

Figure 1: Rescaled experimental and smoothed BTEM estimates of acetic acid (left), methanol (center), and water (right) spectra.

By the Procrustes distance metric, the smooth, BTEM-recovered methanol spectrum had a shape that was most similar to its experimental spectrum. The reason why the BTEM methanol estimate and the experimentally collected methanol spectrum were roughly two-fold more similar than the acetic acid and water estimates were to their respective experimental spectra may have been methodological, but it may have also been a result of the chemistry of the mixtures studied. In both the resolved components for acetic acid and water, there were relatively large distances from the experimental spectra at small wavelengths that may have been artifacts of the BTEM curve resolution method. It is known that shifts in the vibrational energy needed to excite the transitions in acetic acid can be brought on by the interaction between polar components of mixtures, by dimerization, and by proton dissociation equilibria [29]. Because of these chemical interactions, the BTEM estimates of acetic acid and those of water would not be expected to match the experimentally obtained pure component spectra.

Some evidence that suggested that different or shifted species existed for acetic acid was the presence of a band at approximately 2300 nm in the BTEM estimated pure component water spectrum. This claim was assessed by performing principal components analysis on the data in the 2250 - 2300 nm range and a second PCA excluding that range, using only 3 principal components and without any sample replicates. Ideal simplex experiments form linear simplices in the scores space of a PCA [28].
Figure 2 demonstrates that the PCA formed by using only the spectral information from 2250 - 2300 nm was more nonlinear than that obtained from the remaining wavelengths. The scores of the first three principal components of the band of interest had a coefficient of variation for the nearest neighbor distances of 235%. The coefficient of variation obtained from the wavelengths without the 2250 - 2300 nm band was only 163%. This indicated that the spacing between the points of the simplices in the scores space was less dispersed, and therefore more linear. Because the experimental design should be linear at rank 3, it is likely that the band is representative of an interaction effect, but it could also be explained as an artifact of a peak in acetic acid or methanol, such as a CH combination band, that covaried with the bands attributed to water.

Due to potential interactions between the component species in the triliquid mixture and possible methodological errors present from BTEM curve resolution, a detailed comparison between experimentally obtained and curve resolved pure component spectra could not be made. Our intended usage of the pure component spectral estimates obtained from BTEM was semiquantitative analysis. This dataset featured known concentration values so that quantitative efficacy could be explored in a way similar to that done with typical calibration models. The predictive error was calculated for each component by range scaling [30] the predictions between 0-100% and comparing them with the known v/v % amounts.

Figure 2: Scores plots of the first two principal components of the triliquid data without the absorbance measurements obtained at 2250 - 2300 nm (left), and of only the absorbance measurements obtained in the region of 2250 - 2300 nm (right).
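The error calculation described above, range scaling followed by a root mean squared error, can be written in a few lines. The helper names `range_scale` and `rmse` are illustrative, and min-max scaling over the full prediction vector is assumed to be what range scaling refers to here.

```python
import numpy as np

def range_scale(x, lo=0.0, hi=100.0):
    """Linearly map the smallest value of x to lo and the largest to hi."""
    x = np.asarray(x, dtype=float)
    return lo + (hi - lo) * (x - x.min()) / (x.max() - x.min())

def rmse(pred, known):
    """Root mean squared error between predictions and known amounts."""
    pred = np.asarray(pred, dtype=float)
    known = np.asarray(known, dtype=float)
    return float(np.sqrt(np.mean((pred - known) ** 2)))
```

Because the relative amounts from CLS or T-PLS have arbitrary units, the scaling step is what makes them comparable with known v/v % values; it also means the error depends on the samples that define the minimum and maximum.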
The root mean squared errors of estimation (RMSE) obtained from CLS and T-PLS regression models built from experimental, BTEM resolved, and second-order Savitzky-Golay smoothed BTEM resolved pure component spectra were compared with results attained from MCR-ALS quantification with three pure components, as shown in Table 1.

Curve Resolution Method    Calibration Method    Acetic Acid (% RMSE)    Methanol (% RMSE)    Water (% RMSE)
None (Experimental)        CLS                   36.43                   35.31                30.28
None (Experimental)        T-PLS                 12.72                   4.89                 5.66
BTEM (raw)                 CLS                   44.89                   44.27                49.55
BTEM (raw)                 T-PLS                 18.24                   6.19                 9.78
BTEM (smoothed)            CLS                   47.27                   45.96                48.53
BTEM (smoothed)            T-PLS                 11.73                   4.69                 9.75
MCR                        ALS                   7.19                    16.51                17.25

Table 1: Percent root mean squared errors of range scaled relative concentration estimates obtained from classical least squares, target partial least squares, and MCR-ALS for the known pure components in the triliquid dataset. Entries in bold text represent the lowest errors of the curve resolved calibrations, and those in italics are the entries which had the second lowest curve resolved errors.

As expected, the CLS regression models of single pure components all produced the largest predictive errors. The failure of CLS to adequately model single components is in agreement with the theoretical understanding that CLS requires full rank regression weights and that individually regressing each component of a three-component mixture gives rise to a rank deficient solution. The T-PLS models that were built from the smoothed BTEM pure component spectra tended to yield the lowest errors of estimation.

Although the calibration of curve resolved spectra was the focus of this study, it was interesting that the errors that resulted from T-PLS models that were calibrated using the smoothed, BTEM-resolved spectra for acetic acid and methanol were lower in magnitude than those obtained from the experimentally collected spectra.
The finding that these curve resolved components had lower errors of estimation than those from experimentally obtained pure spectra supports our claim of chemical interactions in the triliquid mixture.

In this experiment, the target partial least squares models built from the three smoothed BTEM pure component spectra had errors that were lower than or nearly commensurate with those from the three-component MCR-ALS model. These MCR-ALS models were built using spectral non-negativity and closure constraints. Many more varieties of MCR-ALS models were investigated; details of these are omitted from this report, but in our experiments, no three-component MCR-ALS model had lower predictive errors for methanol and water than those from BTEM and T-PLS.

MCR-ALS models with 1, 2, 4, and 5 pure components were also built using spectral non-negativity and closure constraints so that an assessment of the variability of the MCR-ALS models could be undertaken. The errors of estimation for these experiments are displayed in Table 2. The one-component MCR-ALS model only resolved the water spectrum, likely because the NIR water bands are of greatest intensity in these spectra. It is common for curve resolution methods to resolve the component which has the greatest intensity first [6]. On the triliquid dataset, it was possible to manually recover individual spectra of pure components with BTEM regardless of a component's relative intensity, provided the experimental spectra were band targeted appropriately. The two-component MCR-ALS model featured a pure component spectrum with bands that were present in both the experimentally obtained acetic acid and methanol spectra; thus, the errors of prediction for both analytes were assessed using the one spectrum. The two-component MCR-ALS model was unable to exclusively recover the spectra of either methanol or acetic acid, although a three-component MCR-ALS model was able to do so.
For this dataset, the Scree plot indicated that there were three or four pure components in the mixture. In the case of three components, BTEM and T-PLS calibration resulted in lower predictive errors for water and methanol, but not for acetic acid, as stated previously. The four-component MCR-ALS model, however, had lower errors of estimation for water and methanol, but not for acetic acid. The error of estimation for methanol from the four-component MCR-ALS model was only 10% different from that of the T-PLS calibrated BTEM estimate. The variability between estimation errors obtained with MCR-ALS by choosing 3 or 4 components in the mixture resulted in a difference of 10.06% RMSE for acetic acid. The difference in quantitative efficacy between both models with the selection of a single parameter is noteworthy. Overall, the MCR-ALS models with three or four components resulted in errors that were similar to those obtained from the BTEM and T-PLS modeling.

Pure Components (#)    Acetic Acid (% RMSE)    Methanol (% RMSE)    Water (% RMSE)
1                      N/A                     N/A                  20.79
2                      43.66                   44.97                8.50
4                      15.82                   4.20                 7.68
5                      17.24                   6.41                 9.55

Table 2: Percent root mean squared errors of range scaled relative concentration estimates obtained from MCR-ALS with 1, 2, 4, and 5 estimated components in the triliquid dataset. Entries denoted by N/A indicate that concentration estimates could not be obtained using MCR-ALS.

The similarities between the errors obtained from the best MCR-ALS models (3 or 4 components) and those obtained from T-PLS calibrated BTEM estimates were interesting because these two approaches define very different curve resolution experiments and have different methods of operation. Modeling with BTEM and T-PLS involves resolving a single pure component spectrum followed by the quantification of only that component. This analysis is performed independently of other components, while MCR-ALS simultaneously decomposes all components.
For experiments where a training set with known pure components and concentrations is available, such as the triliquid data, finding a suitable number of components in an MCR-ALS model is usually a simple task. However, there are many experiments where the goal is the recovery and quantification of only a specific pure component, if it exists, in a mixture. Two studies where BTEM and T-PLS modeling can be used in conjunction to estimate relative amounts of one unknown chemical component in a mixture are presented below.

4.2. Analysis of Contaminant in Milk Powder

The first study concerns a quality control process, where the goal of the analysis is to assess the purity of a raw material and to rapidly identify contamination present in a sample. The approach is simple; it relies on variation induced by serial dilution. The idea is that newly acquired material of unknown quality can be diluted with pure stock material and analyzed after each dilution. If any components other than the pure stock material are present, band target entropy minimization can be used to estimate their pure component spectra from the changes in their signal intensity across the diluted samples, because the primary sources of variation should be limited to the concentration of the interferent and any artifacts of the measurement itself. To ensure that the contaminant spectrum obtained by BTEM is actually present in the sample matrix and not an artifact of the recovery method, T-PLS may then be used to construct a calibration model for the isolated, pure component spectrum and to reduce background effects present in the dilution measurements [13].

The serial dilution experiment was assessed on a milk powder dataset. Five spectra that had melamine mass fractions of 0.05, 0.1, 0.5, 1, and 2% were sampled.
Band target entropy minimization of the entire spectral range using the first two singular values of the normalized spectral data resulted in the extraction of a pure component spectrum that was visually comparable to that of melamine [31] (Figure 3). The entire spectral region was targeted in this analysis because the nature of the contaminant and its spectral signature were presumed to be unknown, and there were no visible adulterant bands in the NIR spectra.

A T-PLS semi-quantitative calibration using two latent projections created from the second-order Savitzky-Golay smoothed BTEM pure component spectral estimate yielded a 6.59% relative RMSE (0.10 w/w % absolute RMSE) over the same five sample range. This analysis was performed twice more to assess variability, using different samples that had the same diluted concentrations. A mean relative RMSE of 11% was calculated. The R-squared values obtained from linear fits for the actual versus predicted concentration plots generated from the T-PLS calibrations were all >0.90, which indicated a linear calibration. The linearity of the T-PLS calibrations was lower than what might be expected from an ordinary partial least squares regression, but still strongly supported the claim that the component recovered by BTEM was not a spectral artifact, and was in fact a component that varied proportionally with the dilution process.

MCR-ALS modeling with SIMPLISMA-estimated initial pure component spectra was also applied to the same samples used in the BTEM T-PLS modeling in an attempt to discover the adulterant. In our one-component MCR-ALS analysis, non-negativity constraints were applied, but the only pure component spectral estimate that was obtained was visually similar to the unadulterated milk samples, and not to melamine. Even though the SVD Scree plot indicated only one major component, we investigated a second model using two pure components.
This resulted in two highly similar spectra, both of which were again representative of the unadulterated milk powder. For this dataset, MCR-ALS modeling was unable to find the melamine contaminant or to quantify it. We attribute the failure of the MCR-ALS models to recover the pure components associated with melamine to the relatively low intensity of the contaminant response, and to the scatter present in the reflectance measurements.

Figure 3: The raw and Savitzky-Golay smoothed BTEM estimated pure component spectra obtained from the serially diluted adulterated milk samples (left). An actual versus predicted plot of melamine concentrations obtained from the target partial least squares model using the Savitzky-Golay smoothed spectral estimate (right). The average predicted values are shown with 1 sigma error bars (N = 3) for their scaled predictions.

BTEM pure component spectral estimates obtained from samples that were reported to contain only pure milk powder were employed as a control. The spectra recovered from these experiments were most similar to the unadulterated milk powder spectra themselves, as can be observed in Figure 4. Calibration by T-PLS was not attempted because no other components appeared in the loading plots obtained from the singular values, and the majority of the variance across samples was attributable to noise and scatter effects. This result was repeated two more times, and each time no components were found using BTEM other than the raw milk powder spectra. In this case, MCR-ALS modeling provided the same result as BTEM; only recovered spectra which resembled the raw milk powder were obtained.

Figure 4: Overlay of five unadulterated milk powder spectra (left). The raw spectra and smoothed band target entropy minimization estimate obtained from the first two singular values of the milk powder spectra (right).

4.3. Analysis of Hyperspectral Images

The second study concerns the spectral recovery and calibration of chemical components obtained from concentration gradients that occur in hyperspectral images. The remote monitoring or imaging of chemicals in the environment is a challenging experimental problem. However, concentration gradients occur naturally from physical processes such as diffusion, advection, and convection. It was hypothesized that band target entropy minimization could resolve pure component spectra from natural gradients in the concentration of a target species in chemical images and that target partial least squares could be used to perform semiquantitative targeted calibrations despite a dynamic background.

The dataset we investigated for the concentration gradient study was the vapor release dataset first reported by Feudale et al. [13]. The vapor release dataset was collected to monitor the release of dimethyl methylphosphonate (DMMP) from a gas stack via mid-infrared frequencies that are associated with phosphonate functional groups. A pure component spectral estimate was resolved from seven of the pixels near the gas stack by applying entropy minimization with three singular values using the entire spectral region. It can be seen from Figure 5 that the extracted pure component spectral response is visually similar to that of a NIST reference spectrum of DMMP [32], which had more than an eight-fold greater spectral resolution than that used to collect the hyperspectral images.

Figure 5: A spectral overlay of the band target entropy minimization pure component spectral estimate from selected pixels in the vapor release data and a NIST reference spectrum for DMMP (top). A z-score normalized hyperspectral image of the vapor release data (bottom left). This normalization set values that were more than 2σ away from the mean intensity to the mean. This normalization was used only for this display and not for any of the reported analyses. The raw heat map generated from T-PLS calibration of the BTEM pure spectral estimate of DMMP on the hyperspectral image (bottom right).

In a previous study, target partial least squares modeling was shown to be an effective method for calibrating DMMP on this dataset from a separate, experimentally obtained spectrum of pure DMMP [13]. In this study, the pure component spectral estimate obtained via BTEM from the hyperspectral data was utilized as the target spectrum in T-PLS. A T-PLS calibration model was made with six latent projections (100% variance explained in $y$, and >98% in $X$) on the entire hyperspectral image using the spectral estimate obtained from BTEM. The results from the calibration are shown in Figure 5. The greatest estimated intensity of the recovered DMMP spectrum was found near the gas stack, similar to what was found in the previous study [13]. A small fraction (14/4096) of pixels appeared to have relatively large calibrated amounts of the estimated DMMP signature but were not located within a 10 pixel radius of the gas stack from which the vapor was released. It was hypothesized that those high intensity pixels were artifacts of random variation because they represented only 0.34% of the hyperspectral image and were not part of the release experiment. To further assess the variability of the methods and those pixels, a resampling study was performed.

Jackknife mean and variance estimations [33] were performed to assess how variable BTEM and T-PLS estimates of DMMP concentration were for the vapor release data. Both the BTEM spectral estimates and the intensity maps obtained by T-PLS were individually range scaled between zero and one [30] so that the semiquantitative T-PLS predictions could be pooled. It was found that the same pixel (X = 35, Y = 34) contained the greatest intensity for the jackknifed DMMP target estimate across all leave-one-out trials, which indicated uniform quantitation across the hold out trials.
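The pooling step described above can be sketched as follows. The text does not specify how the leave-one-out trials were formed, so the sketch takes the per-trial intensity maps as given; it assumes each map has already been produced by a BTEM and T-PLS run, and it uses the standard jackknife variance with its (n - 1)/n factor. The names `range_scale01` and `jackknife_mean_var` are hypothetical.

```python
import numpy as np

def range_scale01(x):
    """Scale an array so its minimum maps to 0 and its maximum to 1."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())

def jackknife_mean_var(loo_maps):
    """Jackknife mean and variance over leave-one-out estimates.

    loo_maps : (n_trials, ...) array, one range-scaled map per held-out trial
    """
    loo = np.asarray(loo_maps, dtype=float)
    n = loo.shape[0]
    mean = loo.mean(axis=0)
    var = (n - 1) / n * ((loo - mean) ** 2).sum(axis=0)   # standard jackknife variance
    return mean, var
```

A typical use would stack the maps as `np.stack([range_scale01(m) for m in maps])` before calling `jackknife_mean_var`, and take the square root of the variance to obtain a standard deviation map.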
More evidence supporting the uniformity of the calibration was found in the fact that the estimated standard deviation heat map showed the least variation in the predictions nearest the gas stack chimney and the most variation in the background.

Interestingly, in the jackknife mean estimate, the fourteen pixels of relatively large intensity believed to be artifacts of random variations were observed to still be relatively high in intensity (Figure 6). However, those pixels also tended to have large jackknife variance estimates (relative to those pixels nearest the chimney), a result which further supported the notion that those pixels were artifacts in the data and did not contain the target signature. This finding was congruent with an earlier T-PLS analysis on the same dataset that used an experimentally obtained target DMMP spectrum. That study demonstrated, using a Kolmogorov-Smirnov test, that it was statistically unlikely that the pixels located away from the chimney were from the same probability distribution as the pixels near the chimney [13].

Although MCR-ALS has been shown to be a valuable tool for the analysis of hyperspectral images when the components of interest are well represented and relatively high in intensity [15], this dataset posed a challenge. This dataset features many contributions to variance because the spectral background depended upon the spatial location and because DMMP is a relatively minor component. MCR-ALS with SIMPLISMA initial estimates failed to recover the target DMMP spectrum using both the seven pixels selected for BTEM and the entire hyperspectral image.

Figure 6: The leave-one-out jackknife estimated mean (left) and standard deviation (right) intensity map estimates obtained by T-PLS and BTEM on the vapor release data.

5. Conclusion

The results from this study demonstrated that the errors of estimation obtained from target partial least squares models built by using band target entropy minimization to extract pure component spectra were lower than those from classical least squares and were similar to those from multivariate curve resolution alternating least squares for at least one dataset. It was also shown that BTEM and T-PLS can be used together to identify minor contaminant species and calibrate their presence by a simple serial dilution experiment despite scatter and noise in the measurements. Because the only major requirement for BTEM is that the signal of the species of interest varies across samples, it was also demonstrated that it is possible to extract a spectral estimate of a minor component present in the advection of vapor released from a gas stack. It was also shown that T-PLS could be used to quantify the estimated pure component spectra of the vapors present across a hyperspectral image despite the presence of spatially varying spectral fluctuations in an unknown background.

We believe that MCR-ALS failed to estimate pure spectral components in the two mixtures due to insufficient representation of all components. The milk powder spectra appeared to be affected by scattered light and instrumental noise. Although MCR-ALS was capable of obtaining an estimate for the reference milk powder spectra, we were unable to estimate the melamine contaminant, despite the variation in its concentration present in the samples. The vapor release dataset had unknown background effects due to the size and emissivity of the physical area which was measured and to different temperatures, even after attempting to band target the vapor plume. In these cases, it was found to be advantageous to obtain estimates for pure component spectra that were not linked to the success of finding the other components present in the spectra, nor their respective concentration profiles.
Even though the mixture components were too poorly represented for MCR-ALS to obtain estimates for them, BTEM and T-PLS could be used to successfully model pure spectral estimates and to calibrate their presence.

The combination of BTEM and T-PLS appears to be a useful one. Very few tools and strategies exist for both qualitatively and quantitatively extracting information about minor components without pure component spectra or property values. We hope that more studies are performed that demonstrate the utility of these tools, and their variations, for hyperspectral imaging and quality control applications.

6. Acknowledgments

This work was supported by the United States National Science Foundation grant 1506853.

7. Conflict of Interest

The authors declare no conflict of interest.

8. References

[1] Lawton, W. H. Sylvestre, E. A. Self modeling curve resolution. Technometrics. 13, 3, (1971), 617-633.
[2] Maeder, M. Evolving factor analysis for the resolution of overlapping chromatographic peaks. Analytical Chemistry. 59, 3, (1987), 527-530.
[3] Keller, H. R. Massart, D. L. Evolving factor analysis. Chemometrics and Intelligent Laboratory Systems. 12, 3, (1991), 209-224.
[4] Malinowski, E. R. Window factor analysis: theoretical derivation and application to flow injection analysis data. Journal of Chemometrics. 6, 1, (1992), 29-40.
[5] Liang, Y. Z. Kvalheim, O. M. Keller, H. R. Massart, D. L. Kiechle, P. Erni, F. Heuristic evolving latent projections: resolving two-way multicomponent data. 2. Detection and resolution of minor constituents. Analytical Chemistry. 64, 8, (1992), 946-953.
[6] Vandeginste, B. G. M. Derks, W. Kateman, G. Multicomponent self-modelling curve resolution in high-performance liquid chromatography by iterative target transformation analysis. Analytica Chimica Acta. 173, (1985), 253-264.
[7] Windig, W. Antalek, B. Lippert, J. L. Batonneau, Y. Brémard, C. Combined Use of Conventional and Second-Derivative Data in the SIMPLISMA Self-Modeling Mixture Analysis Approach. Analytical Chemistry. 74, 6, (2002), 1371-1379.
[8] Tauler, R. Izquierdo-Ridorsa, A. Casassas, E. Simultaneous analysis of several spectroscopic titrations with self-modelling curve resolution. Chemometrics and Intelligent Laboratory Systems. 18, 3, (1993), 293-
[9] Shen, H. Stordrange, L. Manne, R. Kvalheim, O. M. Liang, Y. The morphological score and its application to chemical rank determination. Chemometrics and Intelligent Laboratory Systems. 51, 1, (2000), 37-47.
[10] Chew, W. Widjaja, E. Garland, M. Band-target entropy minimization (BTEM): an advanced method for recovering unknown pure component spectra. Application to the FTIR Spectra of Unstable Organometallic Mixtures. Organometallics. 21, 9, (2002), 1982-1990.
[11] Widjaja, E. Li, C. Chew, W. Garland, M. Band-Target Entropy Minimization. A robust algorithm for pure component spectral recovery. Application to Complex Randomized Mixtures of Six Components. Analytical Chemistry. 75, 17, (2003), 4499-4507.
[12] Geladi, P. Kowalski, B. R. Partial least-squares regression: a tutorial. Analytica Chimica Acta. 185, 1, (1986), 1-17.
[13] Feudale, R. N. Brown, S. D. An inverse model for target detection. Chemometrics and Intelligent Laboratory Systems. 77, 1-2, (2005), 75-
[14] Shariati-Rad, M. Hasani, M. Application of multivariate curve resolution-alternating least squares (MCR-ALS) for secondary structure resolving of proteins. Biochimie. 91, 7, (2009), 850-856.
[15] Colares, C. J. G. Pastore, T. C. M. Coradin, V. T. R. Marques, L. F. Moreira, A. C. O. Alexandrino, G. L. Poppi, R. J. Braga, J. W. B. Near infrared hyperspectral imaging and MCR-ALS applied for mapping chemical composition of the wood specie Swietenia Macrophylla King (Mahogany) at microscopic level. Microchemical Journal. 124 (2016), 356-
[16] de Juan, A. Tauler, R. Multivariate curve resolution (MCR) from 2000: Progress in concepts and applications. Critical Reviews in Analytical Chemistry. 36, 3, (2006), 163-176.
[17] Tauler, R. Lacorte, S. Barceló, D. Application of multivariate self-modeling curve resolution to the quantitation of trace levels of organophosphorus pesticides in natural waters from interlaboratory studies. Journal of Chromatography A. 730, 1-2, (1996), 177-183.
[18] Li, Q. Tang, Y. Yan, Z. Zhang, P. Identification of trace additives in polymer materials by attenuated total reflection Fourier transform infrared mapping coupled with multivariate curve resolution. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy. 180, (2017), 1386-1425.
[19] Richards, S. E. Becker, E. Tauler, R. Walmsley, A. A novel approach to the quantification of industrial mixtures from the Vinyl Acetate Monomer (VAM) process using Near Infrared spectroscopic data and a Quantitative Self Modeling Curve Resolution (SMCR) methodology. Chemometrics and Intelligent Laboratory Systems. 94, 1, (2008), 9-18.
[20] Tauler, R. Smilde, A. Kowalski, B. Selectivity, local rank, three-way data analysis and ambiguity in multivariate curve resolution. Journal of Chemometrics. 9, 1, (1995), 31-58.
[21] Motegi, H. Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): A Data-Driven Approach across Metabolic Processes. Scientific Reports. 5 (2015), 1-12.
[22] Cattell, R. B. The Scree test for the number of factors. Multivariate Behavioral Research. 1, 2, (1966), 245-276.
[23] Abdollahi, H. Tauler, R. Uniqueness and rotation ambiguities in Multivariate Curve Resolution methods. Chemometrics and Intelligent Laboratory Systems. 108, 2, (2011), 100-111.
[24] Jaumot, J. Gargallo, R. de Juan, A. Tauler, R. A graphical user-friendly interface for MCR-ALS: a new tool for multivariate curve resolution in MATLAB. Chemometrics and Intelligent Laboratory Systems. 76, 1, (2005), 101-110.
[25] Richards, S. E. Walmsley, A. Quantitative iterative target transformation factor analysis. Journal of Chemometrics. 22, 1, (2008), 63-80.
[26] Wold, S. Sjöström, M. Eriksson, L. PLS-regression: a basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems. 58, 2, (2001), 109-130.
[27] Haaland, D. M. Thomas, E. V. Partial least-squares methods for spectral analyses. 1. Relation to other quantitative calibration methods and the extraction of qualitative information. Analytical Chemistry. 60, 11, (1988), 1193-1202.
[28] Mark, H. Chemometrics in near-infrared spectroscopy. Analytica Chimica Acta. 223, (1989), 75-93.
[29] Nakabayashi, T. Nishi, N. States of Molecular Associates in Binary Mixtures of Acetic Acid with Protic and Aprotic Polar Solvents: A Raman Spectroscopic Study. The Journal of Physical Chemistry A. 106, 14, (2002), 3491-3500.
[30] Sharaf, M. A. Illman, D. L. Kowalski, B. R. Chemometrics. Wiley-Interscience: New York, NY. (1986), 193.
[31] Baeten, V. Dardenne, P. NIR-based detection of contaminants in food and feed. Broadening Horizons N35, November 2016. URL: https://www.feedipedia.org/content/nir-based-detection-contaminants-food-and-feed (accessed September 10, 2017).
[32] Stein, S. E. NIST Mass Spec Data Center, "Infrared Spectra" in NIST Chemistry WebBook, NIST Standard Reference Database Number 69, Eds. P. J. Linstrom and W. G. Mallard, National Institute of Standards and Technology, Gaithersburg MD, 1970, (retrieved January 5, 2018).
[33] Efron, B. Stein, C. The jackknife estimate of variance. The Annals of Statistics. 9, 3, (1981), 586-596.

Journal: Statistics, arXiv (Cornell University)

Published: Feb 11, 2018