Download The Unscrambler Methods

Transcript
Camo Software AS
The Unscrambler User Manual
Non-orthogonal Scores and Weights
Orthogonality properties of scores and weights are seldom of too much practical concern in PLS regression.
Orthogonality is primarily important in the mathematical derivations and in developing algorithms.
In some situations, the non-orthogonal nature of scores and weights in tri-PLS may lead to surprising, though
correct, models. For example, two weight vectors of two different components may turn out very similar. This
can happen if the same variation in one variable mode is related to two different phenomena in the data.
For instance, a general increase over time (variable mode one) may occur for two different spectrally detected
substances (variable mode two). In such a case, the appearance of two similar weight vectors is merely a useful
flagging of the fact that the same time-trend affects different parts of the model.
Maximum Number of Components
The formula for determining the maximum possible number of components in PLS1 and PLS2 is min (I -1, K)
with I the number of samples in the calibration set and K the number of variables. In Three-way PLS there are
two variable modes, such that the maximum possible number of components is min(I-1, K*L) with K and L the
numbers of primary and secondary variables. If the data is not centered, the maximum number of components
is min(I,K*L).
Interpretation of a Tri-PLS Model
Once a three-way regression model is built, you have to diagnose it, i.e. assess its quality, before you can start
interpreting the relationship between X and Y. Finally, your model will be ready for use for prediction once
you have thoroughly checked and refined it.
Most tri-PLS results are interpreted in much the same way as in ordinary PLS (see Chapter “Main Results of
Regression” p. 111 for more details). Exceptions are listed in Chapter “Main Results of Tri-PLS Regression”
above.
Read more about specific details:

Interpretation of variances p. 101

Interpretation of the two sets of weights p. 183

Interpretation of non-orthogonal scores and weights p. 184

How to detect outliers in regression p. 115
Three-way Data Analysis in Practice
The sections that follow list menu options, dialogs and plots for three-way data analysis (nPLS). For a more
detailed description of each menu option, read The Unscrambler Program Operation, available as a PDF file
from Camo’s web site www.camo.com/TheUnscrambler/Appendices .
In practice, building and using a tri-PLS regression model consists of several steps:
1. Choose and implement an appropriate pre-processing method. Individual modes of a 3-D data array
may be transformed in the same way as a “normal” data vector (see Chapter Re-formatting and Preprocessing);
2. Build the model: calibration fits the model to the available data, while validation checks the model for
new data;
3. Choose the number of components to interpret, according to calibration and validation variances;
184 Three-way Data Analysis
The Unscrambler Methods