Odor Impression Prediction from Mass Spectra

doi:10.1371/journal.pone.0157030

Fig 1.

Schematic diagram of the predictive model.

The character or number in each box gives the name of the layer or the number of units (dimensionality), respectively.

More »

Expand

Fig 2.

Schematic diagram of an autoencoder.

x_ni corresponds to the ith element of the nth sample. b^(j) is the bias in the jth layer. y_n(x_n;W) = f(W⁽⁴⁾f(W⁽³⁾f(W⁽²⁾f(W⁽¹⁾x_n + b⁽⁰⁾) + b⁽¹⁾) + b⁽²⁾) + b⁽³⁾) is the output of an autoencoder for a given x_n, where W = {W⁽⁴⁾,W⁽³⁾,W⁽²⁾,W⁽¹⁾,b⁽⁰⁾,b⁽¹⁾,b⁽²⁾,b⁽³⁾}. f is a p-dimensional sigmoid function, f(a) = [1/(1 + exp(−a₁)),… 1/(1 + exp(−a_p))].

More »

Expand

Fig 3.

Pretraining procedure for autoencoder.

The weights of a 5-layer autoencoder (right) are copied from two 3-layer autoencoders (left and middle).

More »

Expand

Fig 4.

Mean reconstruction errors in cross validation.

A, error of autoencoder for sensory data with respect to the number of units K_S B, error for mass spectra with respect to the number of units K_M.

More »

Expand

Fig 5.

Mean reconstruction errors in cross-validation.

Optimal K_S (or K_M), giving the minimum error for each D_S (or D_M) with reference to Fig 4. Error bars indicate standard deviations of testing sample sets. A, error of autoencoder for sensory data with respect to the number of neurons in the hidden layer and error of PCA for sensory data with respect to the number of principal components. B, error for mass spectra with respect to the number of neurons in the hidden layer (in autoencoder) and error with respect to the number of principal components (in PCA).

More »

Expand

Table 1.

Number of neurons employed in 9-layer predictive model.

More »

Expand

Table 2.

Constant coefficients used in updating rule.

More »

Expand

Fig 6.

Experimental Result.

Examples of predictions by two models, which give a value close to the correlation coefficient for each method. 3024 (= 144 descriptors × 21 samples) data points are plotted in each Fig A, result for the 9-layer predictive model (R ≅ 0.76), B, result for the PLS model (R ≅ 0.61).

More »

Expand

Fig 7.

Mean prediction errors of 121 chemical samples.

The six most significant six samples (the top 5%) are indicated with the sample number.

More »

Expand

Fig 8.

Mean prediction error of dimethylpyrazine (No. 47).

The maximum error in the sample was found to have a sensory normalized value of about 0.3.

More »

Expand

Fig 9.

Scatter plots of the result of PCA applied to the original sensory evaluation data.

A, first and second principal components. B, first and third principal components.

More »

Expand