Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Predicting Thermodynamic Properties of PBXTHs with New Quantum Topological Indexes

  • Fangzhu Xiao ,

    Contributed equally to this work with: Fangzhu Xiao, Changming Nie

    Affiliation School of Public Health, University of South China, Hengyang 421001, China

  • Guowen Peng ,

    pgwnh@sohu.com

    Affiliation School of Chemistry and Chemical Engineering, University of South China, Hengyang 421001, China

  • Changming Nie ,

    Contributed equally to this work with: Fangzhu Xiao, Changming Nie

    Affiliation School of Chemistry and Chemical Engineering, University of South China, Hengyang 421001, China

  • Limei Yu

    Affiliation School of Chemistry and Chemical Engineering, University of South China, Hengyang 421001, China

Abstract

Novel group quantitative structure-property relationship (QSPR) models on the thermodynamic properties of PBXTHs were presented, by the multiple linear regression (MLR) analysis method. Four thermodynamic properties were studied: the entropy (Sθ), the standard enthalpy of formation (ΔfHθ), the standard Gibbs energy of formation (ΔfGθ), and the relative standard Gibbs energy of formation (ΔRGθ). The results by the formula indicate that the calculated and predicted data in this study are in good agreement with those in literature and the deviation is within the experimental errors. To validate the estimation reliability for internal samples and the predictive ability for other samples, leave-one-out (LOO) cross validation (CV) and external validation were performed, and the results show that the models are satisfactory.

Introduction

Quantitative structure–property relationship (QSPR) remains the focus of many studies aimed at the modeling and prediction of physicochemical properties or biological activities of molecules, because of their convenience and importance for practical use and molecular design when the physicochemical properties or biological activities of compounds are closely related with their structures [13].

In QSPR studies, developing topological index is a very crucial step, which is graph theoretical descriptor obtained by transforming molecular structures into the corresponding molecular graphs [45]. Since the first topological index W was proposed by Wiener in 1947, more and more topological indexes have been constructed because of their simpleness, speediness, and accuracy [6]. Many of them were based on the distance matrixes, such as Balaban index, Hyper-Wiener index, Hyper-Detour index, Detour index, Hosoya index, and Pasareti index. However, these distance matrices consisted of the shortest distances from vertex i to all other (n -1) vertices in the molecular graphs, and the shortest distance of two adjacent atoms vertex was regarded as ‘‘1”. In fact, the topological space distances is not ‘‘1”, therefore, most of them could not reveal the real connection among atoms, and are not suitable for heteroatom-containing and multiple bond organic compounds [79].

Recently, more useful and significant topological indexes have been derived from the molecular structural information and the chemical conditions of atoms, for example, Lu index based on the relative electro-negativity and the relative bond length of vertices [10]; the augmented eccentric connectivity index on the ground of the adjacency-cum-distance [11]. At the same time, our group proposed some new topological descriptors, such as PY1PY2 indexes on the basis of space distance matrix, equilibrium electro-negativity and the branching effect [9], PX1PX2 indexes based on topological distance matrix, the branch vertex of atoms and equilibrium electro-negativity [12], PE index on the ground of distance matrix and equilibrium electro-negativity[13].

XTH (xanthone) compound, which is the main component of gentianaceaescutellaria stonecrop, is a common folk medicine used as clearing heat, anti phlogosis, liver-protection, cholagogic, detoxification in naxi nationality, tibetan and miao nationality[14]. Due to their wide distribution and important application, xanthonederivants have gained the interest of researchers. For example, PBXTHs (polybrominatedxanthones) are important xanthonederivants[15].

Because the structures of PBXTHs are similar, which have 135 possible structures, according to the number of Br atoms and different replace locations of XTH. If the physical and chemical properties or thermodynamic properties of each PBXTH compounds are determined, it is not realistic in terms of both manpower and material resources. So, QSPRs have been extensively used in molecular structure description and property investigation on PBXTHs and demonstrated obvious advantages.

In this work, as a continuation of our earlier work[89, 1213, 1618], the new quantum topological indices XP1XP2 of XTH and 135 PBXTHs were constructed combined with the theory of quantum chemistry and topological chemistry. At the same time, the multiple linear regression (MLR) analysis was used to build novel group QSPR models for predicting of the thermodynamic properties (SθS, ΔfHθ, ΔfGθΔ and ΔRGθΔ)of XTH and 135 PBXTHs.

Materials and Methods

Data Set

All the experimental data of the thermodynamic properties(Sθ, ΔfHθ, ΔfGθ and ΔRGθS) of XTH and 135 PBXTHs used in this work, were obtained from the calculated values in literature [19].

Construction of new quantum topological indices XP1XP2

The QSPR studies of XTH and 135 PBXTHs were performed in four fundamental stages: (1) Selection of data set; (2) Construction of new quantum topological indices XP1XP2; (3)Multiple linear regression (MLR) statistical analysis; and (4) Model validation techniques. The first as well as the most crucial step is how to exactly extract sufficiently the molecular structure information with numerical format from the molecular graph [20].

Structure and atom label of XTH is given in Fig 1:

First, MOPAC 7.0 software was used to optimize the initial geometric parameters of molecular structures of XTH and 135 PBXTHs by constructing and using AM1 semi-empirical quantum chemistry methods. Then, the further geometric configuration optimization and vibration analysis were completed by using Gaussian03 software on the B3LYP/6-31+ G (d) basis set, with the application of density functional theory (DFT). When the stable molecular configuration forming, the potential energy surface scanning method was used to scan all possible bond angle, the dihedral angle, and the corresponding relationship between energy and geometric configuration will be set. On this basis, the spatial topological distance stdij were calculated between individual atoms of XTH and 135 PBXTHs.

In order to extract the molecular structure information of XTH and 135 PBXTHs sufficiently, we adopt the distance matrix D and the branching degree matrix V to descript molecular structure. The distance matrix D of n atoms in a molecule, a square symmetric matrix, can be expressed as D = [dij]n×n. where dij is the length of the shortest path between the vertices i and j in molecular skeleton graph. Instead, in this paper, dij was revised by using the spatial topological distance stdij. Therefore, the following distance matrix is 3 D topological distance matrix 3D.

As one of the main properties of atoms, electro-negativity represents the ability of atoms to obtain or lose electrons when it is in a compound. The larger the electro-negativity of an atom is, the stronger the ability of the atom to attract electrons is. Based on Pauling electro-negativity, the group electro-negativity xG can be calculated by the method of stepwise addition[21].

The group electro-negativity of a group structural tree is illustrated in Fig 2.

When the group is a single atom, its group electro-negativity is Pauling electro-negativity of this atom. For a group with more than two levels, all the atoms or groups attached to “anchor atom” are weighted equally, which can be expressed as follows [22].

  1. The equilibrium of the first level:
  2. The equilibrium of the second level:
  3. ……
  4. The equilibrium of the k-th level:

Then, the group electro-negativity χG is defined as: (1)

For a molecule with an equilibrium structure, the equilibrium electro-negativity of atom i is defined as: (2) Where χiA is the Pauling electro-negativity for atom i, χG is the electro-negativity of group directly attached to atom i calculated by Eq (1), and l is the group number directly attached to atom i.

For a two-level group such as “= CH2” and “-CHI2”, all of the atom are weighted equally, so that:

And:

For a group with more than two levels, all of the atoms or groups attached to the “anchor atom” are weighted equally. For example,

In this paper, the equilibrium electro-negativity matrix E is established to reflect every atomic chemical environmental change of a molecule, and the matrix E is defined as following, E = [χ1 χ2χn-1 χn]. T is the transpose of the matrix (the same below).

In addition, the branching degree matrix V is established with each atom bonding state and the coupling relationship between atoms, in order to reflect the branching effect of each atom in molecule. The matrix V is defined as following, V = [v1 v2vn-1 vn], vi is calculated by vi = zihi+1. Where zi represents the number of valence electron outside the atom nucleus, hi is the number of hydrogen atoms connecting with atom i.

Molecular structure and property are closely related with the atom space effect, the character of the bonding atoms (such as equilibrium electro-negativity) and the branching effect between the atoms. We think that these three factors cooperatively affect the molecular character and property. In this paper, we established a new extension matrix S on the basis of the topological distance matrix 3D, the equilibrium electro-negativity matrix E andthe branching degree matrix V. And the matrix S is defined: M = 3D×E×V. At the same time, the matrix S is expressed as following:

Then, the correctional matrix Q is established by Eq (3), on the basis of the extension matrix S.

(3)

The characteristic values λQ, n of the correctional matrix Q are calculated using MATLAB, which are arranged from small to big.

In this paper, the new quantum topological indices XP1XP2 will be defined as[9].: (4) Where λQ,min is the fisrt characteristic values λQ, 1 of the correctional matrix Q, and λQ,max is the the n-th characteristic values λQ, n.

For example, the molecular structure of 2,8-DBXTH is given in Fig 3. and the correctional matrix Q2,8-DBXTH is given below.

The characteristic values λQ of the correctional matrix Q2,8-DBXTH are -9.3541, -6.5978, -4.5987, -2.2512, -1.3575, -1.03279, -0.80915, -0.6308, 5.1485, 7.9062, 11.3299, 16.0541, 19.1825, 21.9817, 22.3654, 36.3718and 58.2157, respectively.

Then, the new quantum topological indices XP1XP2 of 2,8-DBXTH are XP1 = |-9.3541| = 9.3541, XP2 = |58.2157| = 58.2157, respectively.

According to the same method, the new quantum topological indices XP1XP2 of XTH and 135 PBXTHswere constructed. The calculation results are shown in Table 1.

thumbnail
Table 1. Thermodynamic Data and the Quantum Topological Indices XP1XP2 of XTH and 135 PBXTHs.

https://doi.org/10.1371/journal.pone.0147126.t001

Results and Discussion

Regression analysis

The simplest expression of the fundamentalprinciple of QSPR theory is a linear relationship P = a+bX between a property P and the chosen moleculardescriptor X, where a and b are real numbers determinedby a standard least-square procedure[23]. According tothe aforementioned method, the multiple linear regression (MLR) analysis using the new quantum topological indices XP1XP2 was performed for obtaining the QSPR models of the thermodynamic properties (SθfGθ and ΔRGθ) of XTH and 135 PBXTHs. At the same time, to test the stability of QSPR models, leave-one-out (LOO) cross validation (CV)was carried out. Thefinal QSPR models are conducted as follows: (5) (6) (7) Where n is the number of data points; R is the correlation coefficient; Rcv, S, F are the cross-validated correlation coefficient, the standard error of estimate, and the Fisher statistic value, respectively.

Particularly, if the correlation coefficient, the Fisher criterion and the cross-validated correlation coefficient are high, the new quantum topological indices XP1XP2 are better to explain the thermodynamic properties (SθfGθ and ΔRGθ) of XTH and 135 PBXTHs. From Eq (5) to Eq (7), the high correlation coefficient and the low standard deviation of the model indicate that there are very good correlation between the thermodynamic properties (SθfGθ and ΔRGθ) of XTH and 135 PBXTHs. The correlation coefficient Rs(R,Radj and RCV) of the three QSPR models are all above 0.99, belongs to the optimal level. And, the high correlation coefficient and cross-validated correlation coefficient demonstrate that the new proposed QSPR modelsare more robust and have increased predictive power. Table 1 gives the predicted (Pre.) values of the thermodynamic properties (SθfGθ and ΔRGθ) of XTH and 135 PBXTHsusing the Eq (5) to Eq (7).

The analysis of plots has shown to be very useful to confirm the quality of a model or to detect the anomalies. The plots of the calculated value in literature[19] versus the predicted values of the thermodynamic properties (SθfGθ and ΔRGθ) of XTH and 135 PBXTHsare presented in Figs 46, which show that they are very close. And the average relative error is only 0.85%, 1.19%, and 0.79%, respectively. All the results show that the three QSPR models have a good predictive power.

thumbnail
Fig 4. The predicted values versus calculated values in literature[19]of Sθ.

https://doi.org/10.1371/journal.pone.0147126.g004

thumbnail
Fig 5. The predicted values versus calculated values in literature[19]of ΔfGθ.

https://doi.org/10.1371/journal.pone.0147126.g005

thumbnail
Fig 6. The predicted values versus calculated values in literature[19]of ΔRGθ.

https://doi.org/10.1371/journal.pone.0147126.g006

QSPR models cross-validation

All predictive QSPR models require validation to decide whether they can be used to make predictions. If a QSPR model cannot be used to make a prediction, then it is of no practical use. The quality of goodness-of-fit of the models is quantified by the correlation coefficient (including R, Radj., and RCV), the standard error (S), the Fisher statistic value (F) and the average relative error (ARE). On the other hand, it is worth mentioning that the models having the best correlation potential need not have the best predictive value[24].

Generally, the most popular validation criterion to explore the robustness of a predictive model is to analyze the influence of each individual object that configures the final equation. This procedure is known as cross-validation (CV) or internal validation by leave-one-out (LOO) [25]. The leave-one-out cross-validations were performed in training test. And each time one compound is left out from the training set, and then the model based on the others is used to predict the compound extracted, that is, a model is built with n -1 compounds and the n-th compound is predicted. For the test set, the predicted values are obtained from the model using the whole training set. The parameters of the method can play important roles in assessing the performance of QSPR models, which are SS, DS and RCV[26].

The correlation coefficient for cross-validation (RCV) is then calculated by the following equation: where n is the number of compounds included in the QSPR models, yi,cal and yi,pre are the calculated value in literature [19] and the predicted values obtained in this paper using the Eqs (5), (6) and (7), respectively and yi,avg is the average calculated values in literature[19]. From Eq (5) to Eq (7), one can see that the quality of the models for the thermodynamic properties (Sθ, ΔfGθ and ΔRGθ) of XTH and 135 PBXTHs are satisfactory. And all the values of R and RCV are very close, which shows the good stability and predictivity of the three QSPR models.

In this study, the calculated values in literature [19]of the standard enthalpy of formation ΔfHθ were studied as test set, and the QSPR model were obtained between the new quantum topological indices XP1XP2 and the calculated values of the standard enthalpy of formation ΔfHθ, according to the topological model ΔfHθ = a1+ a2PX1 + a3PX2. The result shown as following: (8)

By comparing the calculated values in literature [10]of the standard enthalpy of formation ΔfHθ,with the predicted values obtained with Eq (8) in this paper, the results show that there are very good correlations. Fig 7 shows that the calculated versus the predicted values obtained with Eq (8) follows a straight line. Fig 7 shows the dispersion as a function of the predicted property. Horizontal lines in this figure indicate the standard deviation limits of ±2S. The residuals exceed seldom the standard deviation of ±2S from Fig 8. Accordingly, from Figs 7 and 8 and the statistical results of Eq (8), it can be concluded that the QSPR model is excellent. And the cross-validated RCV values (RCV = 0.9974) are very close to the corresponding R value (R = 0.9975). Clearly, the cross-validation demonstrates the final model to be statistically significant.

thumbnail
Fig 7. The predicted values versus calculated values in literature[19]of ΔfHθ.

https://doi.org/10.1371/journal.pone.0147126.g007

thumbnail
Fig 8. Residuals plot of the predicted values versus calculated values in literature[19]of ΔfHθ.

https://doi.org/10.1371/journal.pone.0147126.g008

Conclusions

  1. The new quantum topological indices XP1XP2 efficiently encode the information of chemical environment from the aspect of the equilibrium electro-negativity and the spatial topological distance by revising the traditional distance matrixwith the topological distance matrix 3D.
  2. Based on the new quantum topological indices XP1XP2, quantitative structure −property relationship modelsare built to study the thermodynamic properties(Sθ, ΔfHθ, ΔfGθand ΔRGθ) of XTH and 135 PBXTHs by the MLR method. Excellent structure−property modelsshow the efficiency of these indices in QSPR studies. In addition, the final model is validated to be statistically reliable and predictive using the general leave-one-out method.
  3. Comparison with reference models demonstrate that this new method is very efficient and provides satisfactory results with significant improvements, both in accuracy and stability for predicting the thermodynamic properties of XTH and 135 PBXTHs.

Acknowledgments

This paper was financially supported by the National Natural Science Foundation of China (No. 51574152,11205084), the Foundation of Hunan Province Science and Technology Department (2014GK3079), the social development and science &technology support of Hengyang city (No.2014KS26), andthe Hunan Provincial Key Laboratory of Materials Protection for Electric Power and Transportation (No. 2016CL03), Changsha University of Science & Technology, P. R. China. The helpful comments from anonymous reviewers are also gratefully acknowledged.

Author Contributions

Conceived and designed the experiments: GP. Analyzed the data: LY. Contributed reagents/materials/analysis tools: FX. Wrote the paper: FX CN.

References

  1. 1. Netzeva T I, Schultz T W.QSARs for the aquatic toxicity of aromatic aldehydes from Tetrahymena data. Chemosphere 2005 61: 1632. pmid:15950260
  2. 2. Liu F P, Cao C Z, Cheng B.A Quantitative Structure-Property Relationship (QSPR) study of Aliphatic Alcohol by the Method of Dividing the Molecular Structure into Substructure. Int. J. Mol. Sci 2011 12: 2448. pmid:21731451
  3. 3. Zhou C Y, Nie C M, Li S, Li Z H. A novel semi-empirical topological descriptor Nt and the application to study on QSPR/QSAR.J. Comput.Chem 2007 28: 2413. pmid:17476714
  4. 4. Kiralj R, Ferreira MMC.Chemometric modeling of core-electron binding energies. J.Mol. Grap. Mod.2003 21: 435.
  5. 5. Rose K, Hall LH, Kier LB.QSAR Approach in Study of Mutagenicityof Aromatic and Heteroaromatic Amines. J. Chem. Inf. Comput. Sci.2002 42: 651.
  6. 6. Nie C M, Wu Y X, Wu R Y, Jiang S H, Zhou C Y. Applications of a new topological indes EDm in some aliphatic hydrocarbons. J. Theor. Comput.Chem 20098: 19.
  7. 7. Yang F, Wang Z, Huang Y, Ding X. Modification of Wiener Index and Its Application. J. Chem. Inf. Comput. Sci.2003 43: 753. pmid:12767133
  8. 8. Peng G.W, Xiao F Z, Nie C M, Liao L F, Yang S Y.Quantum Topological Method Studies on QSRR for Chiral Organic Compounds. Acta Chim Sinica 2011 69: 305.
  9. 9. Xiao F Z,Peng G.W,Nie C M, Wu Y X, Dai Y M. Quantum Topological Method Studies on the Thermodynamic Properties of Polychlorinated Phenoxazines. J Mol Struct 2014 1074: 679.
  10. 10. Lu C H, Guo W M, Hu X F, Wang Y, Yin C S.A Lu index for QSAR/QSPR studies.Chem. Phys. Lett. 2006 417: 11.
  11. 11. Bajaj S, Sambi SS, Gupta S, Madan AK. Model for prediction of anti-HIV activity of 2-pyridinone derivatives using novel topological descriptor.QSAR Combin. Sci.2006 25: 813.
  12. 12. Xiao F Z, Peng G.W, Nie C M,Yang S Y. Prediction of thermodynamic properties of PCDEs withquantum topological indices.CIESC J 2011 62: 1808.
  13. 13. Zhou C Y, Nie C M.Molecular Descriptors of Topology and a Study onQuantitative Structure and Property Relationships. Biol.Chem. Soc. Jpn.2007 80: 1504.
  14. 14. Hu L H, Tan Z L. Chin. Molecular topology and pharmacological action: A QSAR study of tetrazoles using topological information content (IC). J. Syn. Chem.20024: 285.
  15. 15. Yang Y B, Pu X Y. Novel distance-based atom-type topological indices DAI for QSAR/QSPR studies of alcohols. Acta. Pharm. Sinica.1995 30: 440.
  16. 16. Peng G.W, Xiao F Z, Nie C M, Liao L F, Yang S Y, Xiao X L.Quantitative relationship between thermal conductivityand structure of liquid alkanes. CIESC J 2011 62: 604.
  17. 17. Peng G.W, Xiao F Z, Nie C M, Liao L F, Yang S Y. StudyonQSRR ofgaschromatographyretentionindexforaliphaticalcoholsondifferentpolarity. Chem. Res. Appl. 2010 22: 1498.
  18. 18. Nie C M,Peng G.W, Xiao F Z, Li S, He X M, Li Z H, Zhou C Y.Study on TopologicalChemistry ofGasChromatographyRetention Index for Sulfides.Chin. J. Anal. Chem.2006 34: 1560.
  19. 19. Shang G, Liu X J, Wang Z Y, Yang G Y.Chin.Application of Genetic Stochastic Resonance Algorithm to Quantitative Structure-Activity Relationship Study. J. Struct. Chem. 2010 29: 227.
  20. 20. Gharagheizi F, Eslamimanesh A, Ilani-Kashkouli P, Richon D, MohammadiA. H.Thermodynamic consistency test for experimental data of water content of methane. Chem. Eng. Sci. 2012 76: 99.
  21. 21. Nie C M. Group Electronegativity. J. Wuhan. Univ. (Nat Sci Ed), 200046: 176.
  22. 22. Dai Y M, Huang K L, Li X, Cao Z, Zhu Z P. Yang D W.Simulation of 13C NMR chemical shifts of carbinol carbon atoms usingquantitative structure−spectrum relationships. J. Cent. South. Univ. 2011 18: 324.
  23. 23. Duchowicz P R, Castro E A, Fernandez M F,Gonzalez M P. QSPR modelling of normal boiling points and octanol/water partition coefficient for acyclic and cyclic hydrocarbons using SMILES-based optimal descriptors.Chem. Phys. Lett. 2005 412: 377.
  24. 24. Dai Y M, Liu Y N, Li X, Cao Z, Zhu Z P. Yang D W.Estimation of surface tension of organic compounds usingquantitative structure-property relationship J. Cent. South. Univ.2012 19:99.
  25. 25. Lucic B, Trinajstic N, Sild S. QSPR study on soil sorption coefficient for persistent organic pollutants. J. Chem. Inf. Comput. Sci. 1999 39: 610.
  26. 26. Agrawal VK, Bano S, Khadikar PV. Novel distance-based atom-type topological indices DAI for QSAR/QSPR studies of alcohols. Bioorg. Med. Chem. 200311: 5519.