Search tips
Search criteria 


Logo of jcheminfoJournal of Cheminformatics
J Cheminform. 2010; 2(Suppl 1): P19.
Published online 2010 May 4. doi:  10.1186/1758-2946-2-S1-P19
PMCID: PMC2867151

Updating existing QSAR models: selection and weighting of new data

Computational chemistry and quantitative structure-activity relationships (QSAR) are foreseen to be extensively used in the implementation of the new REACH regulation for chemicals in Europe. However, for some compound groups the data are too few in number to permit both calibration and testing of a new model. Usage and previously developed or updated models are then viable alternatives.

Perfluorocarboxylic acids (PFCAs) and fluoroteleomer alcohols (FTOHs) are two groups of environmentally relevant compounds, with unique physical and chemical properties. The subcooled liquid vapour pressure (pL) is one such property, where experimental determinations are limited and far from consistent [1]. Updating is, however, challenging when the new compounds are far outside of the original calibration domain space. But by carefully selecting and weighting only three new compounds, we have been able to update a previously developed general QSAR model [2], to cover the new domain while maintaining predictive performance for the earlier calibration and test data. The optimal weighting scheme was determined from the sample leverages and residuals in the calibration phase [3].

The performance of this re-calibrated model greatly surpassed previous modelling attempts [4], when applied to an external test set of two PFCAs and four FTOHs with pL in the range 0.2-200 Pa; with Q2Ext = 0.994 and RMSEP = 0.190 units of log Pa. The domain coverage also increased from 1% to 51%, for 426 perfluoroalkylated compounds selected from the REACH registration list, the PhysProp database, and the OECD 2006 survey [5]. Selection and weighting of new calibration data can thus facilitate the extension and use of existing QSAR models.

This investigation was supported by the EU FP7 project CADASTER (grant agreement no. 212668).


  • Goss K-U, Bronner G, Harner T, Hertel M, Schmidt TC. Environ Sci Technol. 2006. p. 3572. [PubMed] [Cross Ref]
  • Öberg T, Liu T. QSAR Comb Sci. 2008. p. 273. [Cross Ref]
  • Stork CL, Kowalski BR. Chemometr Intell Lab Syst. 1999. p. 151. [Cross Ref]
  • Arp HP, Niederer C, Goss K-U. Environ Sci Technol. 2006. p. 7298. [PubMed] [Cross Ref]
  • Lists of PFOS, PFAS, PFOA, PFCA, related compounds and chemicals that may degrade to PFCA. ENV/JM/MONO(2006)16, OECD. 2007.

Articles from Journal of Cheminformatics are provided here courtesy of Springer