Current time in Korea 03:57 Aug 22 (Thu) Year 2019 KCS KCS Publications
KCS Publications
My Journal Log In Register
HOME > Search > Browsing(BKCS) > Archives

Bulletin of the Korean Chemical Society (BKCS)

ISSN 0253-2964(Print)
ISSN 1229-5949(Online)
Volume 33, Number 5
BKCSDE 33(5)
May 20, 2012 

A New Variable Selection Method Based on Mutual Information Maximization by Replacing Collinear Variables for Nonlinear Quantitative Structure-Property Relationship Models
Jahan B. Ghasemi,* Ehsan Zolfonoun
Mutual information, Variable selection, Quantitative structure-property relationship
Selection of the most informative molecular descriptors from the original data set is a key step for development of quantitative structure activity/property relationship models. Recently, mutual information (MI) has gained increasing attention in feature selection problems. This paper presents an effective mutual information-based feature selection approach, named mutual information maximization by replacing collinear variables (MIMRCV), for nonlinear quantitative structure-property relationship models. The proposed variable selection method was applied to three different QSPR datasets, soil degradation half-life of 47 organophosphorus pesticides, GC-MS retention times of 85 volatile organic compounds, and water-to-micellar cetyltrimethylammonium bromide partition coefficients of 62 organic compounds.The obtained results revealed that using MIMRCV as feature selection method improves the predictive quality of the developed models compared to conventional MI based variable selection algorithms.
1527 - 1535
Full Text