Bulletin of the Korean Chemical Society (BKCS)

ISSN 0253-2964(Print)
ISSN 1229-5949(Online)
Volume 33, Number 5
BKCSDE 33(5)
May 20, 2012 

 
Title
A New Variable Selection Method Based on Mutual Information Maximization by Replacing Collinear Variables for Nonlinear Quantitative Structure-Property Relationship Models
Author
Jahan B. Ghasemi,* Ehsan Zolfonoun
Keywords
Mutual information, Variable selection, Quantitative structure-property relationship
Abstract
Selecting the most informative molecular descriptors from the original data set is a key step in developing quantitative structure-activity/property relationship (QSAR/QSPR) models. Recently, mutual information (MI) has gained increasing attention in feature selection problems. This paper presents an effective mutual information-based feature selection approach, named mutual information maximization by replacing collinear variables (MIMRCV), for nonlinear quantitative structure-property relationship models. The proposed variable selection method was applied to three different QSPR datasets: the soil degradation half-life of 47 organophosphorus pesticides, the GC-MS retention times of 85 volatile organic compounds, and the water-to-micellar cetyltrimethylammonium bromide partition coefficients of 62 organic compounds. The results show that using MIMRCV as the feature selection method improves the predictive quality of the developed models compared to conventional MI-based variable selection algorithms.
Page
1527 - 1535
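
The abstract names the idea (rank descriptors by mutual information with the property, and deal with collinear descriptors) but does not give the MIMRCV procedure itself. The following is therefore only a minimal sketch of a generic MI-based ranking with a collinearity filter, not the authors' algorithm: it skips a candidate that is collinear with an already selected descriptor rather than applying any replacement rule. The function names, the histogram MI estimator, the bin count, the correlation threshold `r_max`, and the toy descriptors `d1` to `d3` are all assumptions made for illustration.

```python
import math
from collections import Counter


def discretize(values, bins=4):
    """Map continuous values to equal-width bin indices 0..bins-1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


def mutual_information(x, y, bins=4):
    """Histogram estimate of I(X;Y) in nats (assumed estimator)."""
    xb, yb = discretize(x, bins), discretize(y, bins)
    n = len(xb)
    px, py, pxy = Counter(xb), Counter(yb), Counter(zip(xb, yb))
    # sum over occupied cells of p(a,b) * log(p(a,b) / (p(a) * p(b)))
    return sum(
        (c / n) * math.log(c * n / (px[a] * py[b]))
        for (a, b), c in pxy.items()
    )


def pearson(x, y):
    """Pearson correlation coefficient; 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def select_descriptors(columns, target, k, r_max=0.9, bins=4):
    """Greedy selection: highest MI first, skipping collinear candidates."""
    ranked = sorted(
        columns,
        key=lambda name: mutual_information(columns[name], target, bins),
        reverse=True,
    )
    selected = []
    for name in ranked:
        # keep the candidate only if |r| <= r_max against every selected descriptor
        if all(abs(pearson(columns[name], columns[s])) <= r_max for s in selected):
            selected.append(name)
        if len(selected) == k:
            break
    return selected


# Hypothetical toy data: d2 is an affine copy of d1, so the two are collinear.
target = [float(i) for i in range(20)]
columns = {
    "d1": [float(i * i) for i in range(20)],
    "d2": [2.0 * i * i + 1.0 for i in range(20)],
    "d3": [float(i % 3) for i in range(20)],
}
selected = select_descriptors(columns, target, k=2)
print(selected)
```

In this toy case the filter keeps only one of the two collinear descriptors, so the second slot goes to a descriptor that carries different information. A pure MI ranking without the filter would spend both slots on `d1` and `d2`.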