Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques

Vasanthanathan Poongavanam; Olivier Taboureau; Chris Oostenbrink; Nico P E Vermeulen; Lars Olsen; Flemming Steen Jørgensen

doi:10.1124/dmd.108.023507

Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques

Vasanthanathan Poongavanam, Olivier Taboureau, Chris Oostenbrink, Nico P E Vermeulen, Lars Olsen, Flemming Steen Jørgensen

79 Citations (Scopus)

Abstract

The cytochrome P450 (CYP) superfamily plays an important role in the metabolism of drug compounds, and it is therefore highly desirable to have models that can predict whether a compound interacts with a specific isoform of the CYPs. In this work, we provide in silico models for classification of CYP1A2 inhibitors and non-inhibitors. Training and test sets consisted of about 400 and 7000 compounds, respectively. Various machine learning techniques, like binary QSAR, support vector machine (SVM), random forest, kappa nearest neighbors (kNN), and decision tree methods were used to develop in silico models, based on Volsurf and MOE descriptors. The best models were obtained using the SVM, random forest, and kNN methods in combination with the BestFirst variable selection method, resulting in models with 73 - 76 % of accuracy on the test set prediction (Matthews Correlation Coefficient of 0.51 and 0.52). Finally, a decision tree model based on Lipinski's Rule-of-five descriptors was also developed. This model predicts 67 % of the compounds correctly and gives a simple and interesting insight into the issue of classification. All the developed models in this work are fast and precise enough to be applicable for virtual screening of CYP1A2 inhibitors or non-inhibitors, or can be used as simple filters in the drug discovery process.

Original language	English
Journal	Drug Metabolism and Disposition
Volume	37
Issue number	3
Pages (from-to)	658-664
ISSN	0090-9556
DOIs	https://doi.org/10.1124/dmd.108.023507
Publication status	Published - 2009

Keywords

Former Faculty of Pharmaceutical Sciences

Access to Document

10.1124/dmd.108.023507

Cite this

Poongavanam, V., Taboureau, O., Oostenbrink, C., Vermeulen, N. P. E., Olsen, L., & Jørgensen, F. S. (2009). Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques. Drug Metabolism and Disposition, 37(3), 658-664. https://doi.org/10.1124/dmd.108.023507

@article{4d5f09e0f8eb11ddb219000ea68e967b,

title = "Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques",

abstract = "The cytochrome P450 (CYP) superfamily plays an important role in the metabolism of drug compounds, and it is therefore highly desirable to have models that can predict whether a compound interacts with a specific isoform of the CYPs. In this work, we provide in silico models for classification of CYP1A2 inhibitors and non-inhibitors. Training and test sets consisted of about 400 and 7000 compounds, respectively. Various machine learning techniques, like binary QSAR, support vector machine (SVM), random forest, kappa nearest neighbors (kNN), and decision tree methods were used to develop in silico models, based on Volsurf and MOE descriptors. The best models were obtained using the SVM, random forest, and kNN methods in combination with the BestFirst variable selection method, resulting in models with 73 - 76 % of accuracy on the test set prediction (Matthews Correlation Coefficient of 0.51 and 0.52). Finally, a decision tree model based on Lipinski's Rule-of-five descriptors was also developed. This model predicts 67 % of the compounds correctly and gives a simple and interesting insight into the issue of classification. All the developed models in this work are fast and precise enough to be applicable for virtual screening of CYP1A2 inhibitors or non-inhibitors, or can be used as simple filters in the drug discovery process.",

keywords = "Former Faculty of Pharmaceutical Sciences",

author = "Vasanthanathan Poongavanam and Olivier Taboureau and Chris Oostenbrink and Vermeulen, {Nico P E} and Lars Olsen and J{\o}rgensen, {Flemming Steen}",

year = "2009",

doi = "10.1124/dmd.108.023507",

language = "English",

volume = "37",

pages = "658--664",

journal = "Drug Metabolism and Disposition",

issn = "0090-9556",

publisher = "American Society for Pharmacology and Experimental Therapeutics",

number = "3",

}

TY - JOUR

T1 - Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques

AU - Poongavanam, Vasanthanathan

AU - Taboureau, Olivier

AU - Oostenbrink, Chris

AU - Vermeulen, Nico P E

AU - Olsen, Lars

AU - Jørgensen, Flemming Steen

PY - 2009

Y1 - 2009

N2 - The cytochrome P450 (CYP) superfamily plays an important role in the metabolism of drug compounds, and it is therefore highly desirable to have models that can predict whether a compound interacts with a specific isoform of the CYPs. In this work, we provide in silico models for classification of CYP1A2 inhibitors and non-inhibitors. Training and test sets consisted of about 400 and 7000 compounds, respectively. Various machine learning techniques, like binary QSAR, support vector machine (SVM), random forest, kappa nearest neighbors (kNN), and decision tree methods were used to develop in silico models, based on Volsurf and MOE descriptors. The best models were obtained using the SVM, random forest, and kNN methods in combination with the BestFirst variable selection method, resulting in models with 73 - 76 % of accuracy on the test set prediction (Matthews Correlation Coefficient of 0.51 and 0.52). Finally, a decision tree model based on Lipinski's Rule-of-five descriptors was also developed. This model predicts 67 % of the compounds correctly and gives a simple and interesting insight into the issue of classification. All the developed models in this work are fast and precise enough to be applicable for virtual screening of CYP1A2 inhibitors or non-inhibitors, or can be used as simple filters in the drug discovery process.

AB - The cytochrome P450 (CYP) superfamily plays an important role in the metabolism of drug compounds, and it is therefore highly desirable to have models that can predict whether a compound interacts with a specific isoform of the CYPs. In this work, we provide in silico models for classification of CYP1A2 inhibitors and non-inhibitors. Training and test sets consisted of about 400 and 7000 compounds, respectively. Various machine learning techniques, like binary QSAR, support vector machine (SVM), random forest, kappa nearest neighbors (kNN), and decision tree methods were used to develop in silico models, based on Volsurf and MOE descriptors. The best models were obtained using the SVM, random forest, and kNN methods in combination with the BestFirst variable selection method, resulting in models with 73 - 76 % of accuracy on the test set prediction (Matthews Correlation Coefficient of 0.51 and 0.52). Finally, a decision tree model based on Lipinski's Rule-of-five descriptors was also developed. This model predicts 67 % of the compounds correctly and gives a simple and interesting insight into the issue of classification. All the developed models in this work are fast and precise enough to be applicable for virtual screening of CYP1A2 inhibitors or non-inhibitors, or can be used as simple filters in the drug discovery process.

KW - Former Faculty of Pharmaceutical Sciences

U2 - 10.1124/dmd.108.023507

DO - 10.1124/dmd.108.023507

M3 - Journal article

C2 - 19056915

SN - 0090-9556

VL - 37

SP - 658

EP - 664

JO - Drug Metabolism and Disposition

JF - Drug Metabolism and Disposition

IS - 3

ER -

Classification of cytochrome P450 1A2 inhibitors and noninhibitors by machine learning techniques

Abstract

Keywords

Access to Document

Fingerprint

Cite this