Abstract
We have designed a method to encode properties related to the electron densities of molecules (calculated 1H and 13C NMR shifts and atomic partial charges) in molecular fingerprints (EDprints). EDprints was evaluated in terms of their retrospective virtual screening accuracy against the Directory of Useful Decoys (DUD) and compared to the established ligand-based similarity search methods MOLPRINT 2D and FCFP-4. Although there are no significant differences in the overall virtual screening accuracies of the three methods, specific examples highlight interesting differences between the new EDprints fingerprint method and the atom-centered circular fingerprint methods of MOLPRINT 2D and FCFP-4. On one hand, EDprints similarity searches can be biased by the molecular protonation state, especially when reference ligands contain multiple ionizable groups. On the other hand, EDprints models are more robust toward subtle rearrangements of chemical groups and more suitable for screening against reference molecules with fused ring systems than MOLPRINT 2D and FCFP-4. EDprints is furthermore the fastest method under investigation in comparing fingerprints (average 56-233-fold increase in speed), which makes it highly suitable for all-against-all similarity searches and for repetitive virtual screening against large chemical databases of millions of compounds.
Original language | English |
---|---|
Journal | Journal of Chemical Information and Modeling |
Volume | 50 |
Issue number | 10 |
Pages (from-to) | 1772-1780 |
Number of pages | 9 |
ISSN | 1549-9596 |
DOIs | |
Publication status | Published - 25 Oct 2010 |
Externally published | Yes |