Trees | Indices | Help |
|
---|
|
SMARTS definitions for the publically available MACCS keys and a MACCS fingerprinter I compared the MACCS fingerprints generated here with those from two other packages (not MDL, unfortunately). Of course there are disagreements between the various fingerprints still, but I think these definitions work pretty well. Some notes: 1) most of the differences have to do with aromaticity 2) there's a discrepancy sometimes because the current RDKit definitions do not require multiple matches to be distinct. e.g. the SMILES C(=O)CC(=O) can match the (hypothetical) key O=CC twice in my definition. It's not clear to me what the correct behavior is. 3) Some keys are not fully defined in the MDL documentation 4) Two keys, 125 and 166, have to be done outside of SMARTS. 5) Key 1 (ISOTOPE) isn't defined Rev history: 2006 (gl): Original open-source release May 2011 (gl): Update some definitions based on feedback from Andrew Dalke
|
|||
|
|||
|
|||
|
|
|||
smartsPatts = {1:('?', 0), 2:('[#104]', 0), 3:('[#32,#33,#34,#
|
|||
maccsKeys = None hash(x) |
|||
GenMACCSKeys = rdMolDescriptors.GetMACCSKeysFingerprint
|
|||
FingerprintMol = rdMolDescriptors.GetMACCSKeysFingerprint
|
Imports: Chem, rdMolDescriptors, DataStructs
|
*Internal Use Only* generates SMARTS patterns for the keys, run once |
generates the MACCS fingerprint for a molecules **Arguments** - mol: the molecule to be fingerprinted - any extra keyword arguments are ignored **Returns** a _DataStructs.SparseBitVect_ containing the fingerprint. >>> m = Chem.MolFromSmiles('CNO') >>> bv = GenMACCSKeys(m) >>> tuple(bv.GetOnBits()) (24, 68, 69, 71, 93, 94, 102, 124, 131, 139, 151, 158, 160, 161, 164) >>> bv = GenMACCSKeys(Chem.MolFromSmiles('CCC')) >>> tuple(bv.GetOnBits()) (74, 114, 149, 155, 160) |
|
smartsPatts
|
Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0.1 on Sat Apr 23 18:49:15 2016 | http://epydoc.sourceforge.net |