RDKIT2FPS(1) User Commands RDKIT2FPS(1)

rdkit2fps - rdkit2fps

usage: rdkit2fps [-h] [--fpSize INT] [--RDK] [--minPath INT] [--maxPath INT]
                 [--nBitsPerHash INT] [--useHs 0|1] [--morgan] [--radius INT]
                 [--useFeatures 0|1] [--useChirality 0|1] [--useBondTypes 0|1]
                 [--torsions] [--targetSize INT] [--pairs] [--minLength INT]
                 [--maxLength INT] [--maccs166] [--substruct] [--rdmaccs]
                 [--id-tag NAME] [--in FORMAT] [-o FILENAME]
                 [--errors {strict,report,ignore}]
                 [filenames [filenames ...]]

Generate FPS fingerprints from a structure file using RDKit

filenames
       input structure files (default is stdin)

-h, --help
       show this help message and exit
--fpSize INT
       number of bits in the fingerprint (applies to RDK, Morgan, topological torsion, and atom pair fingerprints) (default=2048)
--id-tag NAME
       tag name containing the record id (SD files only)
--in FORMAT
       input structure format (default guesses from filename)
-o FILENAME
       save the fingerprints to FILENAME (default=stdout)
--errors {strict,report,ignore}
       how should structure parse errors be handled? (default=strict)

--RDK
       generate RDK fingerprints (default)
--minPath INT
       minimum number of bonds to include in the subgraph (default=1)
--maxPath INT
       maximum number of bonds to include in the subgraph (default=7)
--nBitsPerHash INT
       number of bits to set per path (default=4)
--useHs 0|1
       include information about the number of hydrogens on each atom (default=1)

--morgan
       generate Morgan fingerprints
--radius INT
       radius for the Morgan algorithm (default=2)
--useFeatures 0|1
       use chemical-feature invariants (default=0)
--useChirality 0|1
       include chirality information (default=0)
--useBondTypes 0|1
       include bond type information (default=1)

--torsions
       generate Topological Torsion fingerprints
--targetSize INT
       number of bits in the fingerprint (default=4)

--pairs
       generate Atom Pair fingerprints
--minLength INT
       minimum bond count for a pair (default=1)
--maxLength INT
       maximum bond count for a pair (default=30)

--maccs166
       generate MACCS fingerprints

--substruct
       generate ChemFP substructure fingerprints

--rdmaccs
       generate 166 bit RDKit/MACCS fingerprints
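
As an illustration of how the options above combine, the following Python sketch assembles an rdkit2fps argument list for Morgan fingerprints. The helper name morgan_command and the file names are hypothetical; only flags that appear in the usage line are used, and the command is built and printed here, not executed.

```python
import shlex


def morgan_command(structure_file, output_file, radius=2, fp_size=2048,
                   use_chirality=False):
    """Build an argv list for generating Morgan fingerprints with rdkit2fps.

    Hypothetical helper: it only assembles flags listed in the usage line.
    """
    return [
        "rdkit2fps",
        "--morgan",
        "--radius", str(radius),
        "--fpSize", str(fp_size),
        # The 0|1 options take an explicit integer value.
        "--useChirality", "1" if use_chirality else "0",
        "-o", output_file,
        structure_file,
    ]


# Example file names are placeholders.
argv = morgan_command("ligands.sdf", "ligands.fps", radius=3, fp_size=1024)
print(" ".join(shlex.quote(arg) for arg in argv))
```

On a system where rdkit2fps is installed, the resulting list could be handed to subprocess.run(argv, check=True).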

This program guesses the input structure format based on the filename extension. If the data comes from stdin, or the extension is unknown, then use "--in" to specify the input format. The supported format extensions are:

File Type   Valid FORMATs (use gz if compressed)
---------   ------------------------------------
SMILES      smi, ism, can, smi.gz, ism.gz, can.gz
SDF         sdf, mol, sd, mdl, sdf.gz, mol.gz, sd.gz, mdl.gz
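
The extension-based guessing described above can be sketched in Python. This is an illustrative reimplementation, not chemfp's actual code: the function name guess_format, the "smi" and "sdf" family labels it returns, the case-sensitive matching, and the choice to raise ValueError for an unknown extension are all assumptions. Only the list of extensions is taken from the text above.

```python
import os

# Extensions taken from the list of valid FORMATs.
SMILES_EXTENSIONS = {"smi", "ism", "can"}
SDF_EXTENSIONS = {"sdf", "mol", "sd", "mdl"}


def guess_format(filename):
    """Return (family, is_gzipped) guessed from the filename extension.

    Assumed behavior: an unknown or missing extension raises ValueError,
    which is where a caller would fall back to an explicit --in value.
    """
    parts = os.path.basename(filename).split(".")
    is_gzipped = len(parts) > 1 and parts[-1] == "gz"
    if is_gzipped:
        parts = parts[:-1]  # look at the extension before ".gz"
    ext = parts[-1] if len(parts) > 1 else ""
    if ext in SMILES_EXTENSIONS:
        return "smi", is_gzipped
    if ext in SDF_EXTENSIONS:
        return "sdf", is_gzipped
    raise ValueError(
        "unknown extension in %r; specify the format with --in" % filename)


print(guess_format("ligands.sdf.gz"))  # ('sdf', True)
print(guess_format("data.can"))        # ('smi', False)
```
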
May 2013 rdkit2fps 1.1p1