DOKK / manpages / debian 10 / hhsuite / hhmake.1.en
HHMAKE(1) User Commands HHMAKE(1)

hhmake - build an HMM from an input alignment or convert between HMMER format and HHsearch format

hhmake -i file [options]

HHmake 3.0.0 (15-03-2015) Build an HMM from an input alignment in A2M, A3M, or FASTA format, or convert between HMMER format (.hmm) and HHsearch format (.hhm). Remmert M, Biegert A, Hauser A, and Soding J. HHblits: Lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9:173-175 (2011). (C) Johannes Soeding, Michael Remmert, Andreas Biegert, Andreas Hauser

query alignment (A2M, A3M, or FASTA), or query HMM

HMM file to be written to (default=<infile.hhm>)
HMM file to be appended to
verbose mode: 0:no screen output 1:only warings 2: verbose
max. number of query/template sequences displayed (def=10) Beware of overflows! All these sequences are stored in memory.
make consensus sequence master sequence of query MSA
use this name for HMM (default: use name of first sequence)

Filter query multiple sequence alignment

[0,100] maximum pairwise sequence identity (%) (def=90)
filter MSA by selecting most diverse set of sequences, keeping at least this many seqs in each MSA block of length 50 (def=100)
[0,100] minimum coverage with query (%) (def=0)
[0,100] minimum sequence identity with query (%) (def=0)
[0,100] minimum score per column with query (def=-20.0)
target diversity of alignment (default=off)

use A2M/A3M (default): upper case = Match; lower case = Insert; '-' = Delete; '.' = gaps aligned to inserts (may be omitted)
use FASTA: columns with residue in 1st sequence are match states
use FASTA: columns with fewer than X% gaps are match states

Context specific hhm pseudocounts:
position dependence of pc admixture 'tau' (pc mode, default=0)
0: no pseudo counts:
tau = 0
1: constant
tau = a
2: diversity-dependent: tau = a/(1+((Neff[i]-1)/b)^c) 3: CSBlast admixture: tau = a(1+b)/(Neff[i]+b) (Neff[i]: number of effective seqs in local MSA around column i)
[0,1] overall pseudocount admixture (def=0.9)
[1,inf[ Neff threshold value for mode 2 (def=4.0)
[0,3] extinction exponent c for mode 2 (def=1.0)
Context independent hhm pseudocounts (used for templates; used for query if contxt file is not available):
position dependence of pc admixture 'tau' (pc mode, default=2)
0: no pseudo counts:
tau = 0
1: constant
tau = a
2: diversity-dependent: tau = a/(1+((Neff[i]-1)/b)^c) (Neff[i]: number of effective seqs in local MSA around column i)
[0,1] overall pseudocount admixture (def=1.0)
[1,inf[ Neff threshold value for mode 2 (def=1.5)
[0,3] extinction exponent c for mode 2 (def=1.0)
Context-specific pseudo-counts:
use substitution-matrix instead of context-specific pseudocounts

-contxt <file> context file for computing context-specific pseudocounts (default=./data/context_data.crf)

max number of HMM columns (def=20001)

Example: hhmake -i test.a3m

February 2019 hhmake 3.0~beta3+dfsg