BaTool
BaTool is a real-time speech recognition engine, that gets a triphone Kaldi model and do a low latency speech recognition on the data coming from the system default microphone.
BaTool Conf
Batool is configurable via conf
file next to BaTool application. It has 4 main sections to configure:
Model Section:
This section defines the location of Kaldi model files. The addresses are relative to where BaTool application exists.
; Model path
[model]
fst = "Model/HCLG.fst"
mdl = "Model/final.oalimdl"
word = "Model/words.txt"
cmvn = "Model/global_cmvn.stats"
Decoder Section:
Kaldi related decoder settings.
max_active = 900
min_active = 200
train_max = 5
ac_scale = 0.05
min_sil = 20
train_max
: Maximum number of same samples for later training.min_sil
: Minimum silence before detection. This value directly connected with accuracy and latency.
Captain Section:
; Threshold for detecting words
[captain]
hard_threshold = 0.35
Miscellaneous Section:
; Miscellaneous parameters
[misc]
mic = "Sennheiser"
channel = "com_binaee_rebound"