Last active
November 28, 2018 17:02
-
-
Save rain1024/f3d368e27a037b37607585c3b6735709 to your computer and use it in GitHub Desktop.
ASR
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/home/anhv/anaconda3/envs/automatic_speech_recognition/bin/python /home/anhv/PycharmProjects/undertheseanlp/automatic_speech_recognition/egs/vivos/train.py --kaldi_folder /home/anhv/PycharmProjects/kaldi-trunk --corpus_folder /home/anhv/PycharmProjects/undertheseanlp/automatic_speech_recognition/data/vivos/corpus --nj 10 | |
Init Kaldi Speech Recognition in uts_734 folder | |
===== PREPARING ACOUSTIC DATA ===== | |
===== FEATURES EXTRACTION ===== | |
steps/make_mfcc.sh --nj 10 --cmd run.pl data/train exp/make_mfcc/train mfcc | |
utils/validate_data_dir.sh: Successfully validated data-directory data/train | |
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance. | |
Succeeded creating MFCC features for train | |
steps/make_mfcc.sh --nj 10 --cmd run.pl data/test exp/make_mfcc/test mfcc | |
utils/validate_data_dir.sh: WARNING: you have only one speaker. This probably a bad idea. | |
Search for the word 'bold' in http://kaldi-asr.org/doc/data_prep.html | |
for more information. | |
utils/validate_data_dir.sh: Successfully validated data-directory data/test | |
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance. | |
Succeeded creating MFCC features for test | |
steps/compute_cmvn_stats.sh data/train exp/make_mfcc/train mfcc | |
Succeeded creating CMVN stats for train | |
steps/compute_cmvn_stats.sh data/test exp/make_mfcc/test mfcc | |
Succeeded creating CMVN stats for test | |
===== PREPARING LANGUAGE DATA ===== | |
utils/prepare_lang.sh data/local/dict <UNK> data/local/lang data/lang | |
Checking data/local/dict/silence_phones.txt ... | |
--> reading data/local/dict/silence_phones.txt | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> data/local/dict/silence_phones.txt is OK | |
Checking data/local/dict/optional_silence.txt ... | |
--> reading data/local/dict/optional_silence.txt | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> data/local/dict/optional_silence.txt is OK | |
Checking data/local/dict/nonsilence_phones.txt ... | |
--> reading data/local/dict/nonsilence_phones.txt | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> data/local/dict/nonsilence_phones.txt is OK | |
Checking disjoint: silence_phones.txt, nonsilence_phones.txt | |
--> disjoint property is OK. | |
Checking data/local/dict/lexicon.txt | |
--> reading data/local/dict/lexicon.txt | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> data/local/dict/lexicon.txt is OK | |
Checking data/local/dict/extra_questions.txt ... | |
--> data/local/dict/extra_questions.txt is empty (this is OK) | |
--> SUCCESS [validating dictionary directory data/local/dict] | |
**Creating data/local/dict/lexiconp.txt from data/local/dict/lexicon.txt | |
fstaddselfloops data/lang/phones/wdisambig_phones.int data/lang/phones/wdisambig_words.int | |
prepare_lang.sh: validating output directory | |
utils/validate_lang.pl data/lang | |
Checking data/lang/phones.txt ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> data/lang/phones.txt is OK | |
Checking words.txt: #0 ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> data/lang/words.txt is OK | |
Checking disjoint: silence.txt, nonsilence.txt, disambig.txt ... | |
--> silence.txt and nonsilence.txt are disjoint | |
--> silence.txt and disambig.txt are disjoint | |
--> disambig.txt and nonsilence.txt are disjoint | |
--> disjoint property is OK | |
Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... | |
--> found no unexplainable phones in phones.txt | |
Checking data/lang/phones/context_indep.{txt, int, csl} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 10 entry/entries in data/lang/phones/context_indep.txt | |
--> data/lang/phones/context_indep.int corresponds to data/lang/phones/context_indep.txt | |
--> data/lang/phones/context_indep.csl corresponds to data/lang/phones/context_indep.txt | |
--> data/lang/phones/context_indep.{txt, int, csl} are OK | |
Checking data/lang/phones/nonsilence.{txt, int, csl} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 112 entry/entries in data/lang/phones/nonsilence.txt | |
--> data/lang/phones/nonsilence.int corresponds to data/lang/phones/nonsilence.txt | |
--> data/lang/phones/nonsilence.csl corresponds to data/lang/phones/nonsilence.txt | |
--> data/lang/phones/nonsilence.{txt, int, csl} are OK | |
Checking data/lang/phones/silence.{txt, int, csl} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 10 entry/entries in data/lang/phones/silence.txt | |
--> data/lang/phones/silence.int corresponds to data/lang/phones/silence.txt | |
--> data/lang/phones/silence.csl corresponds to data/lang/phones/silence.txt | |
--> data/lang/phones/silence.{txt, int, csl} are OK | |
Checking data/lang/phones/optional_silence.{txt, int, csl} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 1 entry/entries in data/lang/phones/optional_silence.txt | |
--> data/lang/phones/optional_silence.int corresponds to data/lang/phones/optional_silence.txt | |
--> data/lang/phones/optional_silence.csl corresponds to data/lang/phones/optional_silence.txt | |
--> data/lang/phones/optional_silence.{txt, int, csl} are OK | |
Checking data/lang/phones/disambig.{txt, int, csl} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 2 entry/entries in data/lang/phones/disambig.txt | |
--> data/lang/phones/disambig.int corresponds to data/lang/phones/disambig.txt | |
--> data/lang/phones/disambig.csl corresponds to data/lang/phones/disambig.txt | |
--> data/lang/phones/disambig.{txt, int, csl} are OK | |
Checking data/lang/phones/roots.{txt, int} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 30 entry/entries in data/lang/phones/roots.txt | |
--> data/lang/phones/roots.int corresponds to data/lang/phones/roots.txt | |
--> data/lang/phones/roots.{txt, int} are OK | |
Checking data/lang/phones/sets.{txt, int} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 30 entry/entries in data/lang/phones/sets.txt | |
--> data/lang/phones/sets.int corresponds to data/lang/phones/sets.txt | |
--> data/lang/phones/sets.{txt, int} are OK | |
Checking data/lang/phones/extra_questions.{txt, int} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 9 entry/entries in data/lang/phones/extra_questions.txt | |
--> data/lang/phones/extra_questions.int corresponds to data/lang/phones/extra_questions.txt | |
--> data/lang/phones/extra_questions.{txt, int} are OK | |
Checking data/lang/phones/word_boundary.{txt, int} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 122 entry/entries in data/lang/phones/word_boundary.txt | |
--> data/lang/phones/word_boundary.int corresponds to data/lang/phones/word_boundary.txt | |
--> data/lang/phones/word_boundary.{txt, int} are OK | |
Checking optional_silence.txt ... | |
--> reading data/lang/phones/optional_silence.txt | |
--> data/lang/phones/optional_silence.txt is OK | |
Checking disambiguation symbols: #0 and #1 | |
--> data/lang/phones/disambig.txt has "#0" and "#1" | |
--> data/lang/phones/disambig.txt is OK | |
Checking topo ... | |
Checking word_boundary.txt: silence.txt, nonsilence.txt, disambig.txt ... | |
--> data/lang/phones/word_boundary.txt doesn't include disambiguation symbols | |
--> data/lang/phones/word_boundary.txt is the union of nonsilence.txt and silence.txt | |
--> data/lang/phones/word_boundary.txt is OK | |
Checking word-level disambiguation symbols... | |
--> data/lang/phones/wdisambig.txt exists (newer prepare_lang.sh) | |
Checking word_boundary.int and disambig.int | |
--> generating a 70 word sequence | |
--> resulting phone sequence from L.fst corresponds to the word sequence | |
--> L.fst is OK | |
--> generating a 51 word sequence | |
--> resulting phone sequence from L_disambig.fst corresponds to the word sequence | |
--> L_disambig.fst is OK | |
Checking data/lang/oov.{txt, int} ... | |
--> text seems to be UTF-8 or ASCII, checking whitespaces | |
--> text contains only allowed whitespaces | |
--> 1 entry/entries in data/lang/oov.txt | |
--> data/lang/oov.int corresponds to data/lang/oov.txt | |
--> data/lang/oov.{txt, int} are OK | |
--> data/lang/L.fst is olabel sorted | |
--> data/lang/L_disambig.fst is olabel sorted | |
--> SUCCESS [validating lang directory data/lang] | |
===== LANGUAGE MODEL CREATION ===== | |
===== MAKING lm.arpa ===== | |
===== MAKING G.fst ===== | |
arpa2fst --disambig-symbol=#0 --read-symbol-table=data/lang/words.txt data/local/tmp/lm.arpa data/lang/G.fst | |
LOG (arpa2fst[5.5.70~1-e5446]:Read():arpa-file-parser.cc:94) Reading \data\ section. | |
LOG (arpa2fst[5.5.70~1-e5446]:Read():arpa-file-parser.cc:149) Reading \1-grams: section. | |
LOG (arpa2fst[5.5.70~1-e5446]:RemoveRedundantStates():arpa-lm-compiler.cc:359) Reduced num-states from 1 to 1 | |
===== MONO TRAINING ===== | |
steps/train_mono.sh --nj 10 --cmd run.pl data/train data/lang exp/mono | |
steps/train_mono.sh: Initializing monophone system. | |
steps/train_mono.sh: Compiling training graphs | |
steps/train_mono.sh: Aligning data equally (pass 0) | |
steps/train_mono.sh: Pass 1 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 2 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 3 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 4 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 5 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 6 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 7 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 8 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 9 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 10 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 11 | |
steps/train_mono.sh: Pass 12 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 13 | |
steps/train_mono.sh: Pass 14 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 15 | |
steps/train_mono.sh: Pass 16 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 17 | |
steps/train_mono.sh: Pass 18 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 19 | |
steps/train_mono.sh: Pass 20 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 21 | |
steps/train_mono.sh: Pass 22 | |
steps/train_mono.sh: Pass 23 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 24 | |
steps/train_mono.sh: Pass 25 | |
steps/train_mono.sh: Pass 26 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 27 | |
steps/train_mono.sh: Pass 28 | |
steps/train_mono.sh: Pass 29 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 30 | |
steps/train_mono.sh: Pass 31 | |
steps/train_mono.sh: Pass 32 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 33 | |
steps/train_mono.sh: Pass 34 | |
steps/train_mono.sh: Pass 35 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 36 | |
steps/train_mono.sh: Pass 37 | |
steps/train_mono.sh: Pass 38 | |
steps/train_mono.sh: Aligning data | |
steps/train_mono.sh: Pass 39 | |
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp/mono | |
steps/diagnostic/analyze_alignments.sh: see stats in exp/mono/log/analyze_alignments.log | |
6 warnings in exp/mono/log/init.log | |
560 warnings in exp/mono/log/update.*.log | |
9049 warnings in exp/mono/log/acc.*.*.log | |
41498 warnings in exp/mono/log/align.*.*.log | |
exp/mono: nj=10 align prob=-96.34 over 15.38h [retry=11.4%, fail=1.7%] states=94 gauss=997 | |
steps/train_mono.sh: Done training monophone system in exp/mono | |
===== MONO DECODING ===== | |
WARNING: the --mono, --left-biphone and --quinphone options are now deprecated and ignored. | |
tree-info exp/mono/tree | |
tree-info exp/mono/tree | |
fstpushspecial | |
fstdeterminizestar --use-log=true | |
fstminimizeencoded | |
fsttablecompose data/lang/L_disambig.fst data/lang/G.fst | |
fstisstochastic data/lang/tmp/LG.fst | |
5.03616e-05 -1.13647e-06 | |
fstcomposecontext --context-size=1 --central-position=0 --read-disambig-syms=data/lang/phones/disambig.int --write-disambig-syms=data/lang/tmp/disambig_ilabels_1_0.int data/lang/tmp/ilabels_1_0.21728 data/lang/tmp/LG.fst | |
fstisstochastic data/lang/tmp/CLG_1_0.fst | |
5.03616e-05 -1.13647e-06 | |
make-h-transducer --disambig-syms-out=exp/mono/graph/disambig_tid.int --transition-scale=1.0 data/lang/tmp/ilabels_1_0 exp/mono/tree exp/mono/final.mdl | |
fsttablecompose exp/mono/graph/Ha.fst data/lang/tmp/CLG_1_0.fst | |
fstdeterminizestar --use-log=true | |
fstminimizeencoded | |
fstrmsymbols exp/mono/graph/disambig_tid.int | |
fstrmepslocal | |
0.000513976 -0.000445992 | |
fstisstochastic exp/mono/graph/HCLGa.fst | |
add-self-loops --self-loop-scale=0.1 --reorder=true exp/mono/final.mdl exp/mono/graph/HCLGa.fst | |
===== MONO ALIGNMENT ===== | |
steps/align_si.sh --nj 10 --cmd run.pl data/train data/lang exp/mono exp/mono_ali | |
steps/align_si.sh: feature type is delta | |
steps/align_si.sh: aligning data in data/train using model from exp/mono, putting alignments in exp/mono_ali | |
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp/mono_ali | |
steps/diagnostic/analyze_alignments.sh: see stats in exp/mono_ali/log/analyze_alignments.log | |
steps/align_si.sh: done aligning data. | |
===== TRI1 (first triphone pass) TRAINING ===== | |
steps/train_deltas.sh --cmd run.pl 2500 20000 data/train data/lang exp/mono_ali exp/tri1 | |
steps/train_deltas.sh: accumulating tree stats | |
steps/train_deltas.sh: getting questions for tree-building, via clustering | |
steps/train_deltas.sh: building the tree | |
WARNING (gmm-init-model[5.5.70~1-e5446]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 1 with no stats; corresponding phone list: 6 7 8 9 10 | |
WARNING (gmm-init-model[5.5.70~1-e5446]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 2 with no stats; corresponding phone list: 11 12 13 14 | |
WARNING (gmm-init-model[5.5.70~1-e5446]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 3 with no stats; corresponding phone list: 15 16 17 18 | |
** The warnings above about 'no stats' generally mean you have phones ** | |
** (or groups of phones) in your phone set that had no corresponding data. ** | |
** You should probably figure out whether something went wrong, ** | |
** or whether your data just doesn't happen to have examples of those ** | |
** phones. ** | |
steps/train_deltas.sh: converting alignments from exp/mono_ali to use current tree | |
steps/train_deltas.sh: compiling graphs of transcripts | |
steps/train_deltas.sh: training pass 1 | |
steps/train_deltas.sh: training pass 2 | |
steps/train_deltas.sh: training pass 3 | |
steps/train_deltas.sh: training pass 4 | |
steps/train_deltas.sh: training pass 5 | |
steps/train_deltas.sh: training pass 6 | |
steps/train_deltas.sh: training pass 7 | |
steps/train_deltas.sh: training pass 8 | |
steps/train_deltas.sh: training pass 9 | |
steps/train_deltas.sh: training pass 10 | |
steps/train_deltas.sh: aligning data | |
steps/train_deltas.sh: training pass 11 | |
steps/train_deltas.sh: training pass 12 | |
steps/train_deltas.sh: training pass 13 | |
steps/train_deltas.sh: training pass 14 | |
steps/train_deltas.sh: training pass 15 | |
steps/train_deltas.sh: training pass 16 | |
steps/train_deltas.sh: training pass 17 | |
steps/train_deltas.sh: training pass 18 | |
steps/train_deltas.sh: training pass 19 | |
steps/train_deltas.sh: training pass 20 | |
steps/train_deltas.sh: aligning data | |
steps/train_deltas.sh: training pass 21 | |
steps/train_deltas.sh: training pass 22 | |
steps/train_deltas.sh: training pass 23 | |
steps/train_deltas.sh: training pass 24 | |
steps/train_deltas.sh: training pass 25 | |
steps/train_deltas.sh: training pass 26 | |
steps/train_deltas.sh: training pass 27 | |
steps/train_deltas.sh: training pass 28 | |
steps/train_deltas.sh: training pass 29 | |
steps/train_deltas.sh: training pass 30 | |
steps/train_deltas.sh: aligning data | |
steps/train_deltas.sh: training pass 31 | |
steps/train_deltas.sh: training pass 32 | |
steps/train_deltas.sh: training pass 33 | |
steps/train_deltas.sh: training pass 34 | |
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp/tri1 | |
steps/diagnostic/analyze_alignments.sh: see stats in exp/tri1/log/analyze_alignments.log | |
3 warnings in exp/tri1/log/questions.log | |
4 warnings in exp/tri1/log/init_model.log | |
138 warnings in exp/tri1/log/update.*.log | |
4753 warnings in exp/tri1/log/align.*.*.log | |
1 warnings in exp/tri1/log/build_tree.log | |
11876 warnings in exp/tri1/log/acc.*.*.log | |
exp/tri1: nj=10 align prob=-92.82 over 15.16h [retry=9.0%, fail=3.3%] states=1904 gauss=20061 tree-impr=4.76 | |
steps/train_deltas.sh: Done training system with delta+delta-delta features in exp/tri1 | |
===== TRI1 (first triphone pass) DECODING ===== | |
tree-info exp/tri1/tree | |
tree-info exp/tri1/tree | |
fstcomposecontext --context-size=3 --central-position=1 --read-disambig-syms=data/lang/phones/disambig.int --write-disambig-syms=data/lang/tmp/disambig_ilabels_3_1.int data/lang/tmp/ilabels_3_1.30361 data/lang/tmp/LG.fst | |
fstisstochastic data/lang/tmp/CLG_3_1.fst | |
5.03616e-05 -1.13647e-06 | |
make-h-transducer --disambig-syms-out=exp/tri1/graph/disambig_tid.int --transition-scale=1.0 data/lang/tmp/ilabels_3_1 exp/tri1/tree exp/tri1/final.mdl | |
fstrmepslocal | |
fstminimizeencoded | |
fstdeterminizestar --use-log=true | |
fstrmsymbols exp/tri1/graph/disambig_tid.int | |
fsttablecompose exp/tri1/graph/Ha.fst data/lang/tmp/CLG_3_1.fst | |
fstisstochastic exp/tri1/graph/HCLGa.fst | |
0.000513874 -0.000486971 | |
add-self-loops --self-loop-scale=0.1 --reorder=true exp/tri1/final.mdl exp/tri1/graph/HCLGa.fst | |
===== TRI1 ALIGNMENT ===== | |
steps/align_si.sh --nj 10 --cmd run.pl data/train data/lang exp/tri1 exp/tri1_ali | |
steps/align_si.sh: feature type is delta | |
steps/align_si.sh: aligning data in data/train using model from exp/tri1, putting alignments in exp/tri1_ali | |
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp/tri1_ali | |
steps/diagnostic/analyze_alignments.sh: see stats in exp/tri1_ali/log/analyze_alignments.log | |
steps/align_si.sh: done aligning data. | |
===== TRI2A TRAINING ===== | |
steps/train_deltas.sh --cmd run.pl 2500 20000 data/train data/lang exp/tri1_ali exp/tri2a | |
steps/train_deltas.sh: accumulating tree stats | |
steps/train_deltas.sh: getting questions for tree-building, via clustering | |
steps/train_deltas.sh: building the tree | |
WARNING (gmm-init-model[5.5.70~1-e5446]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 1 with no stats; corresponding phone list: 6 7 8 9 10 | |
WARNING (gmm-init-model[5.5.70~1-e5446]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 2 with no stats; corresponding phone list: 11 12 13 14 | |
WARNING (gmm-init-model[5.5.70~1-e5446]:InitAmGmm():gmm-init-model.cc:55) Tree has pdf-id 3 with no stats; corresponding phone list: 15 16 17 18 | |
** The warnings above about 'no stats' generally mean you have phones ** | |
** (or groups of phones) in your phone set that had no corresponding data. ** | |
** You should probably figure out whether something went wrong, ** | |
** or whether your data just doesn't happen to have examples of those ** | |
** phones. ** | |
steps/train_deltas.sh: converting alignments from exp/tri1_ali to use current tree | |
steps/train_deltas.sh: compiling graphs of transcripts | |
steps/train_deltas.sh: training pass 1 | |
steps/train_deltas.sh: training pass 2 | |
steps/train_deltas.sh: training pass 3 | |
steps/train_deltas.sh: training pass 4 | |
steps/train_deltas.sh: training pass 5 | |
steps/train_deltas.sh: training pass 6 | |
steps/train_deltas.sh: training pass 7 | |
steps/train_deltas.sh: training pass 8 | |
steps/train_deltas.sh: training pass 9 | |
steps/train_deltas.sh: training pass 10 | |
steps/train_deltas.sh: aligning data | |
steps/train_deltas.sh: training pass 11 | |
steps/train_deltas.sh: training pass 12 | |
steps/train_deltas.sh: training pass 13 | |
steps/train_deltas.sh: training pass 14 | |
steps/train_deltas.sh: training pass 15 | |
steps/train_deltas.sh: training pass 16 | |
steps/train_deltas.sh: training pass 17 | |
steps/train_deltas.sh: training pass 18 | |
steps/train_deltas.sh: training pass 19 | |
steps/train_deltas.sh: training pass 20 | |
steps/train_deltas.sh: aligning data | |
steps/train_deltas.sh: training pass 21 | |
steps/train_deltas.sh: training pass 22 | |
steps/train_deltas.sh: training pass 23 | |
steps/train_deltas.sh: training pass 24 | |
steps/train_deltas.sh: training pass 25 | |
steps/train_deltas.sh: training pass 26 | |
steps/train_deltas.sh: training pass 27 | |
steps/train_deltas.sh: training pass 28 | |
steps/train_deltas.sh: training pass 29 | |
steps/train_deltas.sh: training pass 30 | |
steps/train_deltas.sh: aligning data | |
steps/train_deltas.sh: training pass 31 | |
steps/train_deltas.sh: training pass 32 | |
steps/train_deltas.sh: training pass 33 | |
steps/train_deltas.sh: training pass 34 | |
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp/tri2a | |
steps/diagnostic/analyze_alignments.sh: see stats in exp/tri2a/log/analyze_alignments.log | |
15316 warnings in exp/tri2a/log/acc.*.*.log | |
4 warnings in exp/tri2a/log/init_model.log | |
5149 warnings in exp/tri2a/log/align.*.*.log | |
3 warnings in exp/tri2a/log/questions.log | |
1 warnings in exp/tri2a/log/build_tree.log | |
136 warnings in exp/tri2a/log/update.*.log | |
exp/tri2a: nj=10 align prob=-92.67 over 15.10h [retry=9.4%, fail=3.6%] states=1904 gauss=20042 tree-impr=5.45 | |
steps/train_deltas.sh: Done training system with delta+delta-delta features in exp/tri2a | |
===== TRI2A DECODING ===== | |
tree-info exp/tri2a/tree | |
tree-info exp/tri2a/tree | |
make-h-transducer --disambig-syms-out=exp/tri2a/graph/disambig_tid.int --transition-scale=1.0 data/lang/tmp/ilabels_3_1 exp/tri2a/tree exp/tri2a/final.mdl | |
fstrmepslocal | |
fsttablecompose exp/tri2a/graph/Ha.fst data/lang/tmp/CLG_3_1.fst | |
fstminimizeencoded | |
fstdeterminizestar --use-log=true | |
fstrmsymbols exp/tri2a/graph/disambig_tid.int | |
fstisstochastic exp/tri2a/graph/HCLGa.fst | |
0.000517835 -0.000487354 | |
add-self-loops --self-loop-scale=0.1 --reorder=true exp/tri2a/final.mdl exp/tri2a/graph/HCLGa.fst | |
steps/decode.sh --config conf/decode.config --nj 10 --cmd run.pl exp/tri2a/graph data/test exp/tri2a/decode | |
Refusing to split data because number of speakers 1 is less than the number of output .scp files 10 at utils/split_scp.pl line 114, <I> line 795. | |
===== TRI2A ALIGNMENT ===== | |
steps/align_si.sh --nj 10 --cmd run.pl data/train data/lang exp/tri2a exp/tri2a_ali | |
steps/align_si.sh: feature type is delta | |
steps/align_si.sh: aligning data in data/train using model from exp/tri2a, putting alignments in exp/tri2a_ali | |
steps/diagnostic/analyze_alignments.sh --cmd run.pl data/lang exp/tri2a_ali | |
steps/diagnostic/analyze_alignments.sh: see stats in exp/tri2a_ali/log/analyze_alignments.log | |
steps/align_si.sh: done aligning data. | |
===== run.sh script is finished ===== | |
grep: exp/tri2a/decode/wer_*: No such file or directory | |
===== Time Report ===== | |
Mono | |
9:8 | |
0:1 | |
0:23 | |
Tri1 | |
2:35 | |
0:0 | |
0:24 | |
Tri2a | |
2:31 | |
0:1 | |
0:23 | |
Total time: | |
15:59 | |
===== Score Report ===== | |
Best WER | |
Process finished with exit code 0 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment