언어모델생성 (1) 썸네일형 리스트형 [CMU Sphinx]언어모델(Language Model : LM)파일 생성법 Typical UsageGiven a large corpus of text in a file a.text, but no specified vocabulary Compute the word unigram counts cat a.text | text2wfreq > a.wfreq Convert the word unigram counts into a vocabulary consisting of the 20,000 most common words cat a.wfreq | wfreq2vocab -top 20000 > a.vocab Generate a binary id 3-gram of the training text, based on this vocabulary cat a.text | text2idngram -vo.. 이전 1 다음