Language Model Adaptation Using Dirichlet Class Language Model Based on Part-of-Speech

Hatami, Ali; Akbari, Ahmad; Nasersharif, Babak

doi:10.7508/jist.2014.01.005

Manuscript ID : 139307301447342792 Page: 1 - 10

Article Type: Original Research

Subject Areas : Speech Processing

Ali Hatami ¹ , Ahmad Akbari ² , Babak Nasersharif ³

1 - Iran University of Science and Technology
2 -
3 - K. N. Toosi

Received: 2014-10-22 Accepted : 2014-10-22 Published : 2014-03-21

Keywords: Speech Recognition, Language Model Adaptation, Part-of-Speech, Perplexity, Word Error Rate

Abstract :

Language modeling has applications in a wide variety of domains, and a model's performance depends on how well it is adapted to a particular style of data. Adaptation methods therefore try to exploit syntactic and semantic characteristics of the language. Previous adaptation methods, such as the Dirichlet class language model (DCLM) family, extract a class for the history words. Because they lack syntactic information, these methods are not well suited to morphologically rich languages such as Farsi. In this paper, we propose using syntactic information, namely part-of-speech (POS) tags, in DCLM and combining the result with a language model from the n-gram family. In our work, word clustering is based on the POS of the previous words and on the history words in DCLM. The performance of the language models is evaluated on the BijanKhan corpus using a hidden Markov model based ASR system. The results show that using POS information along with the history words and the class of history words improves language model performance and decreases perplexity on our corpus. When POS information is exploited along with DCLM, the word error rate of the ASR system decreases by 1.2% compared to DCLM alone.
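The abstract does not say how the class-based model is combined with the n-gram model, nor how perplexity is computed. As a minimal sketch, assuming simple linear interpolation with a fixed weight (a common choice, not necessarily the authors' method), the two quantities could be computed as follows. The function names, the weight `lam`, and the toy probabilities are all hypothetical and are not taken from the paper.

```python
import math


def interpolate(p_ngram, p_class, lam):
    """Linearly interpolate two probabilities for the same word and history.

    lam is an assumed mixing weight in [0, 1]; the paper does not state one.
    """
    return lam * p_ngram + (1.0 - lam) * p_class


def perplexity(word_probs):
    """Perplexity of a word sequence given the model probability of each word."""
    log_sum = sum(math.log(p) for p in word_probs)
    return math.exp(-log_sum / len(word_probs))


# Toy per-word probabilities (made up for illustration only).
ngram_probs = [0.10, 0.02, 0.30, 0.05]
class_probs = [0.20, 0.08, 0.10, 0.10]

# Combine the two models word by word with an assumed weight of 0.5.
mixed_probs = [interpolate(a, b, 0.5) for a, b in zip(ngram_probs, class_probs)]

print("n-gram perplexity:      ", perplexity(ngram_probs))
print("interpolated perplexity:", perplexity(mixed_probs))
```

Lower perplexity means the model assigns higher probability to the test words on average, which is the sense in which the abstract reports an improvement.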
