TY - GEN
T1 - Hill climbing on speech lattices
T2 - 36th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
AU - Rastrow, Ariya
AU - Dreyer, Markus
AU - Sethy, Abhinav
AU - Khudanpur, Sanjeev
AU - Ramabhadran, Bhuvana
AU - Dredze, Mark
PY - 2011
Y1 - 2011
N2 - We describe a new approach for rescoring speech lattices - with long-span language models or wide-context acoustic models - that does not entail computationally intensive lattice expansion or limited rescoring of only an N-best list. We view the set of word-sequences in a lattice as a discrete space equipped with the edit-distance metric, and develop a hill climbing technique to start with, say, the 1-best hypothesis under the lattice-generating model(s) and iteratively search a local neighborhood for the highest-scoring hypothesis under the rescoring model(s); such neighborhoods are efficiently constructed via finite state techniques. We demonstrate empirically that to achieve the same reduction in error rate using a better estimated, higher order language model, our technique evaluates fewer utterance-length hypotheses than conventional N-best rescoring by two orders of magnitude. For the same number of hypotheses evaluated, our technique results in a significantly lower error rate.
AB - We describe a new approach for rescoring speech lattices - with long-span language models or wide-context acoustic models - that does not entail computationally intensive lattice expansion or limited rescoring of only an N-best list. We view the set of word-sequences in a lattice as a discrete space equipped with the edit-distance metric, and develop a hill climbing technique to start with, say, the 1-best hypothesis under the lattice-generating model(s) and iteratively search a local neighborhood for the highest-scoring hypothesis under the rescoring model(s); such neighborhoods are efficiently constructed via finite state techniques. We demonstrate empirically that to achieve the same reduction in error rate using a better estimated, higher order language model, our technique evaluates fewer utterance-length hypotheses than conventional N-best rescoring by two orders of magnitude. For the same number of hypotheses evaluated, our technique results in a significantly lower error rate.
KW - Hill Climbing
KW - Rescoring
KW - Search Algorithm
UR - http://www.scopus.com/inward/record.url?scp=80051615430&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80051615430&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2011.5947487
DO - 10.1109/ICASSP.2011.5947487
M3 - Conference contribution
AN - SCOPUS:80051615430
SN - 9781457705397
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 5032
EP - 5035
BT - 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings
Y2 - 22 May 2011 through 27 May 2011
ER -