<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article"><front><journal-meta><journal-id journal-id-type="publisher-id">INFORMATICA</journal-id><journal-title-group><journal-title>Informatica</journal-title></journal-title-group><issn pub-type="epub">0868-4952</issn><issn pub-type="ppub">0868-4952</issn><publisher><publisher-name>VU</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">inf15302</article-id><article-id pub-id-type="doi">10.15388/Informatica.2004.062</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research article</subject></subj-group></article-categories><title-group><article-title>Limited‐Vocabulary Estonian Continuous Speech Recognition System using Hidden Markov Models</article-title></title-group><contrib-group><contrib contrib-type="Author"><name><surname>Alumäe</surname><given-names>Tanel</given-names></name><email xlink:href="mailto:tanel.alumae@phon.ioc.ee">tanel.alumae@phon.ioc.ee</email><xref ref-type="aff" rid="j_INFORMATICA_aff_000"/></contrib><contrib contrib-type="Author"><name><surname>Võhandu</surname><given-names>Leo</given-names></name><email xlink:href="mailto:leov@staff.ttu.ee">leov@staff.ttu.ee</email><xref ref-type="aff" rid="j_INFORMATICA_aff_000"/></contrib><aff id="j_INFORMATICA_aff_000">Tallinn Technical University, Ehitajate tee 5, 19086 Tallinn, Estonia</aff></contrib-group><pub-date pub-type="epub"><day>01</day><month>01</month><year>2004</year></pub-date><volume>15</volume><issue>3</issue><fpage>303</fpage><lpage>314</lpage><history><date date-type="received"><day>01</day><month>02</month><year>2004</year></date></history><abstract><p>The article presents a limited‐vocabulary speaker independent continuous Estonian speech recognition system based on hidden Markov models. The system is trained using an annotated Estonian speech database of 60 speakers, approximately 4 hours in duration. Words are modelled using clustered triphones with multiple Gaussian mixture components. The system is evaluated using a number recognition task and a simple medium‐vocabulary recognition task. The system performance is explored by employing acoustic models of increasing complexity. The number recognizer achieves an accuracy of 97%. The medium‐vocabulary system recognizes 82.9% words correctly if operating in real time. The correctness increases to 90.6% if real‐time requirement is discarded.</p></abstract><kwd-group><label>Keywords</label><kwd>continuous speech recognition</kwd><kwd>hidden Markov models</kwd><kwd>Estonian</kwd></kwd-group></article-meta></front></article>