<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article"><front><journal-meta><journal-id journal-id-type="publisher-id">INFORMATICA</journal-id><journal-title-group><journal-title>Informatica</journal-title></journal-title-group><issn pub-type="epub">0868-4952</issn><issn pub-type="ppub">0868-4952</issn><publisher><publisher-name>VU</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">inf19405</article-id><article-id pub-id-type="doi">10.15388/Informatica.2008.229</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research article</subject></subj-group></article-categories><title-group><article-title>Multi-Alignment Templates Induction</article-title></title-group><contrib-group><contrib contrib-type="Author"><name><surname>Laukaitis</surname><given-names>Algirdas</given-names></name><email xlink:href="mailto:algirdas.laukaitis@fm.vgtu.lt">algirdas.laukaitis@fm.vgtu.lt</email><xref ref-type="aff" rid="j_INFORMATICA_aff_000"/></contrib><contrib contrib-type="Author"><name><surname>Vasilecas</surname><given-names>Olegas</given-names></name><email xlink:href="mailto:olegas.vasilecas@fm.vgtu.lt">olegas.vasilecas@fm.vgtu.lt</email><xref ref-type="aff" rid="j_INFORMATICA_aff_000"/></contrib><aff id="j_INFORMATICA_aff_000">Faculty of Fundamental Sciences, Vilnius Gediminas Technical University, Saulėtekio al. 11, LT-10223 Vilnius, Lithuania</aff></contrib-group><pub-date pub-type="epub"><day>01</day><month>01</month><year>2008</year></pub-date><volume>19</volume><issue>4</issue><fpage>535</fpage><lpage>554</lpage><history><date date-type="received"><day>01</day><month>01</month><year>2008</year></date><date date-type="accepted"><day>01</day><month>06</month><year>2008</year></date></history><abstract><p>
This paper examins approaches for translation between English and morphology-rich languages. Experiment with English–Russian and English–Lithuanian revels that “pure” statistical approaches on 10 million word corpus gives unsatisfactory translation. Then, several Web-available linguistic resources are suggested for translation. Syntax parsers, bilingual and semantic dictionaries, bilingual parallel corpus and monolingualWeb-based corpus are integrated in one comprehensive statistical model. Multi-abstraction language representation is used for statistical induction of syntactic and semantic transformation rules called multi-alignment templates. The decodingmodel is described using the feature functions, a log-linear modeling approach and A* search algorithm. An evaluation of this approach is performed on the English–Lithuanian language pair. Presented experimental results demonstrates that the multi-abstraction approach and hybridization of learning methods can improve quality of translation.
</p></abstract><kwd-group><label>Keywords</label><kwd>machine translation</kwd><kwd>natural language processing</kwd><kwd>statistical induction</kwd><kwd>EM algorithm</kwd><kwd>A* search</kwd></kwd-group></article-meta></front></article>