<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">INFORMATICA</journal-id>
			<journal-title-group>
				<journal-title>Informatica</journal-title>
			</journal-title-group>
			<issn pub-type="epub">0868-4952</issn>
			<issn pub-type="ppub">0868-4952</issn>
			<publisher>
				<publisher-name>VU</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="publisher-id">inf21401</article-id>
			<article-id pub-id-type="doi">10.15388/Informatica.2010.300</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Research article</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>Statistical Classification of Scientific Publications</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="Author">
					<name>
						<surname>Balys</surname>
						<given-names>Vaidas</given-names>
					</name>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Rudzkis</surname>
						<given-names>Rimantas</given-names>
					</name>
					<email xlink:href="mailto:rudzkis@ktl.mii.lt">rudzkis@ktl.mii.lt</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<aff id="j_INFORMATICA_aff_000">Vilnius University Institute of Mathematics and Informatics, Akademijos 4, LT-08663 Vilnius, Lithuania</aff>
			</contrib-group>
			<pub-date pub-type="epub">
				<day>01</day>
				<month>01</month>
				<year>2010</year>
			</pub-date>
			<volume>21</volume>
			<issue>4</issue>
			<fpage>471</fpage>
			<lpage>486</lpage>
			<history>
				<date date-type="received">
					<day>01</day>
					<month>07</month>
					<year>2009</year>
				</date>
				<date date-type="accepted">
					<day>01</day>
					<month>09</month>
					<year>2010</year>
				</date>
			</history>
			<abstract>
				<p>The problem of automatic classification of scientific texts is considered. Methods based on statistical analysis of probabilistic distributions of scientific terms in texts are discussed. The procedures for selecting the most informative terms and the method of making use of auxiliary information related to the terms positions are presented. The results of experimental evaluation of proposed algorithms and procedures over real-world data are reported.</p>
			</abstract>
			<kwd-group>
				<label>Keywords</label>
				<kwd>statistical classification</kwd>
				<kwd>probabilistic distribution</kwd>
				<kwd>parametric estimation</kwd>
				<kwd>auxiliary information</kwd>
				<kwd>informative terms</kwd>
			</kwd-group>
		</article-meta>
	</front>
</article>