<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">INFORMATICA</journal-id>
			<journal-title-group>
				<journal-title>Informatica</journal-title>
			</journal-title-group>
			<issn pub-type="epub">0868-4952</issn>
			<issn pub-type="ppub">0868-4952</issn>
			<publisher>
				<publisher-name>VU</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="publisher-id">inf23402</article-id>
			<article-id pub-id-type="doi">10.15388/Informatica.2012.373</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Research article</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>Learning Process Termination Criteria</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="Author">
					<name>
						<surname>Brumen</surname>
						<given-names>Boštjan</given-names>
					</name>
					<email xlink:href="mailto:bostjan.brumen@uni-mb.si">bostjan.brumen@uni-mb.si</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Hölbl</surname>
						<given-names>Marko</given-names>
					</name>
					<email xlink:href="mailto:marko.holbl@uni-mb.si">marko.holbl@uni-mb.si</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Harej Pulko</surname>
						<given-names>Katja</given-names>
					</name>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Welzer</surname>
						<given-names>Tatjana</given-names>
					</name>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Heričko</surname>
						<given-names>Marjan</given-names>
					</name>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Jurič</surname>
						<given-names>Matjaž B.</given-names>
					</name>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_001"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Jaakkola</surname>
						<given-names>Hannu</given-names>
					</name>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_002"/>
				</contrib>
				<aff id="j_INFORMATICA_aff_000">University of Maribor, Faculty of Electrical Engineering, Computer Science and Informatics, Smetanova 17, SI-2000 Maribor, Slovenia</aff>
				<aff id="j_INFORMATICA_aff_001">University of Ljubljana, Faculty of Computer and Information Science, Tržaška cesta 25, SI-1000 Ljubljana, Slovenia</aff>
				<aff id="j_INFORMATICA_aff_002">Tampere University of Technology, Pori, Pohjoisranta 11, FIN-28101 Pori, Finland</aff>
			</contrib-group>
			<pub-date pub-type="epub">
				<day>01</day>
				<month>01</month>
				<year>2012</year>
			</pub-date>
			<volume>23</volume>
			<issue>4</issue>
			<fpage>521</fpage>
			<lpage>536</lpage>
			<history>
				<date date-type="received">
					<day>01</day>
					<month>11</month>
					<year>2011</year>
				</date>
				<date date-type="accepted">
					<day>01</day>
					<month>04</month>
					<year>2012</year>
				</date>
			</history>
			<abstract>
				<p>In a supervised learning, the relationship between the available data and the performance (what is learnt) is not well understood. How much data to use, or when to stop the learning process, are the key questions.</p><p>In the paper, we present an approach for an early assessment of the extracted knowledge (classification models) in the terms of performance (accuracy). The key questions are answered by detecting the point of convergence, i.e., where the classification model's performance does not improve any more even when adding more data items to the learning set. For the learning process termination criteria we developed a set of equations for detection of the convergence that follow the basic principles of the learning curve. The developed solution was evaluated on real datasets. The results of the experiment prove that the solution is well-designed: the learning process stopping criteria are not subjected to local variance and the convergence is detected where it actually has occurred.</p>
			</abstract>
			<kwd-group>
				<label>Keywords</label>
				<kwd>learning curve</kwd>
				<kwd>learning process</kwd>
				<kwd>classification</kwd>
				<kwd>accuracy</kwd>
				<kwd>assessment</kwd>
				<kwd>data mining</kwd>
			</kwd-group>
		</article-meta>
	</front>
</article>