<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">INFORMATICA</journal-id>
			<journal-title-group>
				<journal-title>Informatica</journal-title>
			</journal-title-group>
			<issn pub-type="epub">0868-4952</issn>
			<issn pub-type="ppub">0868-4952</issn>
			<publisher>
				<publisher-name>VU</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="publisher-id">inf24108</article-id>
			<article-id pub-id-type="doi">10.15388/Informatica.2013.388</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Research article</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>General Context-Aware Data Matching and Merging Framework</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="Author">
					<name>
						<surname>Žitnik</surname>
						<given-names>Slavko</given-names>
					</name>
					<email xlink:href="mailto:slavko.zitnik@fri.uni-lj.si">slavko.zitnik@fri.uni-lj.si</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Šubelj</surname>
						<given-names>Lovro</given-names>
					</name>
					<email xlink:href="mailto:lovro.subelj@fri.uni-lj.si">lovro.subelj@fri.uni-lj.si</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Lavbič</surname>
						<given-names>Dejan</given-names>
					</name>
					<email xlink:href="mailto:dejan.lavbic@fri.uni-lj.si">dejan.lavbic@fri.uni-lj.si</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Vasilecas</surname>
						<given-names>Olegas</given-names>
					</name>
					<email xlink:href="mailto:olegas@fm.vgtu.lt">olegas@fm.vgtu.lt</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_001"/>
				</contrib>
				<contrib contrib-type="Author">
					<name>
						<surname>Bajec</surname>
						<given-names>Marko</given-names>
					</name>
					<email xlink:href="mailto:marko.bajec@fri.uni-lj.si">marko.bajec@fri.uni-lj.si</email>
					<xref ref-type="aff" rid="j_INFORMATICA_aff_000"/>
				</contrib>
				<aff id="j_INFORMATICA_aff_000">University of Ljubljana, Faculty of Computer and Information Science, Tržaska cesta 25, SI-1000 Ljubljana</aff>
				<aff id="j_INFORMATICA_aff_001">Information Systems Research Laboratory, Vilnius Gediminas Technical University, Saulėtekio 11, LT-10223 Vilnius, Lithuania</aff>
			</contrib-group>
			<pub-date pub-type="epub">
				<day>01</day>
				<month>01</month>
				<year>2013</year>
			</pub-date>
			<volume>24</volume>
			<issue>1</issue>
			<fpage>119</fpage>
			<lpage>152</lpage>
			<history>
				<date date-type="received">
					<day>01</day>
					<month>07</month>
					<year>2012</year>
				</date>
				<date date-type="accepted">
					<day>01</day>
					<month>09</month>
					<year>2012</year>
				</date>
			</history>
			<abstract>
				<p>Due to numerous public information sources and services, many methods to combine heterogeneous data were proposed recently. However, general end-to-end solutions are still rare, especially systems taking into account different context dimensions. Therefore, the techniques often prove insufficient or are limited to a certain domain. In this paper we briefly review and rigorously evaluate a general framework for data matching and merging. The framework employs collective entity resolution and redundancy elimination using three dimensions of context types. In order to achieve domain independent results, data is enriched with semantics and trust. However, the main contribution of the paper is evaluation on five public domain-incompatible datasets. Furthermore, we introduce additional attribute, relationship, semantic and trust metrics, which allow complete framework management. Besides overall results improvement within the framework, metrics could be of independent interest.</p>
			</abstract>
			<kwd-group>
				<label>Keywords</label>
				<kwd>entity resolution</kwd>
				<kwd>redundancy elimination</kwd>
				<kwd>semantic elevation</kwd>
				<kwd>trust</kwd>
				<kwd>ontologies</kwd>
			</kwd-group>
		</article-meta>
	</front>
</article>