<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article"><front><journal-meta><journal-id journal-id-type="publisher-id">INFORMATICA</journal-id><journal-title-group><journal-title>Informatica</journal-title></journal-title-group><issn pub-type="epub">0868-4952</issn><issn pub-type="ppub">0868-4952</issn><publisher><publisher-name>VU</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="publisher-id">inf15308</article-id><article-id pub-id-type="doi">10.15388/Informatica.2004.068</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research article</subject></subj-group></article-categories><title-group><article-title>Outlier Detection Based on the Distribution of Distances between Data Points</article-title></title-group><contrib-group><contrib contrib-type="Author"><name><surname>Šaltenis</surname><given-names>Vydunas</given-names></name><email xlink:href="mailto:saltenis@ktl.mii.lt">saltenis@ktl.mii.lt</email><xref ref-type="aff" rid="j_INFORMATICA_aff_000"/></contrib><aff id="j_INFORMATICA_aff_000">Institute of Mathematics and Informatics, Akademijos 4, 2600 Vilnius, Lithuania</aff></contrib-group><pub-date pub-type="epub"><day>01</day><month>01</month><year>2004</year></pub-date><volume>15</volume><issue>3</issue><fpage>399</fpage><lpage>410</lpage><history><date date-type="received"><day>01</day><month>01</month><year>2004</year></date></history><abstract><p>A novel approach to outlier detection on the ground of the properties of distribution of distances between multidimensional points is presented. The basic idea is to evaluate the outlier factor for each data point. The factor is used to rank the dataset objects regarding their degree of being an outlier. Selecting the points with the minimal factor values can then identify outliers. The main advantages of the approach are: (1) no parameter choice in outlier detection is necessary; (2) detection is not dependent on clustering algorithms.</p><p>To demonstrate the quality of the outlier detection, the experiments were performed on widely used datasets. A comparison with some popular detection methods shows the superiority of our approach.</p></abstract><kwd-group><label>Keywords</label><kwd>outlier detection</kwd><kwd>high‐dimensional data</kwd><kwd>distribution of distances</kwd></kwd-group></article-meta></front></article>