Analysis of the effect of unexpected outliers in the classification of spectroscopy data

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

4 Citations (Scopus)

Abstract

Multi-class classification algorithms are very widely used, but we argue that they are not always ideal from a theoretical perspective, because they assume all classes are characterized by the data, whereas in many applications, training data for some classes may be entirely absent, rare, or statistically unrepresentative. We evaluate one-sided classifiers as an alternative, since they assume that only one class (the target) is well characterized. We consider a task of identifying whether a substance contains a chlorinated solvent, based on its chemical spectrum. For this application, it is not really feasible to collect a statistically representative set of outliers, since that group may contain anything apart from the target chlorinated solvents. Using a new one-sided classification toolkit, we compare a One-Sided k-NN algorithm with two well-known binary classification algorithms, and conclude that the one-sided classifier is more robust to unexpected outliers.

Original languageEnglish
Title of host publicationArtificial Intelligence and Cognitive Science - 20th Irish Conference, AICS 2009, Revised Selected Papers
Pages124-133
Number of pages10
DOIs
Publication statusPublished - 2010
Event20th Annual Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2009 - Dublin, Ireland
Duration: 19 Aug 200921 Aug 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6206 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th Annual Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2009
Country/TerritoryIreland
CityDublin
Period19/08/0921/08/09

Keywords

  • Classification
  • k-Nearest Neighbour
  • One-Class
  • One-Sided
  • Spectroscopy Analysis
  • Support Vector Machine

Fingerprint

Dive into the research topics of 'Analysis of the effect of unexpected outliers in the classification of spectroscopy data'. Together they form a unique fingerprint.

Cite this