Machine learning methods for quantitative analysis of Raman spectroscopy data

Research output: Contribution to conference (Published)Paper

30 Citations (Scopus)

Abstract

The automated identification and quantification of illicit materials using Roman spectroscopy is of significant importance for law enforcement agencies. This paper explores the use of Machine Learning (ML) methods in comparison with standard statistical regression techniques for developing automated identification methods. In this work, the ML task is broken into two sub-tasks, data reduction and prediction. In well-conditioned data, the number of samples should be much larger than the number of attributes per sample, to limit the degrees of freedom in predictive models. In this spectroscopy data, the opposite is normally true. Predictive models based on such data have a high number of degrees of freedom, which increases the risk of models over-fitting to the sample data and having poor predictive power. In the work described here, an approach to data reduction based on Genetic Algorithms is described. For the prediction sub-task, the objective is to estimate the concentration of a component in a mixture, based on its Raman spectrum and the known concentrations of previously seen mixtures. Here, Neural Networks and k-Nearest Neighbours are used for prediction. Preliminary results are presented for the problem of estimating the concentration of cocaine in solid mixtures, and compared with previously published results in which statistical analysis of the same dataset was performed. Finally, this paper demonstrates how more accurate results may be achieved by using an ensemble of prediction techniques.

Original languageEnglish
Pages1130-1139
Number of pages10
DOIs
Publication statusPublished - 2002
EventOpto-Ireland 2002: Optics and Photonics Technologies and Applications - Galway, Ireland
Duration: 5 Sep 20026 Sep 2002

Conference

ConferenceOpto-Ireland 2002: Optics and Photonics Technologies and Applications
Country/TerritoryIreland
CityGalway
Period5/09/026/09/02

Keywords

  • Ensemble
  • Forensic science
  • Genetic Algorithm
  • Machine Learning
  • Narcotics
  • Neural Network
  • Raman
  • Regression
  • Spectroscopy

Fingerprint

Dive into the research topics of 'Machine learning methods for quantitative analysis of Raman spectroscopy data'. Together they form a unique fingerprint.

Cite this