Multigroup classification of audio signals using time-frequency parameters

Karthikeyan Umapathy, Sridhar Krishnan, Shihab Jimaa

Research output: Contribution to journalArticlepeer-review

61 Scopus citations

Abstract

The ongoing advancements in the multimedia technologies drive the need for efficient classification of the audio signals to make the content-based retrieval process more accurate and much easier from huge databases. The challenge of this task lies in an accurate extraction of signal characteristics so as to derive a strong discriminatory feature suitable for classification. In this paper, a time-frequency (TF) approach for audio classification is proposed. Audio signals are nonstationary in nature and TF approach is the best way to analyze them. The audio signals were decomposed using an adaptive TF decomposition algorithm, and the signal decomposition parameter based on octave (scaling) was used to generate a set of 42 features over three frequency bands within the auditory range. These features were analyzed using linear discriminant functions and classified into six music groups (rock, classical, country, jazz, folk and pop). Overall classification accuracies as high as 97.6% was achieved by linear discriminant analysis of 170 audio signals.

Original languageBritish English
Pages (from-to)308-315
Number of pages8
JournalIEEE Transactions on Multimedia
Volume7
Issue number2
DOIs
StatePublished - Apr 2005

Keywords

  • Content-based retrieval
  • Linear discriminant analysis
  • Matching pursuit
  • Music classification
  • Time-frequency

Fingerprint

Dive into the research topics of 'Multigroup classification of audio signals using time-frequency parameters'. Together they form a unique fingerprint.

Cite this