K-Means Clustering in Dual Space for Unsupervised Feature Partitioning in Multi-view Learning

Corrado Mio, Gabriele Gianini, Ernesto Damiani

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In contrast to single-view learning, multi-view learning trains simultaneously distinct algorithms on disjoint subsets of features (the views), and jointly optimizes them, so that they come to a consensus. Multi-view learning is typically used when the data are described by a large number of features. It aims at exploiting the different statistical properties of distinct views. A task to be performed before multi-view learning - in the case where the features have no natural groupings - is multi-view generation (MVG): it consists in partitioning the feature set in subsets (views) characterized by some desired properties. Given a dataset, in the form of a table with a large number of columns, the desired solution of the MVG problem is a partition of the columns that optimizes an objective function, encoding typical requirements. If the class labels are available, one wants to minimize the inter-view redundancy in target prediction and maximize consistency. If the class labels are not available, one wants simply to minimize inter-view redundancy (minimize the information each view has about the others). In this work, we approach the MVG problem in the latter, unsupervised, setting. Our approach is based on the transposition of the data table: the original instance rows are mapped into columns (the 'pseudo-features'), while the original feature columns become rows (the 'pseudo-instances'). The latter can then be partitioned by any suitable standard instance-partitioning algorithm: the resulting groups can be considered as groups of the original features, i.e. views, solution of the MVG problem. We demonstrate the approach using k-means and the standard benchmark MNIST dataset of handwritten digits.

Original languageBritish English
Title of host publicationProceedings - 14th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2018
EditorsRichard Chbeir, Gabriella Sanniti di Baja, Luigi Gallo, Kokou Yetongnon, Albert Dipanda, Modesto Castrillon-Santana
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-8
Number of pages8
ISBN (Electronic)9781538693858
DOIs
StatePublished - 2 Jul 2018
Event14th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2018 - Las Palmas de Gran Canaria, Spain
Duration: 26 Nov 201829 Nov 2018

Publication series

NameProceedings - 14th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2018

Conference

Conference14th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2018
Country/TerritorySpain
CityLas Palmas de Gran Canaria
Period26/11/1829/11/18

Keywords

  • Consensus clustering
  • Dual space clustering
  • K-means
  • Multi-view learning

Fingerprint

Dive into the research topics of 'K-Means Clustering in Dual Space for Unsupervised Feature Partitioning in Multi-view Learning'. Together they form a unique fingerprint.

Cite this