Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
  • Colleges & Schools
  • Statistics
  • All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Institutes and Centres
  3. Insight Centre for Data Analytics
  4. Insight Research Collection
  5. Multi-view based unlabeled data selection using feature transformation methods for semiBoost learning
 
  • Details
Options

Multi-view based unlabeled data selection using feature transformation methods for semiBoost learning

Author(s)
Thanh-Binh, Le 
Hong, Sugwon 
Kim, Sang-Woon 
Uri
http://hdl.handle.net/10197/8705
Date Issued
02 August 2017
Abstract
SemiBoost Mallapragada et al. (2009) is a boosting framework for semi-supervised learning, in which unlabeled data as well as labeled data both contribute to learning. Various strategies have been proposed in the literature to perform the task of selecting useful unlabeled data in SemiBoost. Recently, a multi-view based strategy was proposed in Le and Kim (2016), in which the feature set of the data is decomposed into subsets (i.e., multiple views) using a feature-decomposition method. In the decomposition process, the strategy inevitably results in some loss of information. To avoid this drawback, this paper considered feature-transformation methods, rather than using the decomposition method, to obtain the multiple views. More specifically, in the feature-transformation method, a number of views were obtained from the entire feature set using the same number of different mapping functions. After deriving the number of views of the data, each of the views was used for measuring corresponding confidences, for first evaluating examples to be selected. Then, all the confidence levels measured from the multiple views were combined as a weighted average for deriving a target confidence. The experimental results, which were obtained using support vector machines for well-known benchmark data, demonstrate that the proposed mechanism can compensate for the shortcomings of the traditional strategies. In addition, the results demonstrate that when the data is transformed appropriately into multiple views, the strategy can achieve further improvement in results in terms of classification accuracy.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
Elsevier
Journal
Neurocomputing
Volume
249
Start Page
277
End Page
289
Copyright (Published Version)
2017 Elsevier
Keywords
  • Machine learning

  • Statistics

  • SemiBoost learning

  • Useful unlabeled data...

  • Multiple views of fea...

  • Feature decomposition...

DOI
10.1016/j.neucom.2017.04.021
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Insight Research Collection
Scopus© citations
1
Acquisition Date
Feb 6, 2023
View Details
Views
1083
Last Month
1
Acquisition Date
Feb 6, 2023
View Details
Downloads
213
Acquisition Date
Feb 6, 2023
View Details
google-scholar
University College Dublin Research Repository UCD
The Library, University College Dublin, Belfield, Dublin 4
Phone: +353 (0)1 716 7583
Fax: +353 (0)1 283 7667
Email: mailto:research.repository@ucd.ie
Guide: http://libguides.ucd.ie/rru

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement