The process of manually labeling instances, essential to a supervised classifier, can be expensive and time-consuming. In such a scenario the semi-supervised approach, which makes use of the un-labeled patterns when building the decision function, is a more appealing choice. Indeed, large amounts of unlabeled samples often can be easily obtained. Many optimization techniques have been developed in the last decade to include the unlabeled patterns in the Support Vector Machines formulation. Two broad strategies are followed: continuous and combinatorial. The approach presented in this paper belongs to the latter family and is especially suitable when a fair estimation of the proportion of positive and negative samples is available. Our method is very simple and requires a very light parameter selection. Several medium and large scale experiments on both artificial and real-world datasets have been carried out, proving the effectiveness and the efficiency of the proposed algorithm

Lagrangean-based Combinatorial Optimization for Large Scale S3VMs / Bagattini, Francesco; Cappanera, Paola; Schoen, Fabio. - In: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS. - ISSN 2162-237X. - STAMPA. - 29:(2018), pp. 4426-4435. [10.1109/TNNLS.2017.2766704]

Lagrangean-based Combinatorial Optimization for Large Scale S3VMs

BAGATTINI, FRANCESCO
;
CAPPANERA, PAOLA;SCHOEN, FABIO
2018

Abstract

The process of manually labeling instances, essential to a supervised classifier, can be expensive and time-consuming. In such a scenario the semi-supervised approach, which makes use of the un-labeled patterns when building the decision function, is a more appealing choice. Indeed, large amounts of unlabeled samples often can be easily obtained. Many optimization techniques have been developed in the last decade to include the unlabeled patterns in the Support Vector Machines formulation. Two broad strategies are followed: continuous and combinatorial. The approach presented in this paper belongs to the latter family and is especially suitable when a fair estimation of the proportion of positive and negative samples is available. Our method is very simple and requires a very light parameter selection. Several medium and large scale experiments on both artificial and real-world datasets have been carried out, proving the effectiveness and the efficiency of the proposed algorithm
2018
29
4426
4435
Bagattini, Francesco; Cappanera, Paola; Schoen, Fabio
File in questo prodotto:
File Dimensione Formato  
bagattini.pdf

Accesso chiuso

Descrizione: versione finale pre-stampa
Tipologia: Versione finale referata (Postprint, Accepted manuscript)
Licenza: Tutti i diritti riservati
Dimensione 1.12 MB
Formato Adobe PDF
1.12 MB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1099951
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 6
social impact