Publicaciones

Título	Adapting linear discriminant analysis to the paradigm of learning from label proportions
Autores	PÉREZ ORTIZ, MARÍA, Gutierrez P.A. , CARBONERO RUZ, MARIANO, Hervas-Martinez C.
Publicación externa	No
Alcance	Conference Paper
Naturaleza	Científica
Web	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85016086157&doi=10.1109%2fSSCI.2016.7850150&partnerID=40&md5=754ff697f8c69bd6b26ffcd31c3cd1c8
Fecha de publicacion	01/01/2017
Scopus Id	2-s2.0-85016086157
DOI	10.1109/SSCI.2016.7850150
Abstract	The recently coined term \'learning from label proportions\' refers to a new learning paradigm where training data is given by groups (also denoted as \'bags\'), and the only known information is the label proportion of each bag. The aim is then to construct a classification model to predict the class label of an individual instance, which differentiates this paradigm from the one of multi-instance learning. This learning setting presents very different applications in political science, marketing, healthcare and, in general, all fields in relation with anonymous data. In this paper, two new strategies are proposed to tackle this kind of problems. Both proposals are based on the optimisation of pattern class memberships using the data distribution in each bag and the known label proportions. To do so, linear discriminant analysis has been reformulated to work with non-crisp class memberships. The experimental part of this paper sets different objetives: 1) study the difference in performance, comparing our proposals and the fully supervised setting, 2) analyse the potential benefits of refining class memberships by the proposed approaches, and 3) test the influence of other factors in the performance, such as the number of classes or the bag size. The results of these experiments are promising, but further research should be encouraged for studying more complex data configurations. © 2016 IEEE.
Palabras clave	Artificial intelligence; Classification models; learning from label proportions; Learning paradigms; Learning settings; Linear discriminant analysis; Multi-instance learning; Potential benefits; weak supervision; Discriminant analysis
Miembros de la Universidad Loyola	MARIANO CARBONERO RUZ