Título Adapting linear discriminant analysis to the paradigm of learning from label proportions
Autores PÉREZ ORTIZ, MARÍA, Gutierrez P.A., CARBONERO RUZ, MARIANO, Hervas-Martinez C., CARBONERO RUZ, MARIANO, PÉREZ ORTIZ, MARÍA
Publicación externa No
Alcance Conference Paper
Naturaleza Científica
Ámbito Internacional
Web https://www.scopus.com/inward/record.uri?eid=2-s2.0-85016086157&doi=10.1109%2fSSCI.2016.7850150&partnerID=40&md5=754ff697f8c69bd6b26ffcd31c3cd1c8
Fecha de publicacion 01/01/2017
Scopus Id 2-s2.0-85016086157
DOI 10.1109/SSCI.2016.7850150
Abstract The recently coined term 'learning from label proportions' refers to a new learning paradigm where training data is given by groups (also denoted as 'bags'), and the only known information is the label proportion of each bag. The aim is then to construct a classification model to predict the class label of an individual instance, which differentiates this paradigm from the one of multi-instance learning. This learning setting presents very different applications in political science, marketing, healthcare and, in general, all fields in relation with anonymous data. In this paper, two new strategies are proposed to tackle this kind of problems. Both proposals are based on the optimisation of pattern class memberships using the data distribution in each bag and the known label proportions. To do so, linear discriminant analysis has been reformulated to work with non-crisp class memberships. The experimental part of this paper sets different objetives: 1) study the difference in performance, comparing our proposals and the fully supervised setting, 2) analyse the potential benefits of refining class memberships by the proposed approaches, and 3) test the influence of other factors in the performance, such as the number of classes or the bag size. The results of these experiments are promising, but further research should be encouraged for studying more complex data configurations. © 2016 IEEE.
Palabras clave Artificial intelligence; Classification models; learning from label proportions; Learning paradigms; Learning settings; Linear discriminant analysis; Multi-instance learning; Potential benefits; weak
Miembros de la Universidad Loyola

Change your preferences Gestionar cookies