Adapting Linear Discriminant Analysis to the Paradigm of Learning from Label Proportions

Hits: 7270
Research areas:
Year:
2016
Type of Publication:
In Proceedings
Authors:
Book title:
2016 IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2016)
Pages:
1-7
Organization:
Athens, Greece
Month:
6th-9th December
ISBN:
978-1-5090-4241-8
Abstract:
Learning from label proportions denotes the learning paradigm where training data is provided in groups (or 'bags'), and only the label proportion of each bag is available. The objective is to learn a model to predict the class label of an individual instance, which differentiates this paradigm from the one of multi-instance learning. This learning setting presents very different applications in political science, marketing, healthcare and, in general, all fields concerning anonymous data. Two different iterative strategies are proposed in this paper to deal with this type of problems, both based on the optimisation of the class membership of the patterns using the pattern distribution per bag and the label proportions. To do so, linear discriminant analysis is reformulated to deal with non-crisp class memberships. A thorough set of experiments is conducted to test: 1) the performance gap between these approaches and the fully supervised setting, 2) the potential advantages of optimising class memberships by our proposals, and 3) the influence of factors such as the bag size and the number of classes of the problem in the performance. The results of these experiments are promising, but further research should be encouraged for studying more complex data configurations.
Back