What is it about?

GA-Auto-PU is the first automated machine learning (Auto-ML) system for positive-unlabelled (PU) learning. Many algorithms for PU learning have been proposed, so finding the optimal algorithm for a given task is computationally unfeasible. Auto-ML can assist in this task by searching for an optimal algorithm for a specific dataset through a guided process. We use a simple genetic algorithm to guide this process, and our system outperforms a state-of-the-art PU learning algorithm with statistical significance.

Featured Image

Why is it important?

PU learning is a field of machine learning that aims to learn classifiers from data consisting of a set of labelled positive instances, and a set of unlabelled instances, which may be positive or negative, but whose label is unknown. This is a common scenario in machine learning, but is often overlooked. In this case, often the unlabelled set is treated as a negative set, so instances are misrepresented. Our aim is to make PU learning more accessible, and therefore more widely used, by introducing an Auto-ML system that constructs PU learning algorithms for specific input datasets.

Read the Original

This page is a summary of: GA-auto-PU, July 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3520304.3528932.
You can read the full text:

Read

Contributors

The following have contributed to this page