Questions tagged [semi-supervised-learning]

Making use of both unsupervised and supervised learning paradigms to train on a partially labelled dataset.

In a partly labeled dataset, training a model on only the labeled observations can be suboptimal. The remaining unlabeled part of the dataset may contain valuable information about the structure of the data that could be used to improve the model, especially when the proportion of labeled data is low.

Semi-supervised learning combines concepts from unsupervised and supervised learning to get the most out of such a dataset. The paradigm includes dedicated semi-supervised techniques as well as hybrid approaches that combine standard supervised and unsupervised methods.
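As a rough illustration of one such hybrid approach, here is a minimal self-training (pseudo-labeling) sketch in plain Python. It is only a toy under stated assumptions: the data are one-dimensional, the base model is a nearest-centroid classifier, and the names `self_train` and `min_margin` as well as the sample values are made up for this example rather than taken from any library.

```python
# Toy self-training loop: fit on labeled data, pseudo-label the unlabeled
# points the model is confident about, refit, and repeat.
# Assumptions: 1-D features, at least two classes, nearest-centroid base model.

def centroids(points, labels):
    """Return the mean of the points in each class."""
    sums, counts = {}, {}
    for x, y in zip(points, labels):
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}

def predict(cents, x):
    """Return (label, margin), where margin is how much closer x is
    to the nearest centroid than to the second nearest."""
    ranked = sorted(cents, key=lambda y: abs(x - cents[y]))
    best, second = ranked[0], ranked[1]
    margin = abs(x - cents[second]) - abs(x - cents[best])
    return best, margin

def self_train(labeled_x, labeled_y, unlabeled_x, min_margin=1.0, max_rounds=10):
    """Hypothetical helper: returns the final centroids and the points
    that never received a confident pseudo-label."""
    xs, ys = list(labeled_x), list(labeled_y)
    pool = list(unlabeled_x)
    for _ in range(max_rounds):
        cents = centroids(xs, ys)
        confident, remaining = [], []
        for x in pool:
            label, margin = predict(cents, x)
            if margin >= min_margin:
                confident.append((x, label))
            else:
                remaining.append(x)
        if not confident:
            break  # nothing new to learn from
        for x, label in confident:
            xs.append(x)
            ys.append(label)
        pool = remaining
    return centroids(xs, ys), pool

# Two labeled points, five unlabeled ones (illustrative values).
final_centroids, still_unlabeled = self_train(
    labeled_x=[0.0, 10.0],
    labeled_y=[0, 1],
    unlabeled_x=[1.0, 2.0, 9.0, 8.0, 5.0],
)
print(final_centroids, still_unlabeled)
```

The point at 5.0 sits exactly between the two classes, so it never clears the margin threshold and stays unlabeled. In practice the confidence threshold matters a lot: set it too low and early mistakes get reinforced in later rounds.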

54 questions

3 answers

Build a binary classifier with only positive and unlabeled data

I have 2 datasets, one with positive instances of what I would like to detect, and one with unlabeled instances. What methods can I use? As an example, suppose we want to detect spam email based on a few structured email characteristics.…

classification semi-supervised-learning

asked Jul 07 '14 at 09:34

nassimhddd

4 answers

Why positive-unlabeled learning?

Machine learning can be divided into several areas: supervised learning, unsupervised learning, semi-supervised learning, learning to rank, recommendation systems, etc. One such area is PU Learning, where only Positive and Unlabeled instances…

machine-learning classification semi-supervised-learning

asked Jan 17 '18 at 14:54

Ricardo Magalhães Cruz

1 answer

Custom conditional loss function in Keras

I'm looking for a way to create a conditional loss function that looks like this: there is a vector of labels, say l (l has the same length as the input x), then for a given input (y_true, y_pred, l) the loss should be: def…

keras loss-function semi-supervised-learning

asked Mar 01 '18 at 05:32

Tian

2 answers

General strategy for imbalanced, semi-supervised, sparse problem

I am looking for some general advice on where to start with this problem. There are 350 sparse (low positive integer) features. I have 2000 positives, 1000 negatives, and infinite unlabeled data, where the estimated true positive rate in the…

machine-learning predictive-modeling semi-supervised-learning

asked Dec 28 '16 at 16:07

user27436

3 answers

Predictive clustering

I have a hypothesis, but I don't know if it's true. If a cluster is dense and we apply supervised learning to its data, the resulting model will be more effective on new data falling into this cluster than on other data. Thus we have…

clustering supervised-learning semi-supervised-learning

asked Mar 06 '16 at 15:46

KyBe

1 answer

How to approach semi-supervised binary classification problem with few labels only from one class?

I am facing a binary classification problem where I have a few labeled instances (so, as far as I know, this is "semi-supervised" learning), but only from the positive class. So I cannot take any negative examples as a basis for learning…

machine-learning classification semi-supervised-learning

asked Jul 29 '20 at 07:50

Fredrik

2 answers

Time series binary classification with labelling issues

My situation is quite complicated so I will give a similar example from a simpler domain. Suppose we want to try to predict WHEN a mobile game user will make a purchase if given a sale. Almost every user is always instantaneously a non-purchaser…

machine-learning classification time-series feature-engineering semi-supervised-learning

asked Jul 03 '18 at 05:33

Keith

1 answer

Probability for label correctness in semi-supervised learning

I am aware of the existence of semi-supervised learning approaches, such as the Ladder Network, where only a subset of the data is labeled. Are there any methods or papers which consider correctness probabilities for the labels of that training data…

supervised-learning unsupervised-learning labels semi-supervised-learning

asked Jun 09 '17 at 12:46

AlexGuevara

1 answer

Solutions for Labelling Training Data for Binary Classification Problems

I have a huge dataset for which I am trying to use an 80-20 (Holdout method) approach to train and test my model. However, the dataset I have been given has 6m rows. The objective is to train+test+validate the model before using live data traffic…

classification semi-supervised-learning labelling

asked Nov 08 '20 at 12:09

ha9u63a7

1 answer

What is the difference between all the different types of learning within machine learning?

This is a question that is really hard to google, and the differences are confusing. Does anyone have good examples of the differences between them all? Supervised Learning, Semi-Supervised Learning, Distant Supervision, Active Learning, Lightly…

machine-learning unsupervised-learning supervised-learning semi-supervised-learning

asked Dec 06 '19 at 16:44

A.White

4 answers

Supervised clustering

I'm working on a clustering problem. I have a training set composed of sets of points where the clusters are known and I want to find the good clusters on a testing dataset. It's a kind of supervised clustering. I looked for articles about…

clustering unsupervised-learning supervised-learning semi-supervised-learning

asked Sep 22 '19 at 18:46

Rodolphe LAMPE

0 answers

Inductive vs Transductive Learning

I am reading about Inductive and Transductive Learning. Some of the questions that come to mind are the following: What is the difference between these two? Which algorithms are usually employed for these methods? Why would someone choose the…

semi-supervised-learning

asked Jul 22 '19 at 13:11

Outcast

1 answer

Generic strategy for object detection

I have a huge collection of objects, of which only a tiny fraction are in a class of interest. The collection is initially unlabelled, but labels can be added using an expensive operation (for example, by a human). Currently I use the simple generic…

machine-learning classification object-recognition semi-supervised-learning active-learning

asked Mar 06 '15 at 10:28

Valentas

1 answer

Accuracy after self-training didn't change

I used a Decision Tree Classifier, which I trained with 50 000 samples. I also have a set of unlabeled samples, so I decided to use a self-training algorithm. The unlabeled set has 10 000 samples. I would like to ask if it is normal that after retraining…

accuracy semi-supervised-learning

asked Mar 23 '19 at 13:10

SMI9

0 answers

Neural Network for detecting/checking for requirements in diagrams

My question is more about which approach is a good (or the best) one for my problem: THE PROBLEM - I'm a (mechanical/software) engineer, and we spend an extensive amount of time reviewing technical drawings prior to them being complete/ready/meeting…

neural-network image-classification supervised-learning semi-supervised-learning

asked Dec 28 '18 at 20:36

amlwwalker
