Most Popular

1500 questions
27
votes
4 answers

macro average and weighted average meaning in classification_report

I use the "classification_report" from from sklearn.metrics import classification_report in order to evaluate the imbalanced binary classification Classification Report : precision recall f1-score support 0 1.00…
user10296606
  • 1,784
  • 5
  • 17
  • 31
27
votes
5 answers

BERT vs Word2VEC: Is bert disambiguating the meaning of the word vector?

Word2vec: Word2vec provides a vector for each token/word and those vectors encode the meaning of the word. Although those vectors are not human interpretable, the meaning of the vectors are understandable/interpretable by comparing with other…
sovon
  • 521
  • 1
  • 5
  • 7
27
votes
4 answers

Cross validation Vs. Train Validate Test

I have a doubt regarding the cross validation approach and train-validation-test approach. I was told that I can split a dataset into 3 parts: Train: we train the model. Validation: we validate and adjust model parameters. Test: never seen before…
NaveganTeX
  • 445
  • 1
  • 4
  • 9
27
votes
2 answers

What is the difference between semantic segmentation, object detection and instance segmentation?

I'm fairly new at computer vision and I've read an explanation at a medium post, however it still isn't clear for me how they truly differ.
Guilherme Marques
  • 398
  • 1
  • 3
  • 8
27
votes
4 answers

Word2Vec for Named Entity Recognition

I'm looking to use google's word2vec implementation to build a named entity recognition system. I've heard that recursive neural nets with back propagation through structure are well suited for named entity recognition tasks, but I've been unable…
Madison May
  • 2,029
  • 2
  • 17
  • 18
27
votes
2 answers

What is the advantage of using log softmax instead of softmax?

Are there any advantages to using log softmax over softmax? What are the reasons to choose one over the other?
rawwar
  • 831
  • 2
  • 12
  • 23
27
votes
1 answer

Adaboost vs Gradient Boosting

How is AdaBoost different from a Gradient Boosting algorithm since both of them use a Boosting technique? I could not figure out actual difference between these both algorithms from a theory point of view.
CodeMaster GoGo
  • 768
  • 1
  • 6
  • 15
27
votes
2 answers

How to deal with time series which change in seasonality or other patterns?

Background I'm working on a time series data set of energy meter readings. The length of the series varies by meter - for some I have several years, others only a few months, etc. Many display significant seasonality, and often multiple layers -…
Jo Douglass
  • 401
  • 1
  • 5
  • 10
27
votes
2 answers

local minima vs saddle points in deep learning

I heard Andrew Ng (in a video I unfortunately can't find anymore) talk about how the understanding of local minima in deep learning problems has changed in the sense that they are now regarded as less problematic because in high-dimensional spaces…
oW_
  • 6,254
  • 4
  • 28
  • 45
27
votes
3 answers

How to sum values grouped by two columns in pandas

I have a Pandas DataFrame like this: df = pd.DataFrame({ 'Date': ['2017-1-1', '2017-1-1', '2017-1-2', '2017-1-2', '2017-1-3'], 'Groups': ['one', 'one', 'one', 'two', 'two'], 'data': range(1, 6)}) Date Groups data 0 …
Kevin
  • 533
  • 2
  • 5
  • 12
27
votes
3 answers

How to deal with string labels in multi-class classification with keras?

I am newbie on machine learning and keras and now working a multi-class image classification problem using keras. The input is tagged image. After some pre-processing, the training data is represented in Python list as: [["dog",…
Dracarys
  • 393
  • 1
  • 3
  • 5
27
votes
4 answers

Is there a straightforward way to run pandas.DataFrame.isin in parallel?

I have a modeling and scoring program that makes heavy use of the DataFrame.isin function of pandas, searching through lists of facebook "like" records of individual users for each of a few thousand specific pages. This is the most time-consuming…
Therriault
  • 871
  • 1
  • 8
  • 13
27
votes
1 answer

PyTorch vs. Tensorflow Fold

Both PyTorch and Tensorflow Fold are deep learning frameworks meant to deal with situations where the input data has non-uniform length or dimensions (that is, situations where dynamic graphs are useful or needed). I would like to know how they…
noe
  • 22,074
  • 1
  • 43
  • 70
27
votes
3 answers

Encoding categorical variables using likelihood estimation

I am trying to understand how I can encode categorical variables using likelihood estimation, but have had little success so far. Any suggestions would be greatly appreciated.
small dwarf
  • 271
  • 1
  • 3
  • 4
26
votes
2 answers

Text categorization: combining different kind of features

The problem I am tackling is categorizing short texts into multiple classes. My current approach is to use tf-idf weighted term frequencies and learn a simple linear classifier (logistic regression). This works reasonably well (around 90% macro F-1…