Highest Voted Questions - Data Science Stack Exchange

29

votes

10 answers

Collaborating on Jupyter Notebooks

I have prepared Jupyter Notebook with some findings and I shared it with other team members through GitHub to get their feedback in a written form. It used to work like this when working together on a piece of code but does not work for Jupyter…

jupyter

asked Nov 13 '19 at 16:14

dzieciou

697
1
6
15

29

votes

1 answer

What is the difference between upsampling and bi-linear upsampling in a CNN?

I am trying to understand this paper and am unsure of what bi-linear upsampling is. Can anyone explain this at a high-level?

deep-learning cnn convolution convolutional-neural-network

asked Sep 11 '18 at 20:49

JGG

503
1
5
7

29

votes

2 answers

What is the difference between fit() and fit_generator() in Keras?

What is the difference between fit() and fit_generator() in Keras? When should I use fit() vs fit_generator()?

deep-learning keras

asked Jul 13 '18 at 20:21

N.IT

1,975
4
17
35

29

votes

1 answer

Should one hot vectors be scaled with numerical attributes

In the case of having a combination of categorical and numerical Attributes, I usually convert the categorical attributes to one hot vectors. My question is do I leave those vectors as is and scale the numerical attributes through…

feature-engineering feature-scaling data-science-model

asked May 14 '18 at 17:54

Suresh Kasipandy

568
1
4
8

29

votes

4 answers

Books about the "Science" in Data Science?

What are the books about the science and mathematics behind data science? It feels like so many "data science" books are programming tutorials and don't touch things like data generating processes and statistical inference. I can already code, what…

statistics reference-request

asked Jun 11 '14 at 13:28

Anton

399
4
5

29

votes

6 answers

Deep learning basics

I am looking for a paper detailing the very basics of deep learning. Ideally like the Andrew Ng course for deep learning. Do you know where I can find this ?

machine-learning deep-learning

asked Dec 08 '14 at 22:37

Maxi

433
5
7

29

votes

6 answers

L2 loss vs. mean squared loss

I see some literature consider L2 loss (least squared error) and mean squared error loss are two different kinds of loss functions. However, it seems to me these two loss functions essentially compute the same thing (with a 1/n factor…

loss-function

asked Jan 01 '18 at 01:58

Edamame

2,705
5
23
32

29

votes

1 answer

What is Hellinger Distance and when to use it?

I am interested in knowing what really happens in Hellinger Distance (in simple terms). Furthermore, I am also interested in knowing what are types of problems that we can use Hellinger Distance? What are the benefits of using Hellinger Distance?

machine-learning data-mining text-mining distance

asked Aug 31 '17 at 02:11

Smith Volka

665
2
6
13

29

votes

10 answers

Any Online R console?

I am looking for an online console for the language R. Like I write the code and the server should execute and provide me with the output. Similar to the website Datacamp.

r statistics

asked Oct 13 '14 at 21:13

Gotham

291
1
3
3

29

votes

1 answer

Word2Vec vs. Sentence2Vec vs. Doc2Vec

I recently came across the terms Word2Vec, Sentence2Vec and Doc2Vec and kind of confused as I am new to vector semantics. Can someone please elaborate the differences in these methods in simple words. What are the most suitable tasks for each…

machine-learning data-mining clustering nlp unsupervised-learning

asked Jun 30 '17 at 07:05

Smith

529
1
5
14

29

votes

1 answer

NLP - why is "not" a stop word?

I am trying to remove stop words before performing topic modeling. I noticed that some negation words (not, nor, never, none etc..) are usually considered to be stop words. For example, NLTK, spacy and sklearn include "not" on their stop word lists.…

nlp topic-model sentiment-analysis

asked Dec 15 '16 at 22:20

E.K.

405
4
6

29

votes

7 answers

Difference between AlphaGo's policy network and value network

I was reading a high level summary about Google's AlphaGo, and I came across the terms "policy network" and "value network". At a high level, I understand that the policy network is used to suggest moves and the value network is used to, "Reduce the…

machine-learning reinforcement-learning

asked Mar 28 '16 at 16:40

Ryan Zotti

4,129
3
19
32

28

votes

2 answers

Predicting a word using Word2vec model

Given a sentence: "When I open the ?? door it starts heating automatically" I would like to get the list of possible words in ?? with a probability. The basic concept used in word2vec model is to "predict" a word given surrounding context. Once the…

nlp predictive-modeling word-embeddings

asked Jan 14 '16 at 07:13

DED

345
1
3
7

28

votes

3 answers

Data Science Project Ideas

I don't know if this is a right place to ask this question, but a community dedicated to Data Science should be the most appropriate place in my opinion. I have just started with Data Science and Machine learning. I am looking for long term project…

machine-learning bigdata dataset

asked Jul 25 '14 at 18:36

Kevin Desai

383
1
3
4

28

votes

4 answers

Scikit-learn: Getting SGDClassifier to predict as well as a Logistic Regression

A way to train a Logistic Regression is by using stochastic gradient descent, which scikit-learn offers an interface to. What I would like to do is take a scikit-learn's SGDClassifier and have it score the same as a Logistic Regression here.…

python logistic-regression scikit-learn gradient-descent

asked Aug 04 '15 at 08:11

hlin117

675
1
8
11

Most Popular