Most Popular

1500 questions
29
votes
10 answers

Collaborating on Jupyter Notebooks

I have prepared Jupyter Notebook with some findings and I shared it with other team members through GitHub to get their feedback in a written form. It used to work like this when working together on a piece of code but does not work for Jupyter…
dzieciou
  • 697
  • 1
  • 6
  • 15
29
votes
1 answer

What is the difference between upsampling and bi-linear upsampling in a CNN?

I am trying to understand this paper and am unsure of what bi-linear upsampling is. Can anyone explain this at a high-level?
JGG
  • 503
  • 1
  • 5
  • 7
29
votes
2 answers

What is the difference between fit() and fit_generator() in Keras?

What is the difference between fit() and fit_generator() in Keras? When should I use fit() vs fit_generator()?
N.IT
  • 1,975
  • 4
  • 17
  • 35
29
votes
1 answer

Should one hot vectors be scaled with numerical attributes

In the case of having a combination of categorical and numerical Attributes, I usually convert the categorical attributes to one hot vectors. My question is do I leave those vectors as is and scale the numerical attributes through…
29
votes
4 answers

Books about the "Science" in Data Science?

What are the books about the science and mathematics behind data science? It feels like so many "data science" books are programming tutorials and don't touch things like data generating processes and statistical inference. I can already code, what…
Anton
  • 399
  • 4
  • 5
29
votes
6 answers

Deep learning basics

I am looking for a paper detailing the very basics of deep learning. Ideally like the Andrew Ng course for deep learning. Do you know where I can find this ?
Maxi
  • 433
  • 5
  • 7
29
votes
6 answers

L2 loss vs. mean squared loss

I see some literature consider L2 loss (least squared error) and mean squared error loss are two different kinds of loss functions. However, it seems to me these two loss functions essentially compute the same thing (with a 1/n factor…
Edamame
  • 2,705
  • 5
  • 23
  • 32
29
votes
1 answer

What is Hellinger Distance and when to use it?

I am interested in knowing what really happens in Hellinger Distance (in simple terms). Furthermore, I am also interested in knowing what are types of problems that we can use Hellinger Distance? What are the benefits of using Hellinger Distance?
Smith Volka
  • 665
  • 2
  • 6
  • 13
29
votes
10 answers

Any Online R console?

I am looking for an online console for the language R. Like I write the code and the server should execute and provide me with the output. Similar to the website Datacamp.
Gotham
  • 291
  • 1
  • 3
  • 3
29
votes
1 answer

Word2Vec vs. Sentence2Vec vs. Doc2Vec

I recently came across the terms Word2Vec, Sentence2Vec and Doc2Vec and kind of confused as I am new to vector semantics. Can someone please elaborate the differences in these methods in simple words. What are the most suitable tasks for each…
29
votes
1 answer

NLP - why is "not" a stop word?

I am trying to remove stop words before performing topic modeling. I noticed that some negation words (not, nor, never, none etc..) are usually considered to be stop words. For example, NLTK, spacy and sklearn include "not" on their stop word lists.…
E.K.
  • 405
  • 4
  • 6
29
votes
7 answers

Difference between AlphaGo's policy network and value network

I was reading a high level summary about Google's AlphaGo, and I came across the terms "policy network" and "value network". At a high level, I understand that the policy network is used to suggest moves and the value network is used to, "Reduce the…
Ryan Zotti
  • 4,129
  • 3
  • 19
  • 32
28
votes
2 answers

Predicting a word using Word2vec model

Given a sentence: "When I open the ?? door it starts heating automatically" I would like to get the list of possible words in ?? with a probability. The basic concept used in word2vec model is to "predict" a word given surrounding context. Once the…
DED
  • 345
  • 1
  • 3
  • 7
28
votes
3 answers

Data Science Project Ideas

I don't know if this is a right place to ask this question, but a community dedicated to Data Science should be the most appropriate place in my opinion. I have just started with Data Science and Machine learning. I am looking for long term project…
Kevin Desai
  • 383
  • 1
  • 3
  • 4
28
votes
4 answers

Scikit-learn: Getting SGDClassifier to predict as well as a Logistic Regression

A way to train a Logistic Regression is by using stochastic gradient descent, which scikit-learn offers an interface to. What I would like to do is take a scikit-learn's SGDClassifier and have it score the same as a Logistic Regression here.…
hlin117
  • 675
  • 1
  • 8
  • 11