Questions tagged [notation]

Notation refers to a specific way of writing various concepts, functions, etc. Usually it is introduced to abbreviate complicated expressions and make common idioms more readable.

16 questions
5
votes
1 answer

What types of matrix multiplication are used in Machine Learning? When are they used?

I'm looking at equations for neural networks and backpropagation and I see this symbol in the equations, ⊙. I thought matrix multiplication of neural networks always involved matrices that matched dimensions on both sides, such as... [3, 3]@[3, 2].…
Ant
  • 177
  • 1
  • 5
4
votes
3 answers

Notation for features (general notation for continuous and discrete random variables)

I'm looking for the right notation for features from different types. Let us say that my samples as $m$ features that can be modeled with $X_1,...,X_m$. The features Don't share the same distribution (i.e. some categorical, some numerical, etc.).…
Yael M
  • 41
  • 2
3
votes
1 answer

Layer notation for convolutional neural networks

When reading about convolutional neural networks (CNNs), I often come across a special notation used in the community and in scientific papers, describing the architecture of the network in terms of layers. However, I was not able to find a paper or…
Czechnology
  • 135
  • 6
3
votes
1 answer

Why is an activation function notated as "g"?

In many cases an activation function is notated as g (e.g. Andrew Ng's Course courses), especially if it doesn't refer to any specific activation function such as sigmoid. However, where does this convention come from? And for what reason did g…
Blaszard
  • 901
  • 1
  • 13
  • 29
2
votes
2 answers

Formal math notation of masked vector

I'm struggling to write my algorithm in a concise and correct way. The following is an explanation for an optimizer's update step of part of a vector of weights (not a matrix in my case). I have a vector $\alpha \in \mathbb{R}^d$, and a set $S$ that…
leed
  • 145
  • 4
2
votes
2 answers

Pound notation in Summation

I was going through a paper comparing glove and word2vec. I came across the pound notation shown below. What does it mean when used like this? The link for paper is here
Sagar Patel
  • 63
  • 1
  • 5
2
votes
1 answer

Layer notation for feed forward neural networks

Apologies in advance, for I have a fairly rudimentary question on the notations for studying Feed-Forward Neural Networks. Here is a nice schematic taken from this blog-post. Here $x_i = f_i(W_i \cdot x_{i-1})$ where $f_i$ is the activation…
2
votes
1 answer

What does the pair of vertical lines in empirical entropy formula mean?

I am learning from this post. $\alpha$ is the ratio of the first subset, $$\alpha=\frac{\left|D_{1}\right|}{\left|D\right|}$$ according to the context and code of the post, $\left|D\right|$ means the number of samples? What are the pair of…
Jay
  • 425
  • 3
  • 8
2
votes
1 answer

PAC Learnability - Notation

The following is from Understanding Machine Learning: Theory to Algorithm textbook: Definition of PAC Learnability: A hypothesis class $\mathcal H$ is PAC learnable if there exist a function $m_H : (0, 1)^2 \rightarrow \mathbb{N}$ and a learning…
tkj80
  • 139
  • 3
2
votes
1 answer

Explain notation in Bishop Eq 1.60

I'm starting to work through Bishop's "Pattern Recognition and Machine Learning" book, and have run into unfamiliar notation. Eq. 160 is prefaced by saying "For this purpose, we shall assume that, given the value of $x$, the corresponding value of…
Jolyon
  • 123
  • 2
1
vote
1 answer

Meaning of equation for CNN probabilities

So the first equation above refers to a CNN (rather a committee of CNNs) for image classification. I am unable to understand exactly what the author is trying to do in the first equation. So far, I think they're calculating the index of max…
Stormlight
  • 33
  • 4
1
vote
0 answers

Does this notation mean vector-concatenation?

When reading papers on neural networks, I occasionally stumble upon the following notation with a semicolon: $$ \text{tanh}(\mathbf{W_c}[\mathbf{c}_t;\mathbf{h}_t]) $$ Unless otherwise noted, does this by default mean the following: vector…
Kari
  • 2,686
  • 1
  • 17
  • 47
1
vote
1 answer

Cannot see what the "notation abuse" is, mentioned by author of book

From Sutton and Barto, Reinforcement Learning: An Introduction (second edition draft), in equation 3.4 of page 38. The probabilities given by the four-argument function p completely characterize the dynamics of a finite MDP. From it, one can…
cinqS
  • 367
  • 1
  • 2
  • 13
1
vote
1 answer

What is meant by this notation for ensemble classifier error rate

The below is a picture which denotes the error of an ensemble classifier. Can someone help me understand the notation What does it mean to have (25 and i) in brackets and what is ε^1 is it error of first classifier or the error rate raised to power…
Arun Jose
  • 125
  • 4
1
vote
0 answers

Probability notation q(y) and q(Y) and its implication to vector functions

The function in question is (from Appendix B, Proof of proposition 2.1 from Posterior Regularization for Structured Latent Variable Models): $$q(\textbf{Z}) = \frac{p_{\theta}(\textbf{Z}|\textbf{X})exp(\lambda^T \cdot…
Martin G
  • 131
  • 3
1
2