Questions tagged [data-augmentation]
122 questions
11
votes
1 answer
CNN - imbalanced classes, class weights vs data augmentation
I have a dataset with a few strongly imbalanced classes, eg. the smallest class is about 54 times smaller than the largest. Therefore, data augmentation in order to equalize the size of classes seems like a bad idea to me (in the example above each…
I.D.M
- 175
- 1
- 10
7
votes
4 answers
When using Data augmentation is it ok to validate only with the original images?
I'm working on a multi-classification deep learning algorithm and I was getting big over-fitting:
My model is supposed to classify sunglasses on 17 different brands, but I only had around 400 images from each brand so I created a folder with data…
Santiago Marin Mejia
- 73
- 1
- 1
- 4
7
votes
2 answers
Why augmenting the training data with binary attributes works better for our dataset?
We have a dataset with multiple features ~400 where all of the features have a histogram as you can see in the following picture (sampled only a few)
Our assumption
We thought that this looked like some categorical data where few of the times had a…
George Pligoropoulos
- 607
- 3
- 16
6
votes
1 answer
Is GridSearchCV in combination with ImageDataGenerator possible and recommendable?
I want to optimize some hyperparameters for a CNN architecture by using GridSearchCV (Scikit-Learn) in combination with Data Augmentation (ImageDataGenerator from Keras).
However, GridSearchCV only offers the fit function and not the fit_generator…
Code Now
- 393
- 5
- 10
6
votes
2 answers
Data Augmentation for Regression
I want to use deep learning for regression. However, the number of training samples is not large. In image processing, some new samples are generated on the basis of initial data through tasks like rotation to increase the number of data sets. Can I…
Amin
- 191
- 3
- 9
5
votes
1 answer
Memory error on using data generator in keras
I am using the following augmentations on dataset of size 9 GB:
datagen = ImageDataGenerator(
featurewise_center=False, # set input mean to 0 over the dataset
samplewise_center=False, # set each sample mean to 0
…
Srihari
- 767
- 4
- 12
- 27
5
votes
1 answer
Data Augmentation in videos
I know some techniques for augmenting data when images are used, but I don't know if there are any such techniques specifically catered to videos. Since videos have an additional temporal dimension compared to images I was wondering if there were…
ksh
- 151
- 6
4
votes
1 answer
Why does adding data augmentation decrease training accuracy a tiny bit?
Before data augmentation, my model clearly overfits and hits a 100% training accuracy and a 52% validation accuracy. When only adding data augmentation with Keras, as a regularization technique, it achieves a 95% training accuracy with slower…
Kralley
- 43
- 1
- 3
4
votes
2 answers
Why are results without Transfer Learning better than with Transfer Learning?
I developed a neural network for license plate recognition and used the EfficientNet architecture (https://keras.io/api/applications/efficientnet/#efficientnetb0-function) with and without pretrained weights on ImageNet and with and without data…
Tobitor
- 93
- 9
4
votes
1 answer
How to implement PCA color augmentation as discussed in AlexNet
I read through "ImageNet Classification with Deep Convolutional Neural Networks" again specifically for details on how to implement PCA color augmentation.
I am unsure if I have it right. Here is how I did this in numpy:
# original image is…
kawingkelvin
- 211
- 1
- 2
- 7
3
votes
0 answers
Is it possible to increase the number of images of one class using data augmentation, which is not applied on the other class, in the same dataset?
I have 2 classes for my image classification problem, say class A and class B, and I am using tensorflow and keras for the same. One of them have around 5K images while the other have just around 2K images. Is it possible to increase the number of…
Hrushikar Teja K
- 31
- 4
3
votes
1 answer
First perform data augmentation or normalization?
Should I first perform data augmentation or normalization in deep learning? I am mainly interested in 2D and 3D input data. In tutorials that I have seen so far the data augmentation always comes first. Is there a (mathematical) reason for that?…
Anno
- 151
- 1
- 1
- 6
3
votes
1 answer
What exactly are the data augmentation experimental Keras' layers doing?
From what I gathered, data augmentation consists in increasing your number of instances in your dataset by applying some transfromations. Let's say I want to classify images. If I apply a random rotation to every image in a data set containing $n$…
CMB
- 43
- 4
3
votes
1 answer
How to do data augmentation for Machine Learning on statistical data?
I am training machine learning classification model. I have data in csv format with lets say 5 features(or columns) and 100 such observations(or rows).
I want to add more similar data to improve my model accuracy. I do not have more data from…
Mitesh Patel
- 159
- 1
- 3
3
votes
0 answers
Back-Translation model for German and English
Do you know of any pre-trained models for back translation between German and English? I am aware that there are ways to include a monolingual corpus into the training of a machine translation model (often referred to as back translation). I am…
treebased
- 31
- 2