Questions tagged [faster-rcnn]

49 questions
10
votes
2 answers

How does the bounding box regressor work in Fast R-CNN?

In the fast R-CNN paper (https://arxiv.org/abs/1504.08083) by Ross Girshick, the bounding box parameters are continuous variables. These values are predicted using regression method. Unlike other neural network outputs, these values do not represent…
9
votes
2 answers

Is Faster RCNN the same thing as VGG-16, RESNET-50, etc... or not?

My understanding is that Faster RCNN is an architecture for performing object detection. It finds objects in an image and classifies them. My understanding is also that VGG-16, RESNET-50, etc... also find objects in images and classify them. Are…
b19wh33l5
  • 91
  • 1
  • 2
8
votes
4 answers

Faster-RCNN how anchor work with slider in RPN layer?

I am trying to understand the whole Faster-RCNN, From https://www.quora.com/How-does-the-region-proposal-network-RPN-in-Faster-R-CNN-work Then a sliding window is run spatially on these feature maps. The size of sliding window is n×n (here 3×3).…
5
votes
2 answers

Feeding 3 consecutive video frames to a CNN to track a tennis ball

I want to use CNN transfer learning to track a tennis ball from TV broadcasts of tennis matches. I used VGG annotating tool annotation tool link (use version 1 of the tool for compatibility with matterport code) and have about 200 frames annotated…
mLstudent33
  • 574
  • 1
  • 4
  • 17
4
votes
2 answers

Backpropagation in Faster R-CNN

I understand how the convolution layers are applied after selective search finds the regions of interest in vanilla R-CNN and so the back-propagation or any weight updating is done in the individual convolution networks. But in Faster R-CNN a…
deadcode
  • 153
  • 1
  • 7
3
votes
1 answer

Faster RCNN-RPN NETWORK

I already asked this question in stack overflow, but got response from experts, to post this question here, please help me to understand this concept... I am trying to understand RPN network in Faster RCNN. I understand the concept of RPN…
3
votes
0 answers

Bounding box regression in R-CNN

In R-CNN paper, they give the definition of the target values for bounding box regression Given that $(P, G)$ is a (prediction box, ground-truth box) pair of the form $(x, y, w, h)$ where $x, y$ is the center coordinate of the box, $w, h$ are width…
HOANG GIANG
  • 159
  • 9
3
votes
1 answer

Get bounding boxes for adjacent instances of a single class in image

I have a dataset with thousands of music score pages and manually annotated bounding boxes for the individual bars: My objective is now to train a DNN that should ultimately be able to get these bounding boxes on its own. First idea was to use…
sonovice
  • 131
  • 2
2
votes
1 answer

Why does Faster R-CNN use SGD optimizer instead of Adam?

I just start learning Faster R-CNN and I have some doubts about the optimizer of this network. In my understanding, Adam optimizer performs much better than SGD in a lot of networks. However, the paper of Faster R-CNN choose SGD optimizer instead of…
2
votes
2 answers

Pre trained dataset for Car damage detection

I'm making a Car Damage Detection model which would have 2 classes to detect upon. My dataset has a total of 300 images (out of which I'd be using some for testing), which are totally insufficient to train the model from scratch. Can I use a…
2
votes
1 answer

How does R-CNN and AlexNet compare?

I know AlexNet does object classification in images [categories] and R-CNN does object localization [category and bounding box]. How does R-CNN and AlexNet compare? Are they used for the same purpose or R-CNN does more? Does R-CNN use Alexnet as a…
Fakrudeen
  • 123
  • 4
2
votes
1 answer

what is the best approach to detect small objects with similar shape?

I'm working a model which detect different products in supermarket shelf. In the training data, there are a lot of objects with similar shape placed very close to or stacked to each others.(eg: milks with different brands are stacked, placed on the…
2
votes
0 answers

Which model is used for document extraction (CamScanner, Microsoft Lens etc)

I want to start a small project where I'd create a model(s) that would extract document from a picture and rescale it, something like CamScanner or Microsoft Lens apps do. I've gathered a small dataset just to prototype the concept, but I'm not sure…
1
vote
0 answers

Understanding how anchors are created in a regional proposal network

I understand that in Faster R-CNN, the image is fed into a pre-trained CNN (such as VG16). So say I have a 37x50x512 feature map. Firstly, I assume that each feature map (37x50x1) is fed into the RPN? or does each of the 512 feature maps get fed in…
1
vote
0 answers

Best way to train yolov5 on a custom dataset

I have a dataset with about 100 images that look like this. My goal is get yolov5 to detect buildings in similar images. In order to do this I would like yolov5 to get to close to 1 in precision on the training dataset The images are of size 10000…
Ariel Baron
  • 111
  • 1
1
2 3 4