Convolutional Neural Networks (CNN, also called ConvNets) are a tool used for classification tasks and image recognition. The name giving first step is the extraction of features from the input data.
Questions tagged [cnn]
1345 questions
59
votes
3 answers
How to set batch_size, steps_per epoch, and validation steps?
I am starting to learn CNNs using Keras. I am using the theano backend.
I don't understand how to set values to:
batch_size
steps_per_epoch
validation_steps
What should be the value set to batch_size, steps_per_epoch, and validation_steps, if I…
Ermene
- 693
- 1
- 6
- 6
40
votes
3 answers
What's the difference between Attention vs Self-Attention? What problems does each other solve that the other can't?
As stated in the question above..is there a difference between attention and self attention mechanism ? Also additionally can anybody share with me tips and tricks about how self attention mechanism can be implemented in CNN?
Pratik.S
- 533
- 1
- 5
- 10
36
votes
6 answers
Why do convolutional neural networks work?
I have often heard people saying that why convolutional neural networks are still poorly understood. Is it known why convolutional neural networks always end up learning increasingly sophisticated features as we go up the layers?
What caused them…
Praise the lord
- 461
- 1
- 4
- 5
30
votes
1 answer
What is the difference between upsampling and bi-linear upsampling in a CNN?
I am trying to understand this paper and am unsure of what bi-linear upsampling is. Can anyone explain this at a high-level?
JGG
- 523
- 2
- 5
- 8
29
votes
4 answers
What is a channel in a CNN?
I was reading an article about convolutional neural networks, and I found something that I don't understand, which is:
The filter must have the same number of channels as the input image so that the element-wise multiplication can take place.
Now,…
J.D.
- 941
- 6
- 20
- 33
26
votes
1 answer
back propagation in CNN
I have the following CNN:
I start with an input image of size 5x5
Then I apply convolution using 2x2 kernel and stride = 1, that produces feature map of size 4x4.
Then I apply 2x2 max-pooling with stride = 2, that reduces feature map to size 2x2.…
koryakinp
- 436
- 2
- 5
- 14
24
votes
1 answer
How to add non-image features along side images as the input of CNNs
I'm training a convolutional neural network to classify images on fog conditions (3 classes). However, for each of about 150.000 images I also have four meteorological variables available that might help in predicting the classes of the images. I…
Josh
- 497
- 4
- 8
21
votes
2 answers
In CNN, why do we increase the number of filters in deeper Convolution layers for complex images?
I have been doing this online course Introduction to TensorFlow for AI, ML and DL. Here in one part, they were showing a CNN model for classifying human and horses. In this model, the first Conv2D layer had 16 filters, followed by two more Conv2D…
Sanjay
- 333
- 1
- 2
- 8
21
votes
2 answers
What are the differences between Convolutional1D, Convolutional2D, and Convolutional3D?
I've been learning about Convolutional Neural Networks. When looking at Keras examples, I came across three different convolution methods. Namely, 1D, 2D & 3D. What are the differences between these three layers? What are their use cases? Are there…
Saurabh
- 357
- 1
- 2
- 9
17
votes
2 answers
Updating the weights of the filters in a CNN
I am currently trying to understand the architecture of a CNN. I understand the convolution, the ReLU layer, pooling layer, and fully connected layer. However, I am still confused about the weights.
In a normal neural network, each neuron has its…
Felix
- 173
- 1
- 1
- 5
16
votes
6 answers
How to prepare the varied size input in CNN prediction
I want to make a CNN model in Keras which can be fed images of different sizes. According to other questions, I could understand how to set a model, like Input =(None,None,3). However, I'm not sure how to prepare the input/output…
kainamanama
- 311
- 1
- 3
- 8
14
votes
3 answers
what is darknet and why is it needed for YOLO object detection?
what is darknet and why is it needed for YOLO object detection ? I read that its a neural network written in C , but why is it needed for YOLO object detection when we have lot of machine learning framework,api like tensorflow,keras,pytorch .
Im…
star
- 1,521
- 7
- 20
- 31
14
votes
2 answers
CNN - How does backpropagation with weight-sharing work exactly?
Consider a Convolutional Neural Network (CNN) for image classification. In order to detect local features, weight-sharing is used among units in the same convolutional layer.
In such a network, the kernel weights are updated via the backpropagation…
Andy R
- 453
- 1
- 4
- 8
13
votes
2 answers
Finding outliers in Image dataset
I have been working on an image classification tasks for which I am extracting the image frames from the video stream collected for different classes.
I have already trained an image classification model (using transfer learning) however due to the…
deepguy
- 1,471
- 8
- 21
- 39
13
votes
1 answer
What is difference between Fully Connected layer and Bilinear layer in CNN?
What is the difference between Fully Connected layers and Bilinear layers in deep learning?
N.IT
- 2,015
- 4
- 21
- 36