Does the rectangular image as the input affect the performace of CNN models?

More Xin Gao's questions See All

How to display grain boundaries in RVE model?

Many literatures have shown that the RVE model shows the grain boundaries between different grains. How can this be achieved using DAMASK+paraview?

21 July 2024 9,224 1 View

How would I calculated ddPCR spiked nucleic acid recovery calculations?

Hello everyone, I'm in the initial stages of conducting a study involving the ddPCR (Bio-Rad QX200, QX600) and I ran into some problems involving recovery % calculations using direct...

02 July 2024 6,011 2 View

Potato handbook crop of the future--PDF?

Whoever has an electronic version of this book, sell it to me，《potato handbook crop of the future》Thanks

15 June 2024 4,404 0 View

Any advice on analyzing two-factor gene expression study?

Hello Research Community, I am currently working on a gene expression study involving two factors: treatment (control vs. treatment) and time points. The results from a two-way ANOVA using GLM...

06 June 2024 2,310 2 View

Who sold me this book《Plant Nutrition Diagnostics: Potato》?

《Plant Nutrition Diagnostics: Potato》

03 June 2024 1,663 2 View

Does the zirconium isotope fractionate as through the column?

Is the zirconium isotope of geological sample fractionated when passing through the column, and how to determine whether this fractionation is or not？

26 May 2024 6,721 2 View

In C++, why is the constructor of a virtual base class called first, while the object of the virtual base class is placed last?

In C++, why is the constructor of a virtual base class called first, while the object of the virtual base class is placed last? What are the advantages of doing this?

07 May 2024 1,583 1 View

Through what theories can we aptly explain the social networks and subcultures that people form around cats?

Just like the stray cat protection organizations on campus, they are spontaneous and not organized by the school. They provide a series of services for kittens in life and death. And the source of...

11 April 2024 4,944 0 View

PMSCV for protein transit protein expression?

We are using pMSCV for transit expression of a protein.: 1. The gene was cloned in (between XhoI/EcoRI). 2. Transfection with lipofectmin 3000 to 293t cell. 3. After 48 hrs, GFP can be...

04 April 2024 8,350 4 View

DAB staining for biocytin filled interneurons?

Hi! We are using DAB to stain biocytin filled (20-40min) interneurons in spinal cord. As shown in the figure, the soma is stained well, but not the dendrites. Could someone tell me what might be...

25 March 2024 838 4 View

Weak DAPI staining after immunohistochemistry - how to improve?

After immunohistochemistry of previously fixed in PFA and EtOH and then frozen 20 μm sections of zebrafish brain, DAPI staining is very weak (right) compared to the same sections stained without...

05 August 2024 9,637 2 View

Dirty and clean?

Hi everyone I need a file with a dirty and clean potato image

04 August 2024 7,199 4 View

Hello, regarding Mxene 2D titanium carbide?

I fabricated Ti3C2Tx using concentrated HF 40%, I plot an XRD as attached image below.. please let to know if I obtained it or not.

02 August 2024 6,789 4 View

IHC profiler in image j software. has anyone used it to quantify nuclear DAB positivity?

Dear researchers. I tried using the IHC PROFILER in image j to quantify nuclear DAB staining. I followed the instructions in the original article by "Varghese F, Bukhari AB, Malhotra R, De A...

29 July 2024 2,229 0 View

While using the IHC PROFILER plugin in image j software, to quantify the epithelial cytoplasmic DAB staining do you crop remove the stromal tissue?

My question pertaining to the DAB staining in cytoplasm of human oral squamous cell carcinoma tissue. When quantifying the epithelial cancer cells do we have to crop remove the stromal tissue?...

29 July 2024 2,682 6 View

Why in 2D image created by discovery studio 2021, the halogens appear as alkyl group?

In my molecule there is Chloro group at 2-position of phenyl ring, but in 2D image it appears as methyl showing no interaction.

28 July 2024 734 0 View

Why is there a significant edge deviation in radar point cloud and camera registration?

The above are manually labeled extrinsic matrices based on the first image It can be seen that the projection error at the edge is large, while the error at the center is small. What could be the...

23 July 2024 7,479 3 View

About Machine Learning for Automated Diagnosis of Skin Cancer with Dermoscopy Images ?

Machine Learning for Automated Diagnosis of Skin Cancer with Dermoscopy Images

21 July 2024 5,471 0 View

Which is the best approach for anomaly detection in scanned image data set?

Anomaly detection in scanned image data set

18 July 2024 3,578 3 View

Less substrate signals towards the last lanes of the Native PAGE?

I have been running native page for FAM DNA substrate ( fluorescence samples) for protein DNA binding reaction. Binding is there but towards the end of the lane , I am loosing signals...

17 July 2024 6,213 4 View

Alexey Chernyavskiy

The issue with squared vs. rectangular images is important only in the classification tasks. In classification CNNs there is always a final fully-connected (FC) layer that performs the classification. It has the output size equal to the number of classes, and the size of its input is fixed (e.g. input = 512, output is 5 classes). So, the whole CNN part that is placed before the FC layer is tuned to produce an output of this specific size. In this way, the CNN must deal with an image of a predefined size, e.g. it's 221x221 in some popular CNNs, or it can be 256x256 in order to be more aligned with some processor architectures. So, if you want to classify an image of a different size, then feeding it through the CNN will produce a different input to the FC layer (e.g., not 512 but 499) and the FC layer will not be able to process this input (since the action of an FC layer is basically a matrix-vector multiplication).

Therefore, we usually either crop the part from the image that has the size that suits the particular CNN architecture, or we resize this image. Resizing is not always good since it kills the aspect ratio which may harm classification accuracy.

This requirement for a fixed input size for classification can be removed if we use tricks like Global average pooling, so that the input to the FC is always the same size (equal to the number of feature maps it the final convolutional layer that goes right before the FC), no matter which size in the input image.

So, for classification, there is no requirement for the image to be square. It can be rectangular as well, but it should have the same fixed size.

If the CNN doesn't contain an FC layer - e.g., it's fully-convolutional as in many classification tasks and almost all modern segmentation CNNs - then the shape of the input doesn't matter.

Murtaza Khan

In general, if you resize an image, it will affect the performance of the machine learning algorithms such as CNN. Because extraction of features in an image utilizes pixels and resizing will remove or add pixels from the image consequently features after and before resizing may not be the same. Most of the software packages support rectangle images and the width and height may need not be the same. Though large size images take longer time to process and may cause an out-of-memory error if the data size is big.