Hard Negative Mining in CNN leading to class imbalance?

Dirk Tassilo Hettich

Dear Haziq, your notion is correct. Imbalanced classes cause substantial problems in binary classification and in the interpretation of results. One way to address this is to use a performance measure other than accuracy (e.g. AUC values) in conjunction with permutation tests. Although it comes from another domain, my article tries to shed some light on how to tackle class imbalance in applied machine learning.
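To make the suggestion above concrete, here is a minimal sketch in plain Python of an AUC computation combined with a label-permutation test. The function names `auc` and `permutation_p_value`, the number of permutations, and the toy data are assumptions made for illustration; they are not taken from the article.

```python
import random

def auc(labels, scores):
    """Area under the ROC curve by pairwise comparison (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def permutation_p_value(labels, scores, n_permutations=1000, seed=0):
    """Estimate how often shuffled labels reach the observed AUC."""
    rng = random.Random(seed)
    observed = auc(labels, scores)
    shuffled = list(labels)
    at_least_as_good = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if auc(shuffled, scores) >= observed:
            at_least_as_good += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (at_least_as_good + 1) / (n_permutations + 1)

# Toy imbalanced data (hypothetical): 2 positives, 8 negatives.
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# A classifier that always says "negative" looks fine by accuracy ...
always_negative = [0.0] * len(labels)
accuracy = sum(1 for y in labels if y == 0) / len(labels)  # 0.8
# ... but its AUC is at chance level.
trivial_auc = auc(labels, always_negative)  # 0.5

# A classifier that ranks both positives first.
good_scores = [0.9, 0.8, 0.3, 0.2, 0.25, 0.1, 0.15, 0.05, 0.12, 0.07]
observed_auc, p_value = permutation_p_value(labels, good_scores)
```

The point of the toy example is that accuracy rewards the trivial majority-class predictor, while AUC does not, and the permutation test indicates whether an observed AUC could plausibly arise by chance.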

Article EEG Responses to Auditory Stimuli for Automatic Affect Recognition

Floris De Smedt

Hi,

I am no expert on CNN object detection, but in "Taking a deeper look at pedestrians" training is also performed on an imbalanced set. The authors state that this imbalance has minimal influence on the resulting accuracy. Note, however, that the samples they use are generated by another pedestrian detection algorithm.

I think whether the balance matters depends on the application. In object detection, the number of samples that should be classified as negative is in most cases much larger than the number of positives. This is also reflected in the training procedure of non-CNN approaches (ACF, ICF, ...), where the number of negatives is often 5-10 times the number of positives. I see no reason why this should be different for a CNN.
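As a rough illustration of keeping the negative set at a bounded multiple of the positives while still mining hard negatives, the sketch below keeps only the highest-scoring negatives up to a ratio cap. The function name `mine_hard_negatives`, the `max_ratio` parameter, and the scores are hypothetical; this is not the procedure used by ACF, ICF, or the cited paper.

```python
def mine_hard_negatives(neg_scores, num_positives, max_ratio=5):
    """Return indices of the hardest negatives, capped at max_ratio * num_positives.

    neg_scores[i] is the current model's positive-class score for negative
    sample i. A higher score means a more confident false positive, which is
    treated here as a harder negative.
    """
    budget = max_ratio * num_positives
    ranked = sorted(range(len(neg_scores)),
                    key=lambda i: neg_scores[i],
                    reverse=True)
    return ranked[:budget]

# Hypothetical detector scores on six negative windows, with one positive.
neg_scores = [0.05, 0.92, 0.40, 0.71, 0.10, 0.88]
selected = mine_hard_negatives(neg_scores, num_positives=1, max_ratio=3)
# selected == [1, 5, 3]: the three most confidently misclassified negatives
```

With a cap like this, each mining round adds difficult negatives without letting the negative class grow without bound relative to the positives.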

Paul Yarnold

Imbalanced class sizes aren't a problem for maximum-accuracy methods. The following paper presents a brief description and comparison of various legacy methods versus enumerated-optimal classification tree analysis.

https://www.researchgate.net/publication/291947229_Using_data_mining_techniques_to_characterize_participation_in_observational_studies

Article Using data mining techniques to characterize participation i...

Christian Perone

There are many ways to address your class imbalance. The most widely used approach that I'm aware of, which is easy to apply and available in almost every deep learning framework, is to use a penalized model. Such a model incurs an additional cost for classification mistakes on the minority class during training. In short, the penalty forces the model to pay more attention to the classes for which you have less data.
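A minimal sketch of such a penalty, written in plain Python rather than with any particular framework: per-class weights inversely proportional to class frequency, applied to a binary cross-entropy loss. The helper names and the inverse-frequency weighting rule are assumptions chosen for illustration; frameworks expose comparable options (for example, PyTorch's `torch.nn.CrossEntropyLoss` accepts a `weight` tensor).

```python
import math

def inverse_frequency_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    n, k = len(labels), len(counts)
    return {c: n / (k * cnt) for c, cnt in counts.items()}

def weighted_binary_cross_entropy(labels, probs, weights, eps=1e-12):
    """Weighted mean of per-sample losses; probs[i] is the predicted P(y=1)."""
    total, norm = 0.0, 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        loss = -math.log(p) if y == 1 else -math.log(1.0 - p)
        total += weights[y] * loss
        norm += weights[y]
    return total / norm

# Hypothetical data: 1 positive, 9 negatives.
labels = [1] + [0] * 9
weights = inverse_frequency_weights(labels)  # {1: 5.0, 0: 0.555...}

miss_positive = [0.1] + [0.1] * 9        # the positive is misclassified
miss_negative = [0.9, 0.9] + [0.1] * 8   # one negative is misclassified

uniform = {0: 1.0, 1: 1.0}
unweighted_gap = (weighted_binary_cross_entropy(labels, miss_positive, uniform)
                  - weighted_binary_cross_entropy(labels, miss_negative, uniform))
weighted_gap = (weighted_binary_cross_entropy(labels, miss_positive, weights)
                - weighted_binary_cross_entropy(labels, miss_negative, weights))
```

In this toy case the unweighted loss treats the two mistakes identically (`unweighted_gap` is zero up to rounding), whereas the weighted loss charges more for missing the minority-class sample (`weighted_gap` is positive).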

Tal Schuster

Dear Haziq,

You might find this paper useful. It proposes an interleaved learning method that balances hard and easy samples, which improved the results of a CNN generating descriptors for optical flow.

The correlation between the difficulty of samples and their subcategory should be considered, since some classes might be neglected, or optimizing for the harder class might disrupt the easier one. In that case, a learning method that uses all samples but accounts for their difficulty may be preferable.
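One simple way to use all samples while accounting for their difficulty is to mix hard and easy samples in every mini-batch. The sketch below is only an illustration of that general idea and is not the method from the paper; the function name `interleaved_batches`, the use of the current per-sample loss as the difficulty measure, and the half/half split are all assumptions.

```python
def interleaved_batches(sample_ids, losses, batch_size):
    """Alternate hard and easy samples, then cut the ordering into batches.

    losses maps each sample id to its current loss; a higher loss is treated
    as a harder sample. Every sample appears exactly once.
    """
    ranked = sorted(sample_ids, key=lambda s: losses[s], reverse=True)
    half = (len(ranked) + 1) // 2
    hard, easy = ranked[:half], ranked[half:]
    order = []
    for i in range(half):
        order.append(hard[i])
        if i < len(easy):
            order.append(easy[i])
    return [order[i:i + batch_size] for i in range(0, len(order), batch_size)]

# Hypothetical per-sample losses from the previous epoch.
losses = {"a": 0.9, "b": 0.1, "c": 0.7, "d": 0.2, "e": 0.5, "f": 0.05}
batches = interleaved_batches(list(losses), losses, batch_size=2)
# batches == [["a", "d"], ["c", "b"], ["e", "f"]]
```

Each batch here pairs one sample from the harder half with one from the easier half, so no update step is dominated by hard examples alone.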

Conference Paper Optical Flow Requires Multiple Strategies (but Only One Network)

Kamran Kowsari

With HDLTex you can create several layers to balance your data set hierarchically.

Please read this paper:

Conference Paper HDLTex: Hierarchical Deep Learning for Text Classification

Shah Nawaz

Please take a look at Conference Paper Class Rectification Hard Mining for Imbalanced Deep Learning