Generating discriminative features and addressing feature variations across different datasets for same set of data variables?

More Saurabh Malgaonkar's questions See All

How to estimate sample size for GWAS of continuous and discrete traits? What are the pre-requisites?

Genome-wide association study (GWAS) Continuous traits: eg. Height Discrete traits: eg. Eye color

28 July 2024 286 0 View

May i use mgcl2 or cucl2 or bivalent salt in place of calcium chloride for crosslinking the membrane consisting sodium alginate. which is better?

i am working on self heating membrane preparation. so for crosslinking of membrane, may i use copper chloride, or magnesium chloride or any bi valent cation salt to crosslink it? and which one...

29 May 2024 2,474 0 View

Any idea or sample data used for simulating gasket (using gasket element type) material with a pressure-closure curve and other details in Abaqus?

It is a sponsored project to develop a gasket made up of FGM material and validate the nonlinear behavior of the FGM using Abaqus simulation. I was exploring the behavior of the gasket element...

25 May 2024 2,407 0 View

Can we detect consciousness in newborn infants?

How can we determine if newborn infants possess consciousness, and if so, what methods and measures can we use to identify and assess this awareness within their developing minds?

19 May 2024 788 7 View

How empathy can be quantified ?

In what ways can empathy be measured or assessed to determine its extent or impact, and how can its presence or level be quantified through observable behaviours, physiological responses, or...

17 May 2024 4,265 3 View

Reductionist approach of science vs unexplored areas like strong emergence and consciousness ?

As AI can take care of the reductionist approach of science very well, can humans concentrate more on unexplored areas like strong emergence and consciousness?

16 May 2024 3,982 0 View

How ALD-Al2O3 can work as seeding layer for tin oxide deposition by thermal Atomic layer deposition (ALD)on a substrate?

While Al2O3 has 3 gm/cc and Tin oxide has 7 gm/cc measured on as low as 8nm films on same substrate, so how a rarer material can provide seeding to a denser material? As it seems that in rarer...

30 March 2024 9,305 1 View

Is it OK to use AI in writing research articles?

Nowadays AI is being used in every field. Even researchers are also using AI to write papers. Manually it used to take lot of time to rightly knit the story and produce a good paper but now...

31 January 2024 4,797 3 View

What are the conversion formula and related theory to calculate conductivity in metal oxides due to oxygen vacancy/interstitial position?

Like I want to calculate electron density, mobility and conductivity in n-type Tin oxide knowing the value of x in tin oxide chemical formula SnO(2-x), where Sn:O = 1:(2-x).

04 January 2024 8,279 1 View

What is the rheology behavior of soil?

How to determine and explanation of this as Geotechnical engineer

05 December 2023 4,475 2 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

How can i do multivariate Time Series forecast using MLP, ANFIS and LSTM?

I need the python code to forecast what crop production will be in the next decade considering climate and crop production variables as seen in the attached.csv file.

05 August 2024 2,977 3 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

Need help with my research project on open source SIEM and machine learning?

Hello everyone, I am currently working on a research project that aims to integrate machine learning techniques into an open source SIEM tool to automate the creation of security use cases from...

04 August 2024 3,196 2 View

Swimming/space travel depends on the proprioceptive muscle spindles?

When the entire neocortex is ablated in rodents, although they are still able to swim, all the limbs move continuously and asynchronously (Vanderwolf 2006; Vanderwolf et al. 1978). Normal animals...

03 August 2024 835 3 View

What are the limitations and challenges of using machine learning for predicting concrete compressive strength in practical applications?

Machine learning (ML) has shown great potential in predicting the compressive strength of concrete, an important property for structural engineering. However, its practical application comes with...

03 August 2024 2,546 2 View

I need the datasets of Microgrid for system identification?

Hi I am working on data driven model of the microgrid, for that, i need the reliable datasets for the identification of MG data driven Model. Thanks

02 August 2024 5,748 4 View

Some new emerging problems on application of RL for scheduling in IoT networks?

I have seen plenty of existing works on applied Reinforcement Learning (RL) policies for optimized scheduling in IoT networks including Q-learning, DQNs, and the newer ones including PPO for...

01 August 2024 8,754 2 View

Qamar Ul Islam

Dear Saurabh Malgaonkar

These articles might be helpful, have a look:

1. https://res.mdpi.com/d_attachment/genes/genes-11-00717/article_deploy/genes-11-00717-v2.pdf

2. https://towardsdatascience.com/understanding-dataset-shift-f2a5a262a766

Kind Regards

Muhammad Ali

Dear Saurabh Malgaonkar,

Related to your, statistical modelling based approaches are most effective and applicable to your project. One way is to use Bayesian parametric framework of Maximum Likelihood Estimation (MLE), however., working with Bayesian one of the noticeable hurdle is calculating the normalizing constant (otherwise the procedure is very straightforward). The second option is to consider non parametric clustering approach by considering von Misses Fisher distribution, Langevin distribution, Kent distribution or the Bingham distribution defined on the unit hypersphere. Also you treat these distributions by considering MCMC sampling techniques, or Gibs sampling techniques. See e.g., https://www.jmlr.org/papers/volume6/banerjee05a/banerjee05a.pdf etc....

Anirban Nandy

One way is to avoid such noisy variables. Variable selection process needs to be revisited. After taking distinct variables the features can be assessed to get robust results.

Saurabh Malgaonkar

Thanks Qamar, Muhammad and Anirban for providing key pointers towards the solution for the encountered research problem.

Rubaiath E Ulfath

You may try Gradient Boosted Algorithms like XgBoost. XgBoost has a robust capability of handling missing values, variations and confounding factors. Also, you might experiment by incorporating undersampling methods for noise removal and ensuring well distributed discriminating factors in your dataset.