Can data-augmentation techniques be applied for numeric data-sets ? If so, please suggest some of the techniques?

More Suhail Ganiny's questions See All

How can we quantify protein in a natural biomaterial which is a blend of proteins, carbohydrates, etc?

I have used Bradford assay for protein quantification; however, I am getting inconsistent concentrations. Tried twice but similar inconsistency. Like the highest quantity of a biomaterial is...

11 July 2024 2,257 1 View

How can we quantify protein in a natural biomaterial which is a blend of proteins, carbohydrates, etc?

I have used Bradford assay for protein quantification; however, I am getting inconsistent concentrations for the second time. Like the highest quantity of a biomaterial is having lower...

11 July 2024 3,557 3 View

DAF-2DA prtocol and conditions?

Does anyone suggest experimental condition (dark all time?) for measuring NO production in endothelial cells using DAF-2? Thank you Hamid

14 February 2024 2,968 1 View

What is the photogenerated carrier density in time-resolved measurements?

In TCSPC (Time-correlated single photon counting) measurement, how do we know about the photogenerated carrier density?

14 July 2023 8,978 1 View

What will be reference speed and power of Wind Turbine Controller in Region 4?

Hello Researchers, I am working on wind Turbine Control in Region 4 i.e wind speed more than 25m/s. For Controller design what will be the reference speed and power in region 4 & less papers...

20 February 2023 3,817 0 View

Can someone help me to solve the error, whiling installing the wien2k code????

While installing wien2k code, the final error message i received is given in the image below. I need help in resolving the error. Thanking you in advance...

31 January 2023 9,023 7 View

Which Controller for PMSG Wind Turbine?

Hello Researchers, I want to design controller for PMSG Wind turbine that works satisfactorily while operating in High Speed region, probably more than 20m/s wind speed. Kindly suggest me...

21 December 2022 6,920 5 View

Installing wien2k On HPC?

Dear WIEN2k Users, I am facing a problem in installing the wien2k software on HPC. I will be humbled if someone suggests and helps?

15 May 2022 1,922 2 View

Does anyone uses the corrtest CS350 EIS Potentiostat /Galvanostat? How is its performance and compatibility?

How is the performance and compatibility of corrtest CS350 EIS Potentiostat /Galvanostat?

06 March 2022 1,654 0 View

Is PVA soluble in Toluene?

How to dissolve PVA in toluene?

06 January 2022 7,058 3 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

How can I prepare virus for a TEM or SEM imaging?

I have virus (viral hemorrhagic septicemia virus) in suspension and the experiment will not involve cells. What level of TCID50 is preferred?

11 August 2024 3,115 1 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

What is the difference between mathematical R^4 space and physical 4D unit space?

We assume that the difference is huge and that it is not possible to compare the two spaces. The R^4 mathematical space considers time as an external controller and the space itself is immobile in...

10 August 2024 6,678 14 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Is it possible to use the Fused Deposition Modeling (FDM) to additively manufacture interconnected porous structure generation of >100-200 micrometer?

Usually, additive manufacturing techniques like SEBM, SLS, and SLM are used for interconnected porous lattice structure generation with sizes of >100–200 micrometers. Can the Fused Deposition...

09 August 2024 7,892 0 View

Muhammad Ali

Dear Suhail,

I would like to provide you some useful links that you may be interested in : Article The Relationship Between Variable Selection and Data Agument...

https://stackoverflow.com/questions/39265746/data-augmentation-techniques-for-general-datasets/39272735

https://arxiv.org/pdf/1609.08764.pdf

https://bair.berkeley.edu/blog/2019/06/07/data_aug/

Cristian Ramos-Vera

I recomend

Article Bootstrap Methods

Article Bootstrap percolation

Article Analysis of cost data in randomized trials: an application o...

Article Bootstrap-DEA analysis of BRICS’ energy efficiency based on ...

Article Chapter 52 The Bootstrap

Article Bootstrapping clustered data

https://link.springer.com/content/pdf/10.1007/JHEP03(2014)100.pdf

https://amstat.tandfonline.com/doi/abs/10.1080/01621459.1994.10476768

https://projecteuclid.org/download/pdf_1/euclid.bj/1174324983

https://arxiv.org/ftp/arxiv/papers/1809/1809.04016.pdf

Ritika Lohiya

Augmenting the images is easier and simple as relationship between the pixels and label assignment can be maintained. Whereas, perturbing a dataset with categorical and numeric features can perturb the data sample into entirely different class. However, unsupervised machine learning algorithms can be used for randomly perturb features in each subset using the distribution’s mean and standard deviation as perturbation bounds.

Suhail Ganiny

Thanks for sharing the links Muhammad Ali and Cristian Ramos-Vera

Thank you Ritika Lohiya for the answer

Do you mean that the numeric data can be interpolated or extrapolated while keeping the mean and standard deviation within acceptable bounds?

Can you share a research publication or a link for the same?

Please refer the following link, this might help:

https://towardsdatascience.com/augmenting-categorical-datasets-with-synthetic-data-for-machine-learning-a25095d6d7c8

Ritika Lohiya the link is indeed helpful.

Muna Al-Hawawreh

Yes,

I already applied it on numerical dataset using varitional auto-encoder..

Here you can find the details and the python code...

Conference Paper Industrial Internet of Things Based Ransomware Detection usi...

Code Python code Deep Stacked Varitional Neural Network

Dear Muna Al-Hawawreh

Thanks for sharing your publication and the python code.

Xingjie Li

Yes, data-augmentation techniques are useful in the unbalanced-data area. Generative Adversarial Networks (GAN) can generate realistic data, which is beneficial to train the model.

Thanks for the answer Xingjie Li