Synthetic data are commonly generated in order to validate mathematical models, by comparing the behavior of the real data with that of the data generated through the model.
Practically, imagine you want to generate a synthetic time series in MATLAB for a certain Gaussian process, with a certain length. The first step is to find the parameters of the Normal distribution that fit your process (in MATLAB: s = fitdist(x,'Normal'), where x contains your real data). Second step: with the two fitted parameters (mean and standard deviation), form the inverse of the Normal cumulative distribution function. You can then feed this inverse CDF with uniform random numbers between 0 and 1 (in MATLAB: u = rand(n,1), where n is the length of your synthetic time series; note that rand(n) would return an n-by-n matrix, not a vector). The output, syn = icdf(s,u), is a synthetic time series of length n that follows the same probability distribution as the real process you chose.
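The same inverse-transform recipe can be sketched in Python (a hedged equivalent of the MATLAB steps above, using scipy's norm in place of fitdist/icdf; the "real" data here are simulated as a stand-in):

```python
# Inverse-transform sampling: fit a Normal to real data, then push uniform
# random numbers through the inverse CDF (percent-point function).
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Stand-in for the real data x (assumed roughly Gaussian).
x = rng.normal(loc=5.0, scale=2.0, size=1000)

# Step 1: fit the Normal distribution (analogue of MATLAB's fitdist).
mu, sigma = stats.norm.fit(x)

# Step 2: uniform random numbers in (0, 1), one per synthetic sample.
n = 500
u = rng.random(n)

# Step 3: the inverse CDF maps uniforms to Normal(mu, sigma) samples.
syn = stats.norm.ppf(u, loc=mu, scale=sigma)

print(len(syn), round(float(np.mean(syn)), 1))
```

Any continuous distribution with an invertible CDF can be plugged into the same three steps; only the fit and ppf calls change.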
Suppose you have a biological or physical system for which you know the (range of) parameters governing its dynamics. By varying the inputs or perturbations to such a system, you can simulate the expected output; this output is what is often quantified as "measured variables". By varying the inputs and parameters and simulating the behavior of the system under various situations, you generate various outputs (measurements). These measurements are, in principle, what is referred to as synthetic data. I hope this helps, for a start :).
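As a toy illustration of that idea (the dynamics and numbers below are assumptions, not from the answer): take a first-order system dx/dt = -k*x + u with known rate k, vary the input u, simulate, and add sensor noise to obtain synthetic "measurements":

```python
# Generate synthetic measurements by simulating a known system under
# varying inputs, then corrupting the output with measurement noise.
import numpy as np

rng = np.random.default_rng(1)

def simulate(k, u, x0=0.0, dt=0.01, steps=1000, noise_std=0.05):
    """Euler-integrate dx/dt = -k*x + u and return noisy measurements."""
    x = x0
    measurements = []
    for _ in range(steps):
        x += dt * (-k * x + u)                               # system dynamics
        measurements.append(x + rng.normal(0.0, noise_std))  # sensor noise
    return np.array(measurements)

# Vary the input (perturbation) to generate several synthetic datasets.
synthetic_runs = {u: simulate(k=2.0, u=u) for u in (0.5, 1.0, 2.0)}

# The noiseless steady state is u/k; the synthetic measurements hover there.
for u, y in synthetic_runs.items():
    print(u, round(float(y[-100:].mean()), 2))
```

Each run plays the role of one "experiment"; sweeping k and u over their known ranges yields a whole family of synthetic datasets.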
This topic is related to digital signal processing, where real signals are simulated as a pure signal plus noise. In general, the approach is to reproduce realistic conditions in order to produce "simulated" or "synthetic" phenomenon behavior, and then study the real phenomenon, treating the initial conditions as variables first in the time domain and later in the frequency domain.
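A minimal sketch of "pure signal + noise" in both domains (the sampling rate, frequency, and noise level are arbitrary choices for illustration): synthesize a sinusoid, add Gaussian noise, and check that the dominant frequency still stands out in the spectrum:

```python
# Synthetic signal = pure sinusoid + additive Gaussian noise,
# inspected in the time domain and the frequency domain (FFT).
import numpy as np

rng = np.random.default_rng(2)

fs = 1000                     # sampling rate, Hz
t = np.arange(0, 1, 1 / fs)   # 1 second of samples
f0 = 50                       # "true" signal frequency, Hz

pure = np.sin(2 * np.pi * f0 * t)          # pure signal
noise = rng.normal(0.0, 0.5, size=t.size)  # additive noise
synthetic = pure + noise                   # simulated measurement

# Frequency domain: the peak of the magnitude spectrum recovers f0.
spectrum = np.abs(np.fft.rfft(synthetic))
freqs = np.fft.rfftfreq(t.size, 1 / fs)
peak = freqs[np.argmax(spectrum[1:]) + 1]  # skip the DC bin
print(peak)
```

The noise spreads its energy across all frequency bins while the sinusoid concentrates its energy in one, which is why the peak survives even at a fairly low signal-to-noise ratio.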
Can you please send a link to explore this further? How can we apply it in the time domain and frequency domain? I had a signal and transformed it to the frequency domain as well. I have attributes that affect the signal, and some of these attributes are impractical to determine in real time. Is there anything that can let me generate synthetic data in such a case?
There is a book whose title is something like Statistical Analysis with Missing Data, in which they generate synthetic data as part of the algorithms. The E-M method is also a good source for this style of thinking ("if you don't have enough data, make some up, but be careful about its properties, and study the sensitivity of the result to changes in those properties").
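A toy sketch of that E-M style of "making data up carefully" (all numbers are illustrative assumptions): estimate the mean and variance of a Gaussian when a quarter of the entries are missing, by alternately filling in expected values (E-step) and re-estimating the parameters (M-step):

```python
# EM for a univariate Gaussian with values missing at random: missing
# entries are imputed by their expected value and expected spread under
# the current parameters, then the parameters are re-estimated.
import numpy as np

rng = np.random.default_rng(3)

data = rng.normal(10.0, 3.0, size=200)
mask = rng.random(200) < 0.25          # ~25% of entries missing at random
observed = data[~mask]
n, n_miss = data.size, int(mask.sum())

mu, var = 0.0, 1.0                     # deliberately bad starting guesses
for _ in range(100):
    # E-step "makes up" each missing value as mu, with variance var around
    # it; M-step re-estimates from observed + imputed sufficient statistics.
    mu = (observed.sum() + n_miss * mu) / n
    var = (((observed - mu) ** 2).sum() + n_miss * var) / n

print(round(mu, 1), round(var, 1))
```

For data missing completely at random, the iteration converges to the observed-data maximum-likelihood estimates, which is the "study the sensitivity" sanity check: the made-up values end up contributing no spurious information.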
There is a difference between synthetic and simulated data, at least in the survey-related literature; I do not know whether this terminological distinction is common in all fields of statistics. Imagine that you have a confidential dataset and want to make a synthetic version public. You fit a model to the data, estimate its parameters, and re-generate the dataset from those parameters. A more elaborate technique consists in generating many such datasets. This technique, as mentioned by Christopher Landouer, was introduced by D. Rubin and was seen as an application of multiple imputation to tackle the issue of confidentiality.
Simulated data, for experts in synthetic data, simply designates data that was generated without paying attention to the real data.
A recent reference with a relevant bibliography: Jörg Drechsler, Synthetic Datasets for Statistical Disclosure Control, Springer (http://www.springer.com/la/book/9781461403258).