I have a (big) dataset with evidence of autocorrelation. Now I'm using an CO AR1 model, how do I know this is right?

More Joost van Vlimmeren's questions See All

In a lineair Mixed model (spss) I would like to add in interaction effects, however due to a large number of combination, many become 'redundant' ?

Any alternatives / options to solve this? Because of individual interaction examinations i know that these interactions exist, but combining all interactions gets me in trouble. I have both...

03 April 2016 7,950 3 View

How can I prepare virus for a TEM or SEM imaging?

I have virus (viral hemorrhagic septicemia virus) in suspension and the experiment will not involve cells. What level of TCID50 is preferred?

11 August 2024 3,115 1 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Is it possible to use the Fused Deposition Modeling (FDM) to additively manufacture interconnected porous structure generation of >100-200 micrometer?

Usually, additive manufacturing techniques like SEBM, SLS, and SLM are used for interconnected porous lattice structure generation with sizes of >100–200 micrometers. Can the Fused Deposition...

09 August 2024 7,892 0 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How to define an anisotropic material with asymmetric elastic compliance/stiffness matrix in ANSYS APDL?

I need to model an anisotropic material in which the Poisson's ratio ν_12 ≠ ν_21 and so on. Therefore, the elastic compliance matrix wouldn't be a symmetric one. In ANSYS APDL, for TB,ANEL...

09 August 2024 5,048 2 View

Giorgio Carlo Cappello

First of all, if you have this strong proof it means that your design research match with method and tools, and if you are trying to find out others methods it could be incorrect because you didn't designed in advanced, and attempt tentative method and tool essentially are wrong approach, so why are you seeking something else not designed?

Joost van Vlimmeren

Thank you for your quick answer. I was seeking for alternatives because I wanted to make sure that this is the right method. I know there can be used different time lag lenghts for autocorrelation and there is also a possibility to use trend analysis i guess.

Nicola Mingotti

It is not fully clear to me what you are modelling, can you give more informations on the data you have? it seems interesting. Besides that:

-) in general, you can't state in advance that the model you chose is the best, you should try all alternative models, which is impossible.

-) If you know well what you are modelling you can often rule out models which make no sense

-) watch residuals ! If they are uncorrelated and homoskedastic you are on a good way

-) beware of R^2 !! If interested on predictions i suggest you score models (e.g.) looking at residuals of out of the sample forecasts.

Thanks for your interest in this subject.

I have data per day of the amount of people visiting shopping centers.

I can split the file by shopping centers and make models of each center, with the predictors: day of the week, month in the year, weather variables and economic variables.

I can see that in some centres the residuals are correlated because they show trends. This suggests that a lineair regression model cannot be used. Using a AR(1) model increases R-Square and doesnt violate the uncorrelated residual assumption, because all Durbin-watson values are close to 2,0.

Problem is actually, there can still be trends observed when plotting the residuals of the AR(1) model over time, so I guess I have non-stationary series. That is where ARIMA(p,d,q) (with d=1 for a lineair trend) comes in right? I'm not sure about this and how to use this correctly (using SPSS).

Thanx again!

So, you may try with something like this (simplified version) which adds to the linear model an autoregressive term.

y_i = b0 + b1*x_i + b_2*y_(i-1) + eps_i

The autoregressive term was already added, still i can see non-stationarity (so not just temporary fluctuations, but trends or no full recovery after a random shock), which are ignored with the autogregressive b_2*y_(i-1) term right?

yes, if you see non stationarity in residuals you must add something else to your model

Chenying Gao

Check the sample ACF and PCAF and determine your model by the lags out of 95% bound;

Fit the ARMA(p,q);

Test the residuals to se if it's white noise;

select different model and compare AICC.

Thank you Chenying,

i knew that's an option, but the real challenge is to make this for 100 different cases. I can do your method 100 times but that is very time-consuming and maybe there will be more cases in the future.

Patrizio Vanella

Hi Joost,

if I understand your problem correctly, your problem is the high number of shopping centers, which would be too much for modeling individually, right? You might use a Principal Component Analysis first. Transforming the 100 time series into their principal components, you could run your time series analysis based on the important principal components first. This way you very likely won't need to regress on 100 but rather on 5-15 time series.

Regarding the stochastic term, in my mind, graphical analysis tells you more than statistical tests. Look at the time series of the errors, usually you get an idea whether, they are non-stationary. The augmented Dickey-Fuller test also helps very much (although it is not perfect either for small sample size). If your time series is non-stationary, you should differentiate it.

The "best" orders of your ARMA(p,q) model cannot be derived easily. You might take a look at the ACF and the PACF, like Chenying said, but be careful with the interpretation! It takes a lot of experience to see this from the ACF and PACF, its not as easy. I would rather use them as an indication of the maximum values of p and q and from then on try different ps and ds. Better use one or more information criteria for your decision: AIC; BIC; HQC.