How to deal with zero values in renewable energy time series data (1990–2024) for some countries in my panel?

Hello Sir,

Here’s a summarizing how to handle the long sequences of zero renewable-energy values in your panel:

1. Keep the zeros—they are real information

Zeros in your Gulf Cooperation Council (GCC) countries before 2009 are not missing data but actual observations reflecting non-adoption of renewables. Removing or arbitrarily changing them would bias your results and distort the timing of renewable adoption . Many energy-economics studies on GCC countries explicitly retain zeros to capture adoption effects.

2. Approaches commonly recommended in the literature

a. Two-part (hurdle) or adoption–intensity models (Recommended)

Step 1: Model the probability of adoption Pr(RE>0)Pr(RE>0)Pr(RE>0) using a logit/probit with fixed effects.
Step 2: Conditional on RE>0, model the intensity of renewable consumption using FE-OLS, PPML, or dynamic specifications. This approach separates the decision to adopt renewables from the decision of how much to consume and is widely used for semi-continuous variables with many zeros .

b. PPML (Pseudo-Poisson Maximum Likelihood)

PPML handles zero values naturally, avoids bias from log-transforming zeros, and is robust under heteroskedasticity . If your specification is multiplicative (elasticities), PPML is a strong alternative to log(1+x).

c. Panel ARDL / PMG with structural breaks

If you need long-run/short-run dynamics, panel-ARDL can be applied on levels (not logs) to include the zero years, but you should include adoption dummies or structural-break terms for GCC countries starting around 2009 .

d. Restricting the sample (2009–2024) as robustness

Run an auxiliary regression restricted to post-adoption years as a robustness check, but do not rely solely on this shorter window—otherwise you discard meaningful variation and shrink your time dimension.

3. Why not simple Tobit or deleting zeros

Tobit treats zeros as censored rather than genuine non-use; this can be inappropriate when zeros are a distinct adoption state. Deleting zeros would bias results and misrepresent historical reality.

4. Practical workflow

Describe the data: Report the share and timing of zeros by country.

Benchmark: Estimate FE-OLS or PPML on the full 1990–2024 panel (levels, not logs).

Two-part model: Estimate adoption (probit) and intensity (FE-OLS or PPML).

Panel ARDL: Include adoption dummies or breaks; test long-run dynamics.

Robustness: Re-estimate on 2009–2024 and compare.

Report sensitivity: Clearly show how coefficients change across approaches.

5. Examples from published work

Hazmi (2024) used Panel-ARDL on GCC clean energy and growth, retaining zero values to reflect adoption timing .
Santos Silva & Tenreyro (2006) showed PPML handles zeros more reliably than log transformations .
Duan (1983) introduced two-part models for semi-continuous data—widely cited for situations like yours .
Reviews on zero-inflated and hurdle models for panel data also recommend modeling adoption separately .
GCC renewable adoption reviews emphasize that early zeros represent real policy lags, not missing data .

6. Recommendation

Retain the full 1990–2024 panel with the zero years. Use two-part or hurdle models (adoption + intensity) as your primary specification, compare with PPML and Panel-ARDL for robustness, and present a post-2009 subsample as a sensitivity check. Document adoption timing and justify the approach with the above references.

Sources1. Example review on GCC renewable adoption and late starts – International Journal of Energy Economics and Policy (2022). 2. Duan, N. (1983). Smearing estimate: A nonparametric retransformation method. Journal of the American Statistical Association. 3. Mullahy, J. (1998). Much ado about two: Reconsidering two-part models and econometric approaches for semi-continuous data. Journal of Health Economics. 4. Santos Silva, J.M.C. & Tenreyro, S. (2006). The Log of Gravity. The Review of Economics and Statistics. 5. Hazmi, A. (2024). Dynamic Interrelationships among Clean Energy and Economic Growth in GCC Countries: A Panel-ARDL Approach.

Can the limit of quantification (LOQ) of an analytical method fall outside its linear dynamic range, or must it always be within it?

Swerling Characteristic functions?

Radar Detection Probabilities?

Radar Detection Probabilities using beta distributed Scattering Cross section?

I have two problems: 1) the enzyme is not immobilizing efficiently into the MOF material.. 2) the MOF itself has peak on 400nm by using p-NPA test.?

Wireless insite3d convert path in to image ????

Wireless insite3d convert path in to image ????

EDS and mapping?

Determine pore diameter by BJH of TiO2?

I am looking for lower limb conventional physiotherapy protocol or guideline in children with hemiplegic cerebral palsy. please any help?

How to develop investments in renewable energy sources?

Explain theoretically and with the aid of an example the concept of equation linear and not linear in variables and parameters?

Determining the worth of a point improvement in Hamilton Depression Scale?

Could dyes amplify the spectrum of light to a specific wavelength?

What is the problem with these tissue culture plants?

Transfection in HEK293T cells?

Is artifacts in XPS possible to build high deviation in binding energy larger than 5 eV??

How to retain the a GFP tagged gene expression in stable cell line?

Is there a database with onshore wind production for the wind farms in Portugal?

HAs anyone used TGA to find activation enegy of water evaporation?