How to deal with zero values in renewable energy time series data (1990–2024) for some countries in my panel?

23 September 2025 2 6K Report

I am conducting an econometric study on the impact of renewable and non-renewable energy consumption on green economic growth in 7 selected countries during the period 1990–2024.

The challenge is that, for 3 Arab Gulf countries (UAE, Saudi Arabia, Qatar), renewable energy consumption values are zero from 1990 to 2008, while positive values appear only from 2009 onwards. For the other 4 countries in my sample, renewable energy data is available for the full period without this issue.

My supervisor suggests that such long sequences of zero values should not be included in the model. However, I believe they reflect the historical reality, since renewable energy was not in use during that period.

I would greatly appreciate your advice on the following questions:

Should I keep the zero values and estimate the model for the full period (1990–2024), even if the coefficients for renewable energy may turn out insignificant?

Or should I restrict the analysis to the shorter period (2009–2024) for the Gulf countries?

Are there recommended econometric approaches to handle this type of data structure, such as log(1+x) transformations, dynamic panel ARDL, or even zero-inflated models?

👉 Important: I am particularly looking for answers supported with evidence, references, or examples from published studies that have dealt with similar cases (for instance, renewable energy adoption in GCC countries or other regions where renewable energy data starts late and shows long zero periods).

References I have found so far include:

Chen & Roth (2023), Logs with zeros? Some problems and solutions (arXiv).

Bellemare & Wichman (2019), Elasticities and the inverse hyperbolic sine transformation.

Silva & Tenreyro (2006), The log of gravity, Review of Economics and Statistics.

Any suggestions with proper references would be highly valuable for my work.

Thank you very much for your time and guidance.

Mohamed Abdelghani Benziada

Hello Sir,

Here’s a summarizing how to handle the long sequences of zero renewable-energy values in your panel:

1. Keep the zeros—they are real information

Zeros in your Gulf Cooperation Council (GCC) countries before 2009 are not missing data but actual observations reflecting non-adoption of renewables. Removing or arbitrarily changing them would bias your results and distort the timing of renewable adoption . Many energy-economics studies on GCC countries explicitly retain zeros to capture adoption effects.

2. Approaches commonly recommended in the literature

a. Two-part (hurdle) or adoption–intensity models (Recommended)

Step 1: Model the probability of adoption Pr(RE>0)Pr(RE>0)Pr(RE>0) using a logit/probit with fixed effects.
Step 2: Conditional on RE>0, model the intensity of renewable consumption using FE-OLS, PPML, or dynamic specifications. This approach separates the decision to adopt renewables from the decision of how much to consume and is widely used for semi-continuous variables with many zeros .

b. PPML (Pseudo-Poisson Maximum Likelihood)

PPML handles zero values naturally, avoids bias from log-transforming zeros, and is robust under heteroskedasticity . If your specification is multiplicative (elasticities), PPML is a strong alternative to log(1+x).

c. Panel ARDL / PMG with structural breaks

If you need long-run/short-run dynamics, panel-ARDL can be applied on levels (not logs) to include the zero years, but you should include adoption dummies or structural-break terms for GCC countries starting around 2009 .

d. Restricting the sample (2009–2024) as robustness

Run an auxiliary regression restricted to post-adoption years as a robustness check, but do not rely solely on this shorter window—otherwise you discard meaningful variation and shrink your time dimension.

3. Why not simple Tobit or deleting zeros

Tobit treats zeros as censored rather than genuine non-use; this can be inappropriate when zeros are a distinct adoption state. Deleting zeros would bias results and misrepresent historical reality.

4. Practical workflow

Describe the data: Report the share and timing of zeros by country.

Benchmark: Estimate FE-OLS or PPML on the full 1990–2024 panel (levels, not logs).

Two-part model: Estimate adoption (probit) and intensity (FE-OLS or PPML).

Panel ARDL: Include adoption dummies or breaks; test long-run dynamics.

Robustness: Re-estimate on 2009–2024 and compare.

Report sensitivity: Clearly show how coefficients change across approaches.

5. Examples from published work

Hazmi (2024) used Panel-ARDL on GCC clean energy and growth, retaining zero values to reflect adoption timing .
Santos Silva & Tenreyro (2006) showed PPML handles zeros more reliably than log transformations .
Duan (1983) introduced two-part models for semi-continuous data—widely cited for situations like yours .
Reviews on zero-inflated and hurdle models for panel data also recommend modeling adoption separately .
GCC renewable adoption reviews emphasize that early zeros represent real policy lags, not missing data .

6. Recommendation

Retain the full 1990–2024 panel with the zero years. Use two-part or hurdle models (adoption + intensity) as your primary specification, compare with PPML and Panel-ARDL for robustness, and present a post-2009 subsample as a sensitivity check. Document adoption timing and justify the approach with the above references.

Sources1. Example review on GCC renewable adoption and late starts – International Journal of Energy Economics and Policy (2022). 2. Duan, N. (1983). Smearing estimate: A nonparametric retransformation method. Journal of the American Statistical Association. 3. Mullahy, J. (1998). Much ado about two: Reconsidering two-part models and econometric approaches for semi-continuous data. Journal of Health Economics. 4. Santos Silva, J.M.C. & Tenreyro, S. (2006). The Log of Gravity. The Review of Economics and Statistics. 5. Hazmi, A. (2024). Dynamic Interrelationships among Clean Energy and Economic Growth in GCC Countries: A Panel-ARDL Approach.

Fariba Abbasi

When dealing with zero values in renewable energy time series data (1990–2024) for a country panel, there are several approaches depending on the research objective and data characteristics:

Keep zeros as meaningful information If a zero truly reflects no production or consumption of renewable energy in that country/year, it is best to keep it. This itself carries important analytical insights (e.g., the adoption of renewables starting only in recent years).

Data transformation (for models sensitive to zeros) In log-based analyses, zeros cause errors. A common practice is to apply log(x+1) or use the inverse hyperbolic sine (IHS) transformation. Both handle zeros while preserving the benefits of log scaling.

Imputation or interpolation If zeros occur because of missing data or reporting errors (not actual values), they can be replaced using interpolation, moving averages, or values from adjacent years.

Separate analysis of countries or periods Countries with a high share of zeros can be analyzed separately, or you may restrict the study period to years with valid, nonzero data.

Unbalanced panel modeling If zeros stand for missing values, the dataset becomes an unbalanced panel, which many panel data models can handle.

In short, the choice depends on whether zeros are real or due to missing data.

If real → keep them, possibly using transformations like log(x+1).
If missing → apply appropriate imputation methods.

Can the limit of quantification (LOQ) of an analytical method fall outside its linear dynamic range, or must it always be within it?

Swerling Characteristic functions?

Radar Detection Probabilities?

Radar Detection Probabilities using beta distributed Scattering Cross section?

I have two problems: 1) the enzyme is not immobilizing efficiently into the MOF material.. 2) the MOF itself has peak on 400nm by using p-NPA test.?

Wireless insite3d convert path in to image ????

Wireless insite3d convert path in to image ????

EDS and mapping?

Determine pore diameter by BJH of TiO2?

I am looking for lower limb conventional physiotherapy protocol or guideline in children with hemiplegic cerebral palsy. please any help?

How to develop investments in renewable energy sources?

Explain theoretically and with the aid of an example the concept of equation linear and not linear in variables and parameters?

Determining the worth of a point improvement in Hamilton Depression Scale?

Could dyes amplify the spectrum of light to a specific wavelength?

What is the problem with these tissue culture plants?

Transfection in HEK293T cells?

Is artifacts in XPS possible to build high deviation in binding energy larger than 5 eV??

How to retain the a GFP tagged gene expression in stable cell line?

Can a photocatalytic degradation of methylene blue from red mud be pseudo- zero order kinetics?

Is there a database with onshore wind production for the wind farms in Portugal?