I am conducting an econometric study on the impact of renewable and non-renewable energy consumption on green economic growth in 7 selected countries during the period 1990–2024.
The issue is that, for 3 Arab Gulf countries (UAE, Saudi Arabia, Qatar), the renewable energy consumption values are zero from 1990 to 2008, while positive values appear only from 2009 onwards. For the other 4 countries in my sample, renewable energy data is available for the entire period without this problem.
My supervisor suggests that such long sequences of zero values cannot be included in the model, but I believe they reflect the historical reality (since renewable energy was not in use during that period).
My questions are:
Should I keep the zero values and estimate the model for the full period (1990–2024), even if the coefficients for renewable energy may turn out insignificant?
Or should I restrict the analysis to the shorter period (2009–2024) for the Gulf countries?
Are there recommended econometric approaches to handle this kind of data structure, such as log(1+x) transformations, dynamic panel ARDL, or even zero-inflated models?
👉 Important: I am looking for answers with evidence, references, or examples from published studies that have faced a similar issue (e.g., renewable energy adoption in GCC countries or other regions where the series started late and had long zero periods).
Any suggestions with proper references would be greatly appreciated.