Please consider the sample dataset below.
In simple terms,
An sensor is defective and hence measurement has incorrect values since 2000 and we have the data for 10 years with both: measured and actual.
Now, we want to have the actual values from the algorithm.
We tried, XGBoost and CatBoost by creating another column named diff = measured- actual
and fed to the algorithm to identify the pattern. but not sure which algorithm is appropriate although suspecting **Neural network or Time series (ARIMA)** could work but not sure
because we have just 10 years data on monthly level
```
library(tidyverse)
train_data