I have NYC taxi trips dataset contains multiple attributes like (pickups and dropoffs) coordinates, DateTime also, trip distance and so on, indexed by "tpep_pickup_datetime" coulmn as datetime64[ns] data type, I extracted some features from date time pickup and drop-off columns like month, day, hour and other. So I am focusing on DateTime and location to do trip time prediction.

Concerning the problem that faced me, it is the problem of converting the dataset into a fixed time series intervals (1 or 10 mins as an example to get 1440 time bins for each day) to be ready to LSTM input, let me reflect the essential point of the problem, I have tried to do a resample the dataset based on pickups-time, but the dataset contains more features, so it is difficult to convert it into a sequence time series with fixed interval lengths. Because the data contain a lot of trips at the same time (such as Sunday at 8:00-9:00 am approximate 2990 trip) but they are from different places.

So the main problem which is briefly: "How do I convert or prepare a taxi dataset to a time series with the fixed intervals?".

Thank you advance,

More Mohammed Fatih's questions See All
Similar questions and discussions