I am trying to create a dummy logistics dataset to use for analysis and, possibly, predictions.
The assumed variables are as follows:

    VARIABLES               RANGES
    awb                     random number, e.g. 235533
    destination_city        random cities
    product                 different products
    product_category        different categories
    origin_city             random metro cities
    logistics_provider_id   IDs, e.g. 1, 20, 28, 27
    dispatch_date           datetime between Mar 01 2015 and Mar 15 2015
    final_delivery_status   created, delivered, returned
    actual_delivery_date    datetime between Mar 16 2015 and Mar 30 2015
    promised_delivery_date  datetime between Mar 25 2015 and Apr 06 2015
Given the variables above, I want to generate dummy data within the ranges mentioned. How can I create this dummy data using Python?
Expected output:

example_dummy_data:

       awb        destination_city  product               product_category
    1  104842891  Byatarayanapura   Wrangler Denim Jeans  Men's Clothing
    2  104842938  Bareilly          Sky Blue Denim        Men's Clothing
    3  104842942  Saharanpur        puma shoes            Men's Footwear
    4  104842943  Saharanpur        classic puma shoes    Men's Footwear
    5  104843066  Mumbai            Elegant black belt    Fashion Accessories

       origin_city  log_prov_id  dispatch date        final_del_status
    1  Gurgaon      18           2014-09-02 00:26:11  DEL
    2  Bangalore    19           2014-09-01 23:34:30  RTN
    3  New Delhi    18           2014-09-01 18:59:41  RTC
    4  New Delhi    15           2014-09-02 00:05:33  DEL
    5  Hyderabad    16           2014-09-01 22:09:14  UDL

       Actual_del_date      promised_del_date
    1  2014-09-03 00:00:00  2014-09-05 20:00:00
    2  2014-09-04 00:00:00  2014-09-06 20:00:00
    3  2014-09-04 00:00:00  2014-09-06 20:00:00
    4  2014-09-04 00:00:00  2014-09-07 20:00:00
    5  2014-09-02 00:00:00  2014-09-06 20:00:00
I want to create 10,000 rows of data like the above. What is the best way to generate it within the ranges mentioned?
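
To make the goal concrete, here is a minimal sketch of one possible approach using only the Python standard library. The value pools (`DESTINATION_CITIES`, `ORIGIN_CITIES`, `PRODUCTS`), the helper names `random_datetime` and `make_rows`, the 9-digit AWB range, and the choice to treat each end date as inclusive through 23:59:59 are all assumptions made for illustration. Each column is sampled independently, so this does not model any relationship between the status and the dates.

```python
import random
from datetime import datetime, timedelta

# Hypothetical value pools; replace these with your own lists.
DESTINATION_CITIES = ["Byatarayanapura", "Bareilly", "Saharanpur", "Mumbai"]
ORIGIN_CITIES = ["Gurgaon", "Bangalore", "New Delhi", "Hyderabad"]
# Each product is paired with its category so the two columns stay consistent.
PRODUCTS = [
    ("Wrangler Denim Jeans", "Men's Clothing"),
    ("Sky Blue Denim", "Men's Clothing"),
    ("puma shoes", "Men's Footwear"),
    ("Elegant black belt", "Fashion Accessories"),
]
PROVIDER_IDS = [1, 20, 27, 28]
STATUSES = ["created", "delivered", "returned"]

# Date ranges from the table above (assumption: end dates are inclusive).
DISPATCH_RANGE = (datetime(2015, 3, 1), datetime(2015, 3, 15, 23, 59, 59))
ACTUAL_RANGE = (datetime(2015, 3, 16), datetime(2015, 3, 30, 23, 59, 59))
PROMISED_RANGE = (datetime(2015, 3, 25), datetime(2015, 4, 6, 23, 59, 59))


def random_datetime(rng, start, end):
    """Return a random datetime in [start, end] at one-second resolution."""
    span_seconds = int((end - start).total_seconds())
    return start + timedelta(seconds=rng.randint(0, span_seconds))


def make_rows(n, seed=0):
    """Build n dummy shipment records as a list of dicts."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    # Draw n distinct 9-digit AWB numbers (the digit count is an assumption).
    awbs = rng.sample(range(100_000_000, 1_000_000_000), n)
    rows = []
    for awb in awbs:
        product, category = rng.choice(PRODUCTS)
        rows.append({
            "awb": awb,
            "destination_city": rng.choice(DESTINATION_CITIES),
            "product": product,
            "product_category": category,
            "origin_city": rng.choice(ORIGIN_CITIES),
            "logistics_provider_id": rng.choice(PROVIDER_IDS),
            "dispatch_date": random_datetime(rng, *DISPATCH_RANGE),
            "final_delivery_status": rng.choice(STATUSES),
            "actual_delivery_date": random_datetime(rng, *ACTUAL_RANGE),
            "promised_delivery_date": random_datetime(rng, *PROMISED_RANGE),
        })
    return rows


rows = make_rows(10_000)
print(rows[0])
```

If pandas is available, passing the list to `pandas.DataFrame(rows)` should give a table shaped like the expected output above.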