10 October 2018 3 10K Report

I am trying to create a logistics dummy dataset for doing some analysis and possible predictions on the data.

Assumed variables are as follows: VARIABLES RANGES awb random number eg:235533 destination_city random cities product different products product_category different categories origin_city random metro cities logistics_provider_id id's eg:1,20,28,27 dispatch_date datetime between mar01-2015 to mar15-2015 final_delivery_status created,delivered,returned actual_delivery_date datetime between mar16-2015 to mar30-2015 promised_delivery_date datetime between mar25-2015 to Apr6-2015

So, from the above variables assumed i want to create the dummy data with in the ranges mentioned. How can i create the dummy data using python

Expected output: example_dummy_data: awb destination_city product product_category 1 104842891 Byatarayanapura Wrangler Denim Jeans Men's Clothing 2 104842938 Bareilly Sky Blue Denim Men's Clothing 3 104842942 Saharanpur puma shoes Men's Footwear 4 104842943 Saharanpur classic puma shoes Men's Foorwear 5 104843066 Mumbai Elegant black belt Fashion Accessories origin_city log_prov_id dispatch date final_del_status 1 Gurgaon 18 2014-09-02 00:26:11 DEL 2 Bangalore 19 2014-09-01 23:34:30 RTN 3 New Delhi 18 2014-09-01 18:59:41 RTC 4 New Delhi 15 2014-09-02 00:05:33 DEL 5 Hyderabad 16 2014-09-01 22:09:14 UDL Actual_del_date promised_del_date 1 2014-09-03 00:00:00 2014-09-05 20:00:00 2 2014-09-04 00:00:00 2014-09-06 20:00:00 3 2014-09-04 00:00:00 2014-09-06 20:00:00 4 2014-09-04 00:00:00 2014-09-07 20:00:00 5 2014-09-02 00:00:00 2014-09-06 20:00:00

I want to create the data with 10000 rows like above,Is there any best way to create in the ranges mentioned above

More Huda da's questions See All
Similar questions and discussions