21 January 2021 4 5K Report

I have a dataset with an unusual configuration and was hoping for guidance on choosing a method to test for changes in mean.

For context, this is operational data tracking vehicle arrivals at 10 physical sites, and measures relative site volume using a ratio of daily arrivals in relation to the site's capacity. Variation among the 10 ratios is calculated for each day (coefficient of variation).

A program was started that changes the conditions under which vehicles determine which site they drive to (goal is to reduce variation in above described ratios). There have been three different changes in conditions, yet the time lengths they were implemented for were all different:

Baseline - 30 days (cannot be extended)

Phase 1 - 78 days

Phase 2 - 116 days

Phase 3 - 87 days

I'm being asked to determine whether there were significant changes in the mean variation during each phase compared to Baseline. Since I'm testing the same group (the entire vehicle/site system) under three different conditions, I believe three separate paired t-tests would be appropriate. However, I know the sample size must be identical for each pair. Generating a proportional random sample would still give me different sample sizes (obviously). My question is whether it's acceptable to choose a constant number of days to sample from each phase (e.g. 15 days of ratio variations) or if there would be a more appropriate test to use?

More Ryan S.'s questions See All
Similar questions and discussions