I found an equation that fits recidivism data in Utah. I would like to publish it. A colleague has presented a challenge--how do I know this is the only form (or model) that can fit the data? I will show you my solution and ask, if you have the time, can you find an alternative?
I assumed a slightly adjusted decay function. A first group "I" has a risk P of returning to prison each month. A second group "II" have zero risk. The percentage of zero-risk people, Z, is a constant--they stay out of prison. The people at risk is (F-Z)%. (100% is the initial sample size.) F is the percent remaining free at time t (in months). Then the decay rate looks like this:
dF/dt = -P (F-Z)
I allowed that P may vary linearly with time: P =K+Lt, and I excluded the first point (t=0) as an outlier.
A solution is F = (100-Z)ab exp (t+ct^2) +Z
The constants to fit the data: a = 1.0611, b = 0.91615, c = 0.00545, Z = 33.00
This is a good least-squares fit from t = 1 to t = 34 months.
The implications of the model are that:
a. People at risk to recidivate are intractable (as they were managed in the system).
b. A large number of people released to parole, about 33%, have essentially zero risk.
I believe this is a very important insight into the nature of recidivism, but only if the model is the CORRECT model. Is there a different one that fits the data? I am listing my data below.
Month % Free on Parole
0.0, 100.0
1.0, 97.8
2.0, 91.9
3.0, 87.7
4.0, 83.1
5.0, 79.0
6.0, 75.3
7.0, 70.5
8.0, 67.2
9.0, 63.75
10.0, 61.4
11.0, 58.3
12.0, 55.6
13.0, 53.1
14.0, 51.9
15.0, 50.4
16.0, 48.8
17.0, 47.0
18.0, 45.6
19.0, 44.4
20.0, 43.1
21.0, 42.6
22.0, 41.25
23.0, 40.1
24.0, 39.4
25.0, 38.6
26.0, 38.25
27.0, 37.8
28.0, 37.5
29.0, 36.9
30.0, 36.6
31.0, 35.9
32.0, 35.6
33.0, 35.3
34.0, 35.0