Could someone recommend a good technical reference/books for building HR Attrition data, like sampling design?

I have raw and very dirty historical data of employment and other profile information and I wanted to develop a model using Survival Analysis. But I don't know 1.) how to build the data ready for it, and 2.) how to define "attrition" or "attrited or resigned" employee.

For example, I have 5 jobs in the span of 10 years, what record shall I include in my development sample? Shall I first select a window, say, hiring date from 2001 to 2005, and if I will observe that particular account from the hiring date for, say, "2 years", and saw that it resigned, then it will tagged as "attrited/resigned or 1 " otherwise 0? Is there a way I could established that "2 years" observation window?

Similar questions and discussions