I'm using target encoding in my work, and I'd like to understand why it's effective from a mathematical point of view.

Intuitively, my understanding is that it allows you to encode the past with the future. I can see why that's effective, and also why it could cause target leakage. However, I can't find a good mathematical explanation for its effectiveness/ issues.

Does anyone know the answer, or have a link to a resource they'd be willing to share?

More Connor Skelland's questions See All
Similar questions and discussions