I am trying to understand how neural networks (NNs) can be used for multi-label classification, and I read the following paper: "Large-scale Multi-label Text Classification —
Revisiting Neural Networks" (http://arxiv.org/pdf/1312.5419v3.pdf). Would someone please help me understand the following?
I don't understand how the training examples are transformed for the NN. Did they use a one-hot vector with a 1 at the position of the relevant label?
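
To make the question concrete, here is a minimal sketch of the encoding I have in mind. The label names, the example texts, and the helper `encode_labels` are all made up for illustration, and I don't know whether this matches what the paper actually does. When an example has several relevant labels, the vector gets a 1 at each of their positions.

```python
# Hypothetical label vocabulary (not taken from the paper).
labels = ["sports", "politics", "economy", "science"]
label_index = {name: i for i, name in enumerate(labels)}


def encode_labels(relevant):
    """Return a 0/1 list with a 1 at the index of every relevant label."""
    target = [0] * len(labels)
    for name in relevant:
        target[label_index[name]] = 1
    return target


# Hypothetical training examples: (text, list of relevant labels).
examples = [
    ("Parliament debates the new budget", ["politics", "economy"]),
    ("Local team wins the cup", ["sports"]),
]

# One target vector per training example, in the same order.
targets = [encode_labels(relevant) for _, relevant in examples]

for (text, _), target in zip(examples, targets):
    print(target, text)
```

Is a target vector like this what gets fed to the output layer during training?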