How to represent the aforementioned classification algorithms using their generalized objective functions? Are there any relevent literature available?
Your need to work the representtion of each task. For example, Naive Bayes you need to understand the Graphical probabilistic graphical model behind. Then, deducing the loss function from the MAP inference problem is straightforward. A well established reference in this manner is the Bishop book "Pattern Recognitin and machine learning"