The chatbot has been implemented using deep learning techniques. can anyone please suggest an efficient and effective way to measure the performance so that it can be compared with other implementations that has been done in the past. In some researches i saw that many researchers used Bleu and perplexity for the same and some researchers just gave the accuracy in term of percentage.

Please let me know if you have any question so that we can make this discussion fruitful.

More Sumit Singh Chauhan's questions See All
Similar questions and discussions