The 264-word lists are included for historical reference; the 277-word lists are more recent and more comprehensive.

The original lists were sorted alphabetically. The clustered lists instead group the function words by grammatical type, which optimizes the attributes (in this case, for use in a decision tree classifier). See the paper for details.

Conference paper: Optimizing Features for Dialogue Act Classification
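
To illustrate why the ordering of the list can matter for a decision tree, here is a minimal sketch. It assumes each function word is encoded as its position in the list and every other token as 0; that encoding, the three tiny clusters, and the names `build_ids` and `encode` are assumptions made for illustration and are not taken from the paper or from the actual 264- or 277-word lists.

```python
# Hypothetical miniature word list grouped by grammatical type.
# The real lists are far longer; these entries are illustrative only.
CLUSTERED = [
    ("pronoun", ["i", "you", "we"]),
    ("auxiliary", ["can", "do", "will"]),
    ("wh_word", ["what", "where", "why"]),
]


def build_ids(clusters):
    """Assign consecutive integer IDs in list order, so that each
    grammatical type occupies one contiguous range of IDs."""
    ids = {}
    for _type, words in clusters:
        for word in words:
            ids[word] = len(ids) + 1
    return ids


def encode(utterance, ids):
    """Replace each function word with its ID and any other token with 0
    (an assumed encoding, for illustration)."""
    return [ids.get(token, 0) for token in utterance.lower().split()]


CLUSTERED_IDS = build_ids(CLUSTERED)

# The same words numbered alphabetically, as in the original lists.
ALPHA_IDS = {word: i + 1 for i, word in enumerate(sorted(CLUSTERED_IDS))}

WH_WORDS = {"what", "where", "why"}


def words_at_or_above(ids, threshold):
    """Words selected by a single split of the form `id >= threshold`."""
    return {word for word, i in ids.items() if i >= threshold}


# With clustered IDs, one threshold isolates exactly the wh-words.
clustered_split = words_at_or_above(CLUSTERED_IDS, min(CLUSTERED_IDS[w] for w in WH_WORDS))
# With alphabetical IDs, the same kind of split also captures other words.
alpha_split = words_at_or_above(ALPHA_IDS, min(ALPHA_IDS[w] for w in WH_WORDS))
```

In this toy setup, `clustered_split` contains only the wh-words, while `alpha_split` also picks up "will" and "you", so a tree working on alphabetical IDs would need extra splits to separate one grammatical type from the rest.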
