Authors: | Sidorov G., Velasquez F., Stamatatos E., Gelbukh A., Chanona-Hernández L. |
---|
Title: | Syntactic Dependency-Based N-grams: More Evidence of Usefulness in Classification |
---|
Conference: | 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2013) |
---|
Editors: | |
---|
Ed: | No |
---|
Eds: | No |
---|
Pages: | 13-24 |
---|
To appear: | No |
---|
Month: | |
---|
Year: | 2013 |
---|
Place: | |
---|
Pubisher: | Springer LNCS |
---|
Link: | |
---|
File name: | |
---|
Abstract: | The paper introduces and discusses a concept of syntactic n-grams (sn-grams) that can be applied instead of traditional n-grams in many NLP tasks. Sn-grams are constructed by following paths in syntactic trees, so sngrams allow bringing syntactic knowledge into machine learning methods. Still, previous parsing is necessary for their construction. We applied sn-grams in the
task of authorship attribution for corpora of three and seven authors with very promising results. |