Authors: | Stamatatos E., Fakotakis N., Kokkinakis G. |
---|
Title: | Automatic Authorship Attribution |
---|
Conference: | 9th Conf. οf the European Chapter of the Association for Computational Linguistics (EACL’99) |
---|
Editors: | |
---|
Ed: | No |
---|
Eds: | No |
---|
Pages: | 158-164 |
---|
To appear: | No |
---|
Month: | |
---|
Year: | 1999 |
---|
Place: | |
---|
Pubisher: | |
---|
Link: | |
---|
File name: | |
---|
Abstract: | In this paper we present an approach to automatic authorship attribution dealing with real-world (or unrestricted) text. Our method is based on the computational analysis of the input text using a text-processing tool. Besides the style markers relevant to the output of this tool we also use analysis-dependent style markers, that is, measures that represent the way in which the text has been processed. No word frequency counts, nor other lexically-based measures are taken into account. We show that the proposed set of style markers is able to distinguish texts of various authors of a weekly newspaper using multiple regression. All the experiments we present were performed using real-world text downloaded from the World Wide Web. Our approach is easily trainable and fully-automated requiring no manual text preprocessing nor sampling. |