Συνέδριο

Συγγραφείς: Stamatatos E., Fakotakis N., Kokkinakis G.
Τίτλος: Automatic Extraction of Rules for Sentence Boundary Disambiguation
Συνέδριο: Workshop in Machine Learning in Human Language Technology, Advance Course on Artificial Intelligence (ACAI’99)
Editors:
Ed: Όχι
Eds: Όχι
Σελίδες: 88-92
Να εμφανιστεί: Όχι
Μήνας:
Έτος: 1999
Τόπος:
Εκδότης:
Δεσμός:
Όνομα αρχείου:
Περίληψη: Transformation-based learning (TBL) is the most important machine learning theory aiming at the automatic extraction of rules based on already tagged corpora. However, the application of this theory to a certain application without taking into account the features that characterize this application may cause problems regarding the training time cost as well as the accuracy of the extracted rules. In this paper we present a variation of the basic idea of the TBL and we apply it to the extraction of the sentence boundary disambiguation rules in real-world text, a prerequisite for the vast majority of the natural language processing applications. We show that our approach achieves considerably higher accuracy results and, moreover, requires minimal training time in comparison to the traditional TBL.