Abstract
Translator Stylometry is a small but growing area of research in computational linguistics. Despite the research proliferation on the wider research field of authorship attribution using computational linguistics techniques, the translator stylometry problem is more challenging and there is no sufficient literature on the topic. Some authors even claimed that this problem does not have a solution; a claim we will challenge in this paper. We present an innovative set of translator stylometric features that can be used as signatures to detect and identify translators. The features are based on the concept of network motifs: small graph local substructures which have been used successfully in characterizing global network dynamics. The text is transformed into a network, where words become nodes and their adjacencies in a sentence are represented through links. Motifs of size 3 are then extracted from this network and their distribution is used as a signature for the corresponding translator.
We then investigate the impact of sample size, method of normalization and imbalance dataset on classification accuracy. We also adopt the Fuzzy Lattice Reasoning Classifier (FLR) among others, where FLR achieved the best performance with a classification accuracy reaching the 70% mark
We then investigate the impact of sample size, method of normalization and imbalance dataset on classification accuracy. We also adopt the Fuzzy Lattice Reasoning Classifier (FLR) among others, where FLR achieved the best performance with a classification accuracy reaching the 70% mark
Original language | English |
---|---|
Title of host publication | 2011 IEEE International Conference on Fuzzy Systems |
Editors | Shyi Ming Chen |
Place of Publication | Taipei, Taiwan |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 2039-2045 |
Number of pages | 7 |
Volume | 1 |
ISBN (Print) | 9781424473151 |
DOIs | |
Publication status | Published - 2011 |
Event | IEEE International Conference on Fuzzy Systems - Taipei, Taiwan, Province of China Duration: 1 Jan 2011 → 30 Jun 2011 |
Conference
Conference | IEEE International Conference on Fuzzy Systems |
---|---|
Abbreviated title | FUZZ-IEEE |
Country/Territory | Taiwan, Province of China |
City | Taipei |
Period | 1/01/11 → 30/06/11 |