A computational linguistic approach for the identification of translator stylometry using Arabic-English text

Heba El-Fiqi, Eleni Petraki, Hussein Aly Abbass

    Research output: A Conference proceeding or a Chapter in BookConference contributionpeer-review

    10 Citations (Scopus)
    6 Downloads (Pure)

    Abstract

    Translator Stylometry is a small but growing area of research in computational linguistics. Despite the research proliferation on the wider research field of authorship attribution using computational linguistics techniques, the translator stylometry problem is more challenging and there is no sufficient literature on the topic. Some authors even claimed that this problem does not have a solution; a claim we will challenge in this paper. We present an innovative set of translator stylometric features that can be used as signatures to detect and identify translators. The features are based on the concept of network motifs: small graph local substructures which have been used successfully in characterizing global network dynamics. The text is transformed into a network, where words become nodes and their adjacencies in a sentence are represented through links. Motifs of size 3 are then extracted from this network and their distribution is used as a signature for the corresponding translator.

    We then investigate the impact of sample size, method of normalization and imbalance dataset on classification accuracy. We also adopt the Fuzzy Lattice Reasoning Classifier (FLR) among others, where FLR achieved the best performance with a classification accuracy reaching the 70% mark
    Original languageEnglish
    Title of host publication2011 IEEE International Conference on Fuzzy Systems
    EditorsShyi Ming Chen
    Place of PublicationTaipei, Taiwan
    PublisherIEEE, Institute of Electrical and Electronics Engineers
    Pages2039-2045
    Number of pages7
    Volume1
    ISBN (Print)9781424473151
    DOIs
    Publication statusPublished - 2011
    EventIEEE International Conference on Fuzzy Systems - Taipei, Taiwan, Province of China
    Duration: 1 Jan 201130 Jun 2011

    Conference

    ConferenceIEEE International Conference on Fuzzy Systems
    Abbreviated titleFUZZ-IEEE
    Country/TerritoryTaiwan, Province of China
    CityTaipei
    Period1/01/1130/06/11

    Fingerprint

    Dive into the research topics of 'A computational linguistic approach for the identification of translator stylometry using Arabic-English text'. Together they form a unique fingerprint.

    Cite this