Automated categorisation of patent claims that reference human genome sequences

Donglu Wang, Gabriela Ferraro, Hanna Suominen, Osmat A. Jefferson

Research output: A Conference proceeding or a Chapter in Book › Conference contribution

1 Citation (Scopus)

Abstract

Debates on gene patents have necessitated the analysis of patents that disclose and reference human sequences. In this study, we built an automated classifier that assigns sequences to one of nine predefined categories according to their functional roles in patent claims by applying natural language processing and supervised learning techniques. To improve its correctness, we experimented with various feature mappings, resulting in the maximal accuracy of 79%.
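The abstract describes the pipeline only in general terms: claim text is mapped to features and a supervised model assigns one of nine categories. The following is a minimal sketch of that general idea, not the authors' method. It assumes a bag-of-words feature mapping and a multinomial naive Bayes classifier, neither of which is named in the abstract. The two category labels (`composition`, `method`) and the toy claims are hypothetical and invented purely for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a claim and split it into word unigrams (one possible feature mapping)."""
    return re.findall(r"[a-z0-9]+", text.lower())


class NaiveBayesClaimClassifier:
    """Multinomial naive Bayes with add-one smoothing over bag-of-words counts."""

    def fit(self, claims, labels):
        self.class_counts = Counter(labels)       # number of training claims per category
        self.word_counts = defaultdict(Counter)   # token counts per category
        self.vocab = set()
        for claim, label in zip(claims, labels):
            tokens = tokenize(claim)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, claim):
        tokens = tokenize(claim)
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.class_counts.items():
            # log prior + sum of smoothed log likelihoods
            score = math.log(count / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for tok in tokens:
                score += math.log((self.word_counts[label][tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, claims, labels):
    """Fraction of claims whose predicted category matches the given label."""
    correct = sum(model.predict(c) == y for c, y in zip(claims, labels))
    return correct / len(labels)


# Hypothetical toy data: invented claims and only two invented categories,
# whereas the study uses nine predefined categories.
TRAIN = [
    ("An isolated nucleic acid comprising the sequence of SEQ ID NO: 1", "composition"),
    ("An isolated polypeptide encoded by SEQ ID NO: 2", "composition"),
    ("A method of detecting a mutation using a probe of SEQ ID NO: 3", "method"),
    ("A method of amplifying DNA using a primer of SEQ ID NO: 4", "method"),
]

model = NaiveBayesClaimClassifier().fit(*zip(*TRAIN))
print(model.predict("A method of diagnosing disease using a probe of SEQ ID NO: 5"))
```

Swapping `tokenize` for a different function (for example, one that emits word bigrams) is one way to experiment with alternative feature mappings, and `accuracy` on a held-out set would be the corresponding evaluation measure.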

Original language: English
Title of host publication: ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium
Editors: J Shane Culpepper, Laurence Park, Guido Zuccon
Place of Publication: Melbourne
Publisher: Association for Computing Machinery (ACM)
Pages: 117-120
Number of pages: 4
ISBN (Electronic): 9781450330008
ISBN (Print): 9781450330008
DOIs: https://doi.org/10.1145/2682862.2682872
Publication status: Published - 26 Nov 2014
Event: 19th Australasian Document Computing Symposium, ADCS 2014 - Melbourne, Australia
Duration: 27 Nov 2014 to 28 Nov 2014

Publication series

Name: ACM International Conference Proceeding Series
Volume: 27-28-November-2014

Conference

Conference: 19th Australasian Document Computing Symposium, ADCS 2014
Country: Australia
City: Melbourne
Period: 27/11/14 to 28/11/14

Fingerprint

Supervised learning
Classifiers
Genes
Processing

Cite this

Wang, D., Ferraro, G., Suominen, H., & Jefferson, O. A. (2014). Automated categorisation of patent claims that reference human genome sequences. In J. S. Culpepper, L. Park, & G. Zuccon (Eds.), ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium (pp. 117-120). (ACM International Conference Proceeding Series; Vol. 27-28-November-2014). Melbourne: Association for Computing Machinery (ACM). https://doi.org/10.1145/2682862.2682872
Wang, Donglu ; Ferraro, Gabriela ; Suominen, Hanna ; Jefferson, Osmat A. / Automated categorisation of patent claims that reference human genome sequences. ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium. editor / J Shane Culpepper ; Laurence Park ; Guido Zuccon. Melbourne : Association for Computing Machinery (ACM), 2014. pp. 117-120 (ACM International Conference Proceeding Series).
@inproceedings{e0240418b9a1432ebe6b556198848b98,
title = "Automated categorisation of patent claims that reference human genome sequences",
abstract = "Debates on gene patents have necessitated the analysis of patents that disclose and reference human sequences. In this study, we built an automated classifier that assigns sequences to one of nine predefined categories according to their functional roles in patent claims by applying natural language processing and supervised learning techniques. To improve its correctness, we experimented with various feature mappings, resulting in the maximal accuracy of 79{\%}.",
keywords = "Human Genome Sequences, Natural Language Processing, Patent Claim Analysis, Supervised Classification",
author = "Donglu Wang and Gabriela Ferraro and Hanna Suominen and Jefferson, {Osmat A.}",
year = "2014",
month = "11",
day = "26",
doi = "10.1145/2682862.2682872",
language = "English",
isbn = "9781450330008",
series = "ACM International Conference Proceeding Series",
publisher = "Association for Computing Machinery (ACM)",
pages = "117--120",
editor = "Culpepper, {J Shane} and Laurence Park and Guido Zuccon",
booktitle = "ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium",
address = "Melbourne",

}

Wang, D, Ferraro, G, Suominen, H & Jefferson, OA 2014, Automated categorisation of patent claims that reference human genome sequences. in JS Culpepper, L Park & G Zuccon (eds), ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium. ACM International Conference Proceeding Series, vol. 27-28-November-2014, Association for Computing Machinery (ACM), Melbourne, pp. 117-120, 19th Australasian Document Computing Symposium, ADCS 2014, Melbourne, Australia, 27/11/14. https://doi.org/10.1145/2682862.2682872

Automated categorisation of patent claims that reference human genome sequences. / Wang, Donglu; Ferraro, Gabriela; Suominen, Hanna; Jefferson, Osmat A.

ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium. ed. / J Shane Culpepper; Laurence Park; Guido Zuccon. Melbourne : Association for Computing Machinery (ACM), 2014. p. 117-120 (ACM International Conference Proceeding Series; Vol. 27-28-November-2014).

TY - GEN
T1 - Automated categorisation of patent claims that reference human genome sequences
AU - Wang, Donglu
AU - Ferraro, Gabriela
AU - Suominen, Hanna
AU - Jefferson, Osmat A.
PY - 2014/11/26
Y1 - 2014/11/26
N2 - Debates on gene patents have necessitated the analysis of patents that disclose and reference human sequences. In this study, we built an automated classifier that assigns sequences to one of nine predefined categories according to their functional roles in patent claims by applying natural language processing and supervised learning techniques. To improve its correctness, we experimented with various feature mappings, resulting in the maximal accuracy of 79%.
AB - Debates on gene patents have necessitated the analysis of patents that disclose and reference human sequences. In this study, we built an automated classifier that assigns sequences to one of nine predefined categories according to their functional roles in patent claims by applying natural language processing and supervised learning techniques. To improve its correctness, we experimented with various feature mappings, resulting in the maximal accuracy of 79%.
KW - Human Genome Sequences
KW - Natural Language Processing
KW - Patent Claim Analysis
KW - Supervised Classification
UR - http://www.scopus.com/inward/record.url?scp=84958533762&partnerID=8YFLogxK
U2 - 10.1145/2682862.2682872
DO - 10.1145/2682862.2682872
M3 - Conference contribution
SN - 9781450330008
T3 - ACM International Conference Proceeding Series
SP - 117
EP - 120
BT - ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium
A2 - Culpepper, J Shane
A2 - Park, Laurence
A2 - Zuccon, Guido
PB - Association for Computing Machinery (ACM)
CY - Melbourne
ER -

Wang D, Ferraro G, Suominen H, Jefferson OA. Automated categorisation of patent claims that reference human genome sequences. In Culpepper JS, Park L, Zuccon G, editors, ADCS 2014 - Proceedings of the 19th Australasian Document Computing Symposium. Melbourne: Association for Computing Machinery (ACM). 2014. p. 117-120. (ACM International Conference Proceeding Series). https://doi.org/10.1145/2682862.2682872