Contextualised Segment-Wise Citation Function Classification

Xiaorui Jiang, Jingqiang Chen

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
121 Downloads (Pure)

Abstract

Much effort has been made in the past decades to citation function classification, but noteworthy issues exist. Annotation difficulty resulted in limited data size, especially for minority classes, and inadequate representativeness of the underlying scientific domains. Concerning algorithmic classification, state-of-the-art deep learning-based methods are flawed by generating a feature vector for the whole citation context (or sentence) and failing to exploit the full realm of citation modelling options. Responding to these issues, this paper studied contextualised citation function classification. Specifically, a large new citation context dataset was created by merging and re-annotating six datasets about computational linguistics. A variety of strong SciBERT-based citation function classification models were proposed, and new states of the art were achieved. Through deeper performance analysis, this study focused on answering several research questions about the effective ways of performing citation function classification. More specifically, the study justified the necessity of modelling in-text citations in context and confirmed the superiority of doing citation function classification at citation (segment) level. A particular emphasis was placed on in-depth per-class performance analysis to understand whether citation function classification is robust enough to suit various popular downstream applications and what further efforts are required to meet such analytic needs. Finally, a naïve ensemble classifier was proposed, which greatly improved citation function classification performance.
Original languageEnglish
Pages (from-to)5117-5158
Number of pages42
JournalScientometrics
Volume128
Issue number9
Early online date12 Jul 2023
DOIs
Publication statusPublished - Sept 2023

Bibliographical note

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Funder

The first author Xiaorui Jiang is partially supported by National Planning Office for Philosophy and Social Sciences of China (18ZDA238). Both authors have no competing interests to declare that are relevant to the content of this article.

Keywords

  • Citation context analysis
  • citation function classification
  • deep learning
  • SciBERT
  • Ensemble

ASJC Scopus subject areas

  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Contextualised Segment-Wise Citation Function Classification'. Together they form a unique fingerprint.

Cite this