Abstract
Citations play a fundamental role in supporting authors’ contribution claims throughout a scientific paper. Labelling citation instances with function labels is indispensable for understanding a scientific text. A single citation is the link between two scientific papers in the citation network. Citations carry rich native information, including the context of the citation, the citation location, the citing and cited paper titles, the DOI, and the website’s URL. Nevertheless, previous studies have ignored this native information when constructing datasets, leaving the citation function classification task without a comprehensive set of valuable features. In this paper, we argue that such information should not be ignored, and accordingly we extract and integrate all of the native information features into different neural text representation models via trainable embeddings and free text. We first construct a new dataset, NI-Cite, comprising a large number of labelled citations with five key native features (Citation Context, Section Name, Title, DOI, Web URL) for each dataset instance. In addition, we use recently developed text representation models integrated with this information to evaluate performance on the citation function classification task. The experimental results demonstrate that the native information features suggested in this paper enhance the overall classification performance.
| Original language | English |
|---|---|
| Pages (from-to) | 6557-6577 |
| Number of pages | 21 |
| Journal | Scientometrics |
| Volume | 127 |
| Issue number | 11 |
| Early online date | 16 Jan 2022 |
| Publication status | Published - Nov 2022 |
Bibliographical note
A correction exists for this article and can be found in Scientometrics, 127(11), p. 6579, doi: 10.1007/s11192-022-04451-1
Keywords
- Citation function classification
- Pretrained language model
- Natural language processing
- Native information
Fingerprint
Dive into the research topics of 'Towards employing native information in citation function classification'. Together they form a unique fingerprint.
Projects
- 1 Finished
What Can You Trust in the Large and Noisy Web?
Sheng, M. (Primary Chief Investigator), Yang, J. (Chief Investigator), Zhang, W. (Chief Investigator) & Dustdar, S. (Partner Investigator)
1/05/20 → 30/04/23
Project: Research