TY - JOUR
T1 - Extracting the Evolutionary Backbone of Scientific Domains
T2 - The semantic main path network analysis approach based on citation context analysis
AU - Jiang, Xiaorui
AU - Liu, Junjun
N1 - This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
PY - 2023/3/21
Y1 - 2023/3/21
N2 - Main path analysis is a popular method for extracting the scientific backbone from the citation network of a research domain. Existing approaches ignored the semantic relationships between the citing and cited publications, resulting in several adverse issues, in terms of coherence of main paths and coverage of significant studies. This paper advocated the semantic main path analysis approach to alleviate these issues based on citation function analysis. A wide variety of SciBERT-based deep learning models were designed for identifying citation functions. Semantic citation networks were built by either including important citations, e.g., extension, motivation, usage and similarity, or excluding incidental citations like background and future work. Semantic main path network was built by merging the top-K main paths extracted from various time slices of semantic citation network. In addition, this study proposed a three-way framework for quantitative evaluation of main path analysis results. Both qualitative and quantitative analysis on three research areas of computational linguistics demonstrated that, compared to semantics-agnostic counterparts, different types of semantic main path networks provide complementary views scientific knowledge flows. Combining them together, we can obtain a more precise and comprehensive picture of domain evolution and uncover more coherent development pathways between scientific ideas.
AB - Main path analysis is a popular method for extracting the scientific backbone from the citation network of a research domain. Existing approaches ignored the semantic relationships between the citing and cited publications, resulting in several adverse issues, in terms of coherence of main paths and coverage of significant studies. This paper advocated the semantic main path analysis approach to alleviate these issues based on citation function analysis. A wide variety of SciBERT-based deep learning models were designed for identifying citation functions. Semantic citation networks were built by either including important citations, e.g., extension, motivation, usage and similarity, or excluding incidental citations like background and future work. Semantic main path network was built by merging the top-K main paths extracted from various time slices of semantic citation network. In addition, this study proposed a three-way framework for quantitative evaluation of main path analysis results. Both qualitative and quantitative analysis on three research areas of computational linguistics demonstrated that, compared to semantics-agnostic counterparts, different types of semantic main path networks provide complementary views scientific knowledge flows. Combining them together, we can obtain a more precise and comprehensive picture of domain evolution and uncover more coherent development pathways between scientific ideas.
KW - Computer Networks and Communications
KW - Information Systems
KW - Information Systems and Management
KW - Library and Information Sciences
UR - http://www.scopus.com/inward/record.url?scp=85150879526&partnerID=8YFLogxK
U2 - 10.1002/asi.24748
DO - 10.1002/asi.24748
M3 - Article
SN - 2330-1643
VL - 74
SP - 546
EP - 569
JO - Journal of the Association for Information Science and Technology
JF - Journal of the Association for Information Science and Technology
IS - 5
ER -