What is it about?
Classifying citation trajectories of scientific publications is crucial. However, they diffuse anomalously due to non-linear, non-stationary, and long-ranged correlations. Previous studies define hard thresholds, arbitrary parameters, and subjective rules to classify based on their rise and fall patterns. It leads to substantial variance and, thus, ambiguous classification. This paper proposes CiteDEK, a hybrid EMD-kNN-DTW classification model framework. It predicts the nature of 5,039 trajectories, each 30 years in length, using only raw time series. We get a classification accuracy of 76%, and Cohen's kappa-statistic is 0.63, which is significant.
Featured Image
Why is it important?
There are several diverse applications of classifying citation time series. It can help evaluate same or cross-discipline articles and researchers, track field evolution, retract articles based on their obsolescence, and recommend breakthrough discoveries early.
Perspectives
Read the Original
This page is a summary of: CiteDEK: A hybrid EMD-KNN-DTW model for classification of paper citation trajectories, January 2024, ACM (Association for Computing Machinery),
DOI: 10.1145/3632410.3632481.
You can read the full text:
Contributors
The following have contributed to this page