New Publication on What Drives Citations in Software Engineering

June 15, 2022 /

Lorenz Graf-Vlachy, Daniel Graziotin and Stefan Wagner of the Empirical Software Engineering group published and presented their research on what characteristics drive citations in software engineering publications at the EASE 2022: The International Conference on Evaluation and Assessment in Software Engineering 2022.

Citations are a key measure of scientific performance in most fields, including software engineering. However, there is limited research that studies which characteristics of articles’ metadata (title, abstract, keywords, and author list) are driving citations in this field. In this study, we propose a simple theoretical model for how citations come to be with respect to article metadata, we hypothesize theoretical linkages between metadata characteristics and citations of articles, and we empirically test these hypotheses. We use multiple regression analyses to examine a data set comprising the titles, abstracts, keywords, and authors of 16,131 software engineering articles published between 1990 and 2020 in 20 highly influential software engineering venues. Results: We find that number of authors, number of keywords, number of question marks and dividers in the title, number of acronyms, abstract length, abstract propositional idea density, and corresponding authors in the core Anglosphere are significantly related to citations. Various characteristics of articles’ metadata are linked to the frequency with which the corresponding articles are cited. These results partially confirm and partially go counter to prior findings in software engineering and other disciplines.


To the top of the page