Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling

No Thumbnail Available

Date

2022

Journal Title

Journal ISSN

Volume Title

Publisher

Ieee-inst Electrical Electronics Engineers inc

Open Access Color

GOLD

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Average
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.

Description

Cagiltay, Nergiz/0000-0003-0875-9276; Menekse Dalveren, Gonca Gokce/0000-0002-8649-1909; GURCAN, Fatih/0000-0001-9915-6686; Forti, Stefano/0000-0002-4159-8761

Keywords

Market research, Systematics, Software engineering, Software, Bibliometrics, Text mining, Licenses, Corpus creation, research trends and topics, software engineering, text mining, topic model, research trends and topics, Corpus creation, text mining, Electrical engineering. Electronics. Nuclear engineering, topic model, software engineering, TK1-9971

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q2

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
21

Source

IEEE Access

Volume

10

Issue

Start Page

74638

End Page

74654

Collections

PlumX Metrics
Citations

CrossRef : 11

Scopus : 25

Captures

Mendeley Readers : 48

SCOPUS™ Citations

25

checked on Jan 25, 2026

Web of Science™ Citations

20

checked on Jan 25, 2026

Page Views

2

checked on Jan 25, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
11.39973332

Sustainable Development Goals

SDG data is not available