Exploratory Analysis of Topic Interests and Their Evolution in Bioinformatics Research Using Semantic Text Mining and Probabilistic Topic Modeling

Loading...
Publication Logo

Date

2022

Journal Title

Journal ISSN

Volume Title

Publisher

Ieee-inst Electrical Electronics Engineers inc

Open Access Color

GOLD

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Bioinformatics, which has developed rapidly in recent years with the collaborative contributions of the fields of biology and informatics, provides a deeper perspective on the analysis and understanding of complex biological data. In this regard, bioinformatics has an interdisciplinary background and a rich literature in terms of domain-specific studies. Providing a holistic picture of bioinformatics research by analyzing the major topics and their trends and developmental stages is critical for an understanding of the field. From this perspective, this study aimed to analyze the last 50 years of bioinformatics studies (a total of 71,490 articles) by using an automated text-mining methodology based on probabilistic topic modeling to reveal the main topics, trends, and the evolution of the field. As a result, 24 major topics that reflect the focuses and trends of the field were identified. Based on the discovered topics and their temporal tendencies from 1970 until 2020, the developmental periods of the field were divided into seven phases, from the "newborn" to the "wisdom" stages. Moreover, the findings indicated a recent increase in the popularity of the topics "Statistical Estimation", "Data Analysis Tools", "Genomic Data", "Gene Expression", and "Prediction". The results of the study revealed that, in bioinformatics studies, interest in innovative computing and data analysis methods based on artificial intelligence and machine learning has gradually increased, thereby marking a significant improvement in contemporary analysis tools and techniques based on prediction.

Description

GURCAN, Fatih/0000-0001-9915-6686; Cagiltay, Nergiz/0000-0003-0875-9276

Keywords

Bioinformatics, Market research, Biology, Analytical models, Genomics, Proteins, Computational modeling, Bioinformatics corpus, probabilistic topic modeling, textual content analysis, scientometric analysis, bioinformatics topics and trends, scientometric analysis, Bioinformatics corpus, textual content analysis, bioinformatics topics and trends, Electrical engineering. Electronics. Nuclear engineering, probabilistic topic modeling, TK1-9971

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q2

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
24

Source

IEEE Access

Volume

10

Issue

Start Page

31480

End Page

31493

Collections

PlumX Metrics
Citations

CrossRef : 18

Scopus : 36

Captures

Mendeley Readers : 39

SCOPUS™ Citations

36

checked on Feb 07, 2026

Web of Science™ Citations

26

checked on Feb 07, 2026

Page Views

2

checked on Feb 07, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
5.41504295

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo

4

QUALITY EDUCATION
QUALITY EDUCATION Logo

5

GENDER EQUALITY
GENDER EQUALITY Logo

8

DECENT WORK AND ECONOMIC GROWTH
DECENT WORK AND ECONOMIC GROWTH Logo

9

INDUSTRY, INNOVATION AND INFRASTRUCTURE
INDUSTRY, INNOVATION AND INFRASTRUCTURE Logo

10

REDUCED INEQUALITIES
REDUCED INEQUALITIES Logo

12

RESPONSIBLE CONSUMPTION AND PRODUCTION
RESPONSIBLE CONSUMPTION AND PRODUCTION Logo

14

LIFE BELOW WATER
LIFE BELOW WATER Logo

16

PEACE, JUSTICE AND STRONG INSTITUTIONS
PEACE, JUSTICE AND STRONG INSTITUTIONS Logo