A Hybrid Approach for Semantic Image Annotation

Loading...

Date

Journal Title

Journal ISSN

Volume Title

Open Access Color

GOLD

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

In this study, a framework that generates natural language descriptions of images within a controlled environment is proposed. Previous work on neural networks mostly focused on choosing the right labels and/or increasing the number of related labels to depict an image. However, creating a textual description of an image is a completely different phenomenon, structurally, syntactically, and semantically. The proposed semantic image annotation framework presents a novel combination of deep learning models and aligned annotation results derived from the instances of the ontology classes to generate sentential descriptions of images. Our hybrid approach benefits from the unique combination of deep learning and semantic web technologies. We detect objects from unlabeled sports images using a deep learning model based on a residual network and a feature pyramid network, with the focal loss technique to obtain predictions with high probability. The proposed framework not only produces probabilistically labeled images, but also the contextual results obtained from a knowledge base exploiting the relationship between the objects. The framework's object detection and prediction performances are tested with two datasets where the first one includes individual instances of images containing everyday scenes of common objects and the second custom dataset contains sports images collected from the web. Moreover, a sample image set is created to obtain annotation result data by applying all framework layers. Experimental results show that the framework is effective in this controlled environment and can be used with other applications via web services within the supported sports domain.

Description

Turhan, Cigdem/0000-0002-6595-7095; Şengül, Gökhan/0000-0003-2273-4411; Sezen, Arda/0000-0002-7615-3623

Keywords

Annotations, Ontologies, Sports, Image annotation, Semantics, Training, Computational modeling, Semantic image annotation, picture interpretation, ontology, Semantic image annotation, picture interpretation, ontology, Electrical engineering. Electronics. Nuclear engineering, TK1-9971

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Scopus Q

OpenCitations Logo
OpenCitations Citation Count
2

Volume

9

Issue

Start Page

131977

End Page

131994

Collections

PlumX Metrics
Citations

Scopus : 3

Captures

Mendeley Readers : 10

SCOPUS™ Citations

3

checked on May 29, 2026

Web of Science™ Citations

2

checked on May 29, 2026

Page Views

2

checked on May 29, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.19

Sustainable Development Goals

SDG data is not available