Extractive Text Summarization for Turkish: Implementation of Tf-Idf and Pagerank Algorithms

No Thumbnail Available

Date

2023

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Science and Business Media Deutschland GmbH

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

Due to the massive amount of information available on the web, reaching the desired content has become more and more difficult. Automatic text summarization helps to solve the problem by minimizing the document size while keeping its core information. In this study, two extractive single document automatic text summarization systems for Turkish are presented which implement the statistical-based TF-IDF algorithm as well as the combination of TF-IDF with the graph-based PageRank algorithm. The study aims to reveal the usability and effectiveness of these algorithms for Turkish documents. Moreover, the results of the TF-IDF implementation and the hybrid approach are compared using the co-selection measures, precision, recall, and F-score. In the evaluation phase, the system-generated summaries are categorized and tested based on their word sizes and the predetermined thresholds and compared against the human-generated summaries. The results indicate that the hybrid system performs better than the TF-IDF system even in lower thresholds, and also both systems are inclined to improve average F-scores in higher threshold generated summarization. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Description

Keywords

PageRank, Text summarization, TF-IDF, Turkish

Turkish CoHE Thesis Center URL

Fields of Science

Citation

WoS Q

Scopus Q

Q4
OpenCitations Logo
OpenCitations Citation Count
1

Source

Lecture Notes in Networks and Systems -- Intelligent Systems Conference, IntelliSys 2022 -- 1 September 2022 through 2 September 2022 -- Virtual, Online -- 282539

Volume

544 LNNS

Issue

Start Page

688

End Page

704

Collections

PlumX Metrics
Citations

Scopus : 1

Captures

Mendeley Readers : 6

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
1.0931942

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo

5

GENDER EQUALITY
GENDER EQUALITY Logo

17

PARTNERSHIPS FOR THE GOALS
PARTNERSHIPS FOR THE GOALS Logo