Extractive Text Summarization for Turkish: Implementation of Tf-Idf and Pagerank Algorithms
Loading...

Date
2023
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Science and Business Media Deutschland GmbH
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Due to the massive amount of information available on the web, reaching the desired content has become more and more difficult. Automatic text summarization helps to solve the problem by minimizing the document size while keeping its core information. In this study, two extractive single document automatic text summarization systems for Turkish are presented which implement the statistical-based TF-IDF algorithm as well as the combination of TF-IDF with the graph-based PageRank algorithm. The study aims to reveal the usability and effectiveness of these algorithms for Turkish documents. Moreover, the results of the TF-IDF implementation and the hybrid approach are compared using the co-selection measures, precision, recall, and F-score. In the evaluation phase, the system-generated summaries are categorized and tested based on their word sizes and the predetermined thresholds and compared against the human-generated summaries. The results indicate that the hybrid system performs better than the TF-IDF system even in lower thresholds, and also both systems are inclined to improve average F-scores in higher threshold generated summarization. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Description
Keywords
PageRank, Text summarization, TF-IDF, Turkish
Fields of Science
Citation
WoS Q
Scopus Q
Q4

OpenCitations Citation Count
2
Source
Lecture Notes in Networks and Systems -- Intelligent Systems Conference, IntelliSys 2022 -- 1 September 2022 through 2 September 2022 -- Virtual, Online -- 282539
Volume
544 LNNS
Issue
Start Page
688
End Page
704
Collections
PlumX Metrics
Citations
Scopus : 1
Captures
Mendeley Readers : 6
SCOPUS™ Citations
1
checked on Feb 14, 2026
Page Views
2
checked on Feb 14, 2026
Google Scholar™


