Extractive Text Summarization for Turkish: Implementation of TF-IDF and PageRank Algorithms

dc.authorscopusid57895357200
dc.authorscopusid24315330000
dc.contributor.authorTurhan, Çiğdem
dc.contributor.authorTurhan,Ç.
dc.contributor.otherSoftware Engineering
dc.date.accessioned2024-07-05T15:50:21Z
dc.date.available2024-07-05T15:50:21Z
dc.date.issued2023
dc.departmentAtılım Universityen_US
dc.department-tempAkülker E., Havelsan, Ankara, Turkey; Turhan Ç., Department of Software Engineering, Atılım University, Ankara, Turkeyen_US
dc.description.abstractDue to the massive amount of information available on the web, reaching the desired content has become more and more difficult. Automatic text summarization helps to solve the problem by minimizing the document size while keeping its core information. In this study, two extractive single document automatic text summarization systems for Turkish are presented which implement the statistical-based TF-IDF algorithm as well as the combination of TF-IDF with the graph-based PageRank algorithm. The study aims to reveal the usability and effectiveness of these algorithms for Turkish documents. Moreover, the results of the TF-IDF implementation and the hybrid approach are compared using the co-selection measures, precision, recall, and F-score. In the evaluation phase, the system-generated summaries are categorized and tested based on their word sizes and the predetermined thresholds and compared against the human-generated summaries. The results indicate that the hybrid system performs better than the TF-IDF system even in lower thresholds, and also both systems are inclined to improve average F-scores in higher threshold generated summarization. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.en_US
dc.identifier.citation0
dc.identifier.doi10.1007/978-3-031-16075-2_51
dc.identifier.endpage704en_US
dc.identifier.isbn978-303116074-5
dc.identifier.issn2367-3370
dc.identifier.scopus2-s2.0-85138243155
dc.identifier.scopusqualityQ4
dc.identifier.startpage688en_US
dc.identifier.urihttps://doi.org/10.1007/978-3-031-16075-2_51
dc.identifier.urihttps://hdl.handle.net/20.500.14411/4136
dc.identifier.volume544 LNNSen_US
dc.language.isoenen_US
dc.publisherSpringer Science and Business Media Deutschland GmbHen_US
dc.relation.ispartofLecture Notes in Networks and Systems -- Intelligent Systems Conference, IntelliSys 2022 -- 1 September 2022 through 2 September 2022 -- Virtual, Online -- 282539en_US
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectPageRanken_US
dc.subjectText summarizationen_US
dc.subjectTF-IDFen_US
dc.subjectTurkishen_US
dc.titleExtractive Text Summarization for Turkish: Implementation of TF-IDF and PageRank Algorithmsen_US
dc.typeConference Objecten_US
dspace.entity.typePublication
relation.isAuthorOfPublicationdf768b22-7cc0-4650-882f-5af552c7a5f2
relation.isAuthorOfPublication.latestForDiscoverydf768b22-7cc0-4650-882f-5af552c7a5f2
relation.isOrgUnitOfPublicationd86bbe4b-0f69-4303-a6de-c7ec0c515da5
relation.isOrgUnitOfPublication.latestForDiscoveryd86bbe4b-0f69-4303-a6de-c7ec0c515da5

Files

Collections