A Rule Based Prosody Model for Turkish Text-To Synthesis

dc.contributor.author Uslu, Ibrahim Baran
dc.contributor.author Ilk, Hakki Gokhan
dc.contributor.author Yilmaz, Asim Egemen
dc.contributor.other Department of Electrical & Electronics Engineering
dc.contributor.other Department of Electrical & Electronics Engineering
dc.contributor.other 15. Graduate School of Natural and Applied Sciences
dc.contributor.other 01. Atılım University
dc.date.accessioned 2024-10-06T10:56:45Z
dc.date.available 2024-10-06T10:56:45Z
dc.date.issued 2013
dc.description ILK, Hakki Gokhan/0000-0003-4365-8286 en_US
dc.description.abstract This paper presents our novel prosody model in a Turkish text-to-speech synthesis (TTS) system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we try to figure out some prosody rules in order to increase the naturalness of our synthesizer. Since the inflected verbs in Turkish can be stand-alone sentences with the suffixes they take, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. Affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined in a systematic way. Not only verbs, but in the same way, some phrases were examined for obtaining a proper prosody. According to the results of listening tests, the defined rules based on duration, pitch and energy modification weights, result in perceptually better speech synthesis, namely about 1,78/5,0 improvement in average in the CMOS (Comparative Mean Opinion Score) test. This improvement shows the success of our novel prosody model. en_US
dc.identifier.issn 1330-3651
dc.identifier.scopus 2-s2.0-84876386858
dc.identifier.uri https://hdl.handle.net/20.500.14411/8607
dc.language.iso en en_US
dc.publisher Univ Osijek, Tech Fac en_US
dc.relation.ispartof Tehnicki Vjesnik en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject CMOS test en_US
dc.subject diphone en_US
dc.subject natural speech en_US
dc.subject prosody en_US
dc.subject PSOLA en_US
dc.subject text-to-speech synthesis (TTS) en_US
dc.subject verb inflection en_US
dc.title A Rule Based Prosody Model for Turkish Text-To Synthesis en_US
dc.title.alternative Prozodijski model za sintezu Turskog teksta u govor na temelju pravila en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id ILK, Hakki Gokhan/0000-0003-4365-8286
gdc.author.institutional Uslu, İbrahim Baran
gdc.author.institutional Uslu, İbrahim Baran
gdc.author.scopusid 54884771800
gdc.author.scopusid 55891484200
gdc.author.scopusid 55638847900
gdc.author.wosid ILK, HAKKI/AAA-1055-2021
gdc.author.wosid Uslu, Baran/AAR-1071-2020
gdc.author.wosid , AEYilmaz/AAH-3914-2020
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.description.department Atılım University en_US
gdc.description.departmenttemp [Uslu, Ibrahim Baran] Atilim Univ, Fac Engn, Elect Elect Eng Dept, TR-06836 Incek Ankara, Turkey; [Ilk, Hakki Gokhan; Yilmaz, Asim Egemen] Ankara Univ, Fac Engn, Elect Elect Eng Dept, TR-06100 Tandogan, Turkey en_US
gdc.description.endpage 223 en_US
gdc.description.issue 2 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.startpage 217 en_US
gdc.description.volume 20 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q4
gdc.identifier.wos WOS:000317728700003
gdc.scopus.citedcount 0
gdc.wos.citedcount 0
relation.isAuthorOfPublication 186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isAuthorOfPublication.latestForDiscovery 186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isOrgUnitOfPublication c3c9b34a-b165-4cd6-8959-dc25e91e206b
relation.isOrgUnitOfPublication dff2e5a6-d02d-4bef-8b9e-efebe3919b10
relation.isOrgUnitOfPublication 50be38c5-40c4-4d5f-b8e6-463e9514c6dd
relation.isOrgUnitOfPublication.latestForDiscovery c3c9b34a-b165-4cd6-8959-dc25e91e206b

Files

Collections