A Rule Based Prosody Model for Turkish Text-To Synthesis

dc.authorid ILK, Hakki Gokhan/0000-0003-4365-8286
dc.authorscopusid 54884771800
dc.authorscopusid 55891484200
dc.authorscopusid 55638847900
dc.authorwosid ILK, HAKKI/AAA-1055-2021
dc.authorwosid Uslu, Baran/AAR-1071-2020
dc.authorwosid , AEYilmaz/AAH-3914-2020
dc.contributor.author Uslu, Ibrahim Baran
dc.contributor.author Ilk, Hakki Gokhan
dc.contributor.author Yilmaz, Asim Egemen
dc.contributor.other Department of Electrical & Electronics Engineering
dc.contributor.other Department of Electrical & Electronics Engineering
dc.date.accessioned 2024-10-06T10:56:45Z
dc.date.available 2024-10-06T10:56:45Z
dc.date.issued 2013
dc.department Atılım University en_US
dc.department-temp [Uslu, Ibrahim Baran] Atilim Univ, Fac Engn, Elect Elect Eng Dept, TR-06836 Incek Ankara, Turkey; [Ilk, Hakki Gokhan; Yilmaz, Asim Egemen] Ankara Univ, Fac Engn, Elect Elect Eng Dept, TR-06100 Tandogan, Turkey en_US
dc.description ILK, Hakki Gokhan/0000-0003-4365-8286 en_US
dc.description.abstract This paper presents our novel prosody model in a Turkish text-to-speech synthesis (TTS) system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we try to figure out some prosody rules in order to increase the naturalness of our synthesizer. Since the inflected verbs in Turkish can be stand-alone sentences with the suffixes they take, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. Affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined in a systematic way. Not only verbs, but in the same way, some phrases were examined for obtaining a proper prosody. According to the results of listening tests, the defined rules based on duration, pitch and energy modification weights, result in perceptually better speech synthesis, namely about 1,78/5,0 improvement in average in the CMOS (Comparative Mean Opinion Score) test. This improvement shows the success of our novel prosody model. en_US
dc.description.woscitationindex Science Citation Index Expanded
dc.identifier.citationcount 0
dc.identifier.endpage 223 en_US
dc.identifier.issn 1330-3651
dc.identifier.issue 2 en_US
dc.identifier.scopus 2-s2.0-84876386858
dc.identifier.scopusquality Q3
dc.identifier.startpage 217 en_US
dc.identifier.uri https://hdl.handle.net/20.500.14411/8607
dc.identifier.volume 20 en_US
dc.identifier.wos WOS:000317728700003
dc.identifier.wosquality Q4
dc.institutionauthor Uslu, İbrahim Baran
dc.institutionauthor Uslu, İbrahim Baran
dc.language.iso en en_US
dc.publisher Univ Osijek, Tech Fac en_US
dc.relation.ispartof Tehnicki Vjesnik en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.scopus.citedbyCount 0
dc.subject CMOS test en_US
dc.subject diphone en_US
dc.subject natural speech en_US
dc.subject prosody en_US
dc.subject PSOLA en_US
dc.subject text-to-speech synthesis (TTS) en_US
dc.subject verb inflection en_US
dc.title A Rule Based Prosody Model for Turkish Text-To Synthesis en_US
dc.title.alternative Prozodijski model za sintezu Turskog teksta u govor na temelju pravila en_US
dc.type Article en_US
dc.wos.citedbyCount 0
dspace.entity.type Publication
relation.isAuthorOfPublication 186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isAuthorOfPublication.latestForDiscovery 186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isOrgUnitOfPublication c3c9b34a-b165-4cd6-8959-dc25e91e206b
relation.isOrgUnitOfPublication.latestForDiscovery c3c9b34a-b165-4cd6-8959-dc25e91e206b

Files

Collections