A rule based prosody model for Turkish text-to-speech synthesis

dc.authorscopusid54884771800
dc.authorscopusid55891484200
dc.authorscopusid55638847900
dc.contributor.authorUslu, İbrahim Baran
dc.contributor.authorIlk,H.G.
dc.contributor.authorYilmaz,A.E.
dc.contributor.otherDepartment of Electrical & Electronics Engineering
dc.date.accessioned2024-10-06T11:15:00Z
dc.date.available2024-10-06T11:15:00Z
dc.date.issued2013
dc.departmentAtılım Universityen_US
dc.department-tempUslu I.B., Atilim University, Electrical-Electronics Eng. Dept., Atilim Universitesi Elektrik-Elektronik Muhendisligi Bolumu, Kizilcasar Mahallesi 06836 Incek Ankara, Turkey; Ilk H.G., Ankara University, Electrical-Electronics Eng. Dept., Ankara Universitesi Elektrik-Elektronik Muhendisligi Bolumu, 06100 Tandogan Ankara, Turkey; Yilmaz A.E., Ankara University, Electrical-Electronics Eng. Dept., Ankara Universitesi Elektrik-Elektronik Muhendisligi Bolumu, 06100 Tandogan Ankara, Turkeyen_US
dc.description.abstractThis paper presents a novel prosody model for a Turkish text-to-speech (TTS) synthesis system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we derive prosody rules to increase the naturalness of the synthesizer. Since inflected verbs in Turkish can stand alone as sentences through the suffixes they take, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. The affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined systematically, and some phrases were examined in the same way to obtain proper prosody. According to the results of listening tests, the rules defined on duration, pitch and energy modification weights yield perceptually better synthesized speech, with an average improvement of about 1.78 out of 5.0 in the CMOS (Comparative Mean Opinion Score) test. This improvement indicates the success of the proposed prosody model.en_US
dc.identifier.citation0
dc.identifier.doi[SCOPUS-DOI-BELIRLENECEK-218]
dc.identifier.endpage223en_US
dc.identifier.issn1330-3651
dc.identifier.issue2en_US
dc.identifier.scopus2-s2.0-84876386858
dc.identifier.scopusqualityQ3
dc.identifier.startpage217en_US
dc.identifier.urihttps://hdl.handle.net/20.500.14411/9360
dc.identifier.volume20en_US
dc.identifier.wosqualityQ4
dc.language.isoenen_US
dc.relation.ispartofTehnicki Vjesniken_US
dc.relation.publicationcategoryArticle - International Refereed Journal - Institutional Faculty Memberen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectCMOS testen_US
dc.subjectDiphoneen_US
dc.subjectNatural speechen_US
dc.subjectProsodyen_US
dc.subjectPSOLAen_US
dc.subjectText-to-speech synthesis (TTS)en_US
dc.subjectVerb inflectionen_US
dc.titleA rule based prosody model for Turkish text-to-speech synthesisen_US
dc.title.alternativeProzodijski model za sintezu Turskog teksta u govor na temelju pravilaen_US
dc.typeArticleen_US
dspace.entity.typePublication
relation.isAuthorOfPublication186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isAuthorOfPublication.latestForDiscovery186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isOrgUnitOfPublicationc3c9b34a-b165-4cd6-8959-dc25e91e206b
relation.isOrgUnitOfPublication.latestForDiscoveryc3c9b34a-b165-4cd6-8959-dc25e91e206b
