A Rule Based Prosody Model for Turkish Text-To-Speech Synthesis

dc.authoridILK, Hakki Gokhan/0000-0003-4365-8286
dc.authorscopusid54884771800
dc.authorscopusid55891484200
dc.authorscopusid55638847900
dc.authorwosidILK, HAKKI/AAA-1055-2021
dc.authorwosidUslu, Baran/AAR-1071-2020
dc.authorwosid, AEYilmaz/AAH-3914-2020
dc.contributor.authorUslu, Ibrahim Baran
dc.contributor.authorIlk, Hakki Gokhan
dc.contributor.authorYilmaz, Asim Egemen
dc.contributor.otherDepartment of Electrical & Electronics Engineering
dc.date.accessioned2024-10-06T10:56:45Z
dc.date.available2024-10-06T10:56:45Z
dc.date.issued2013
dc.departmentAtılım Universityen_US
dc.department-temp[Uslu, Ibrahim Baran] Atilim Univ, Fac Engn, Elect Elect Eng Dept, TR-06836 Incek Ankara, Turkey; [Ilk, Hakki Gokhan; Yilmaz, Asim Egemen] Ankara Univ, Fac Engn, Elect Elect Eng Dept, TR-06100 Tandogan, Turkeyen_US
dc.descriptionILK, Hakki Gokhan/0000-0003-4365-8286en_US
dc.description.abstractThis paper presents a novel prosody model for a Turkish text-to-speech (TTS) synthesis system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we derive prosody rules to increase the naturalness of the synthesizer. Since an inflected verb in Turkish can form a stand-alone sentence through the suffixes it takes, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. Affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined systematically, and some phrases were examined in the same way to obtain proper prosody. According to listening tests, the rules, which are based on duration, pitch and energy modification weights, yield perceptually better synthesized speech: an average improvement of about 1.78 out of 5.0 in the CMOS (Comparative Mean Opinion Score) test. This improvement indicates the success of the prosody model.en_US
dc.description.woscitationindexScience Citation Index Expanded
dc.identifier.citationcount0
dc.identifier.endpage223en_US
dc.identifier.issn1330-3651
dc.identifier.issue2en_US
dc.identifier.scopus2-s2.0-84876386858
dc.identifier.scopusqualityQ3
dc.identifier.startpage217en_US
dc.identifier.urihttps://hdl.handle.net/20.500.14411/8607
dc.identifier.volume20en_US
dc.identifier.wosWOS:000317728700003
dc.identifier.wosqualityQ4
dc.institutionauthorUslu, İbrahim Baran
dc.language.isoenen_US
dc.publisherUniv Osijek, Tech Facen_US
dc.relation.ispartofTehnicki Vjesniken_US
dc.relation.publicationcategoryArticle - International Refereed Journal - Institutional Faculty Memberen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.scopus.citedbyCount0
dc.subjectCMOS testen_US
dc.subjectdiphoneen_US
dc.subjectnatural speechen_US
dc.subjectprosodyen_US
dc.subjectPSOLAen_US
dc.subjecttext-to-speech synthesis (TTS)en_US
dc.subjectverb inflectionen_US
dc.titleA Rule Based Prosody Model for Turkish Text-To-Speech Synthesisen_US
dc.title.alternativeProzodijski model za sintezu Turskog teksta u govor na temelju pravilaen_US
dc.typeArticleen_US
dc.wos.citedbyCount0
dspace.entity.typePublication
relation.isAuthorOfPublication186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isAuthorOfPublication.latestForDiscovery186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isOrgUnitOfPublicationc3c9b34a-b165-4cd6-8959-dc25e91e206b
relation.isOrgUnitOfPublication.latestForDiscoveryc3c9b34a-b165-4cd6-8959-dc25e91e206b
