A Rule Based Prosody Model for Turkish Text-to-Speech Synthesis
| dc.contributor.author | Uslu, Ibrahim Baran | |
| dc.contributor.author | Ilk, Hakki Gokhan | |
| dc.contributor.author | Yilmaz, Asim Egemen | |
| dc.contributor.other | Department of Electrical & Electronics Engineering | |
| dc.contributor.other | 15. Graduate School of Natural and Applied Sciences | |
| dc.contributor.other | 01. Atılım University | |
| dc.date.accessioned | 2024-10-06T10:56:45Z | |
| dc.date.available | 2024-10-06T10:56:45Z | |
| dc.date.issued | 2013 | |
| dc.description | ILK, Hakki Gokhan/0000-0003-4365-8286 | en_US |
| dc.description.abstract | This paper presents a novel prosody model for a Turkish text-to-speech (TTS) synthesis system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we derive prosody rules to increase the naturalness of the synthesizer. Since an inflected verb in Turkish can stand alone as a sentence by virtue of the suffixes it takes, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. Affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined systematically, and some phrases were examined in the same way to obtain proper prosody. According to the results of listening tests, the defined rules, based on duration, pitch and energy modification weights, yield perceptually better synthesized speech: an average improvement of about 1.78 out of 5.0 in the CMOS (Comparative Mean Opinion Score) test. This improvement indicates the effectiveness of the proposed prosody model. | en_US |
| dc.identifier.issn | 1330-3651 | |
| dc.identifier.scopus | 2-s2.0-84876386858 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14411/8607 | |
| dc.language.iso | en | en_US |
| dc.publisher | Univ Osijek, Tech Fac | en_US |
| dc.relation.ispartof | Tehnicki Vjesnik | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | CMOS test | en_US |
| dc.subject | diphone | en_US |
| dc.subject | natural speech | en_US |
| dc.subject | prosody | en_US |
| dc.subject | PSOLA | en_US |
| dc.subject | text-to-speech synthesis (TTS) | en_US |
| dc.subject | verb inflection | en_US |
| dc.title | A Rule Based Prosody Model for Turkish Text-to-Speech Synthesis | en_US |
| dc.title.alternative | Prozodijski model za sintezu Turskog teksta u govor na temelju pravila | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.id | ILK, Hakki Gokhan/0000-0003-4365-8286 | |
| gdc.author.institutional | Uslu, İbrahim Baran | |
| gdc.author.scopusid | 54884771800 | |
| gdc.author.scopusid | 55891484200 | |
| gdc.author.scopusid | 55638847900 | |
| gdc.author.wosid | ILK, HAKKI/AAA-1055-2021 | |
| gdc.author.wosid | Uslu, Baran/AAR-1071-2020 | |
| gdc.author.wosid | AEYilmaz/AAH-3914-2020 | |
| gdc.coar.access | metadata only access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.description.department | Atılım University | en_US |
| gdc.description.departmenttemp | [Uslu, Ibrahim Baran] Atilim Univ, Fac Engn, Elect Elect Eng Dept, TR-06836 Incek Ankara, Turkey; [Ilk, Hakki Gokhan; Yilmaz, Asim Egemen] Ankara Univ, Fac Engn, Elect Elect Eng Dept, TR-06100 Tandogan, Turkey | en_US |
| gdc.description.endpage | 223 | en_US |
| gdc.description.issue | 2 | en_US |
| gdc.description.publicationcategory | Article - International Refereed Journal - Institutional Faculty Member | en_US |
| gdc.description.scopusquality | Q3 | |
| gdc.description.startpage | 217 | en_US |
| gdc.description.volume | 20 | en_US |
| gdc.description.woscitationindex | Science Citation Index Expanded | |
| gdc.description.wosquality | Q4 | |
| gdc.identifier.wos | WOS:000317728700003 | |
| gdc.scopus.citedcount | 0 | |
| gdc.wos.citedcount | 0 | |
| relation.isAuthorOfPublication | 186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8 | |
| relation.isAuthorOfPublication.latestForDiscovery | 186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8 | |
| relation.isOrgUnitOfPublication | c3c9b34a-b165-4cd6-8959-dc25e91e206b | |
| relation.isOrgUnitOfPublication | dff2e5a6-d02d-4bef-8b9e-efebe3919b10 | |
| relation.isOrgUnitOfPublication | 50be38c5-40c4-4d5f-b8e6-463e9514c6dd | |
| relation.isOrgUnitOfPublication.latestForDiscovery | c3c9b34a-b165-4cd6-8959-dc25e91e206b |