A rule based prosody model for Turkish text-to-speech synthesis;

Uslu, İbrahim Baran; Ilk,H.G.; Yilmaz,A.E.

A rule based prosody model for Turkish text-to-speech synthesis;

dc.authorscopusid	54884771800
dc.authorscopusid	55891484200
dc.authorscopusid	55638847900
dc.contributor.author	Uslu, İbrahim Baran
dc.contributor.author	Ilk,H.G.
dc.contributor.author	Yilmaz,A.E.
dc.contributor.other	Department of Electrical & Electronics Engineering
dc.date.accessioned	2024-10-06T11:15:00Z
dc.date.available	2024-10-06T11:15:00Z
dc.date.issued	2013
dc.department	Atılım University	en_US
dc.department-temp	Uslu I.B., Atilim University, Electrical-Electronics Eng. Dept., Atilim Universitesi Elektrik-Elektronik Muhendisligi Bolumu, Kizilcasar Mahallesi 06836 Incek Ankara, Turkey; Ilk H.G., Ankara University, Electrical-Electronics Eng. Dept., Ankara Universitesi Elektrik-Elektronik Muhendisligi Bolumu, 06100 Tandogan Ankara, Turkey; Yilmaz A.E., Ankara University, Electrical-Electronics Eng. Dept., Ankara Universitesi Elektrik-Elektronik Muhendisligi Bolumu, 06100 Tandogan Ankara, Turkey	en_US
dc.description.abstract	This paper presents our novel prosody model in a Turkish text-to-speech synthesis (TTS) system. After developing a TTS system driven by parametric features consisting of duration, pitch and energy modifications, we try to figure out some prosody rules in order to increase the naturalness of our synthesizer. Since the inflected verbs in Turkish can be stand-alone sentences with the suffixes they take, we build a perceptual prosody model by defining rules on the stress patterns of verb inflections. Affirmative, negative and interrogative (both positive and negative) forms of many verbs were examined in a systematic way. Not only verbs, but in the same way, some phrases were examined for obtaining a proper prosody. According to the results of listening tests, the defined rules based on duration, pitch and energy modification weights, result in perceptually better speech synthesis, namely about 1,78/5,0 improvement in average in the CMOS (Comparative Mean Opinion Score) test. This improvement shows the success of our novel prosody model.	en_US
dc.identifier.citation	0
dc.identifier.doi	[SCOPUS-DOI-BELIRLENECEK-218]
dc.identifier.endpage	223	en_US
dc.identifier.issn	1330-3651
dc.identifier.issue	2	en_US
dc.identifier.scopus	2-s2.0-84876386858
dc.identifier.scopusquality	Q3
dc.identifier.startpage	217	en_US
dc.identifier.uri	https://hdl.handle.net/20.500.14411/9360
dc.identifier.volume	20	en_US
dc.identifier.wosquality	Q4
dc.language.iso	en	en_US
dc.relation.ispartof	Tehnicki Vjesnik	en_US
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	CMOS test	en_US
dc.subject	Diphone	en_US
dc.subject	Natural speech	en_US
dc.subject	Prosody	en_US
dc.subject	PSOLA	en_US
dc.subject	Text-to-speech synthesis (TTS)	en_US
dc.subject	Verb inflection	en_US
dc.title	A rule based prosody model for Turkish text-to-speech synthesis;	en_US
dc.title.alternative	Prozodijski model za sintezu Turskog teksta u govor na temelju pravila	en_US
dc.type	Article	en_US
dspace.entity.type	Publication
relation.isAuthorOfPublication	186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isAuthorOfPublication.latestForDiscovery	186f4f6f-718b-4cd3-bc88-2f9bb73a0ed8
relation.isOrgUnitOfPublication	c3c9b34a-b165-4cd6-8959-dc25e91e206b
relation.isOrgUnitOfPublication.latestForDiscovery	c3c9b34a-b165-4cd6-8959-dc25e91e206b

Collections

Scopus

A rule based prosody model for Turkish text-to-speech synthesis;

Files

Collections