Search Results

Now showing 1 - 10 of 11
  • Conference Object
    Design and Implementation of an Expressive Talking Mobile Robot: Toztorus
    (Ieee, 2018) Tozan, Ozalp; Tora, Hakan; Uslu, Baran; Unal, Bulcnt; Ceylan, Ece
    This paper is about a brand new robot and all its development stages from the design to the show time. As an undergraduate research project (the LAP program at Atilim University), the robot TozTorUs is the outcome of the dense efforts of a team. With the sensors equipped, it navigates autonomously in the environment in which it is located by avoiding the obstacles. It can understand your questions and answer them using Google's speech technologies. Although it is not a humanoid robot, with eyes and mouth simulator LED displays, it is as friendly as a human. We can also control TozTorUs using a mobile phone. Apart from these, it is able to adjust its height with respect to the visitor's, thus allowing it to make an eye contact with the person. Although TozTorUs is designed for welcoming, it may also be employed for consulting, security and elderly assistance.
  • Conference Object
    Higher Order Statistical Analysis of Turkish Phones
    (Ieee, 2014) Tora, Hakan; Uslu, Baran
    In this study, histograms of Turkish phones were examined using higher order cumulants. As is known, phones constituting a language, are composed of letters as vowels and consonants. These letters can also be grouped as voiced and unvoiced phones. It is observed that unvoiced letters show a Gaussian-like distribution and result in small values of skewness and kurtosis. On the other hand, vowels and voiced consonants lead to a non-Gaussian distribution. Voiced and unvoiced phones are related with their skewness and kurtosis values. It is empirically shown that higher order cumulants are likely to be a feature in describing Turkish phones.
  • Conference Object
    Naturalness Analysis of the Speech Synthesized by a Tts Card
    (Ieee, 2016) Tora, Hakan; Uslu, Baran
    It is known that the performance of a developed text-to-speech (TTS) synthesis system is assessed by subjective tests. These assessments are usually based on the intelligibility and naturalness of the synthesized speech. In this study, an investigation on relating these subjective test results, thus the naturalness of the synthesized speech, to which acoustic features is accomplished. Consequently the features which will increase the performance while synthesizing the speech are determined. Our work is focused especially on the pitch frequency and energy parameters.
  • Conference Object
    Citation - WoS: 1
    THE USE OF CUMULANTS FOR VOICED-UNVOICED SEGMENTS IDENTIFICATION IN SPEECH SIGNALS
    (Ieee, 2014) Uslu, Baran; Tora, Hakan
    In this study, voiced-unvoiced classification performance of Turkish sounds using skewness and kurtosis is examined. The analyses show that higher order cumulants can be employed as a feature in voiced-unvoiced classification that is vital in speech processing applications. Furthermore, it has been shown that cumulants are also useful for identifying voiced and unvoiced segments in noisy speech signals.
  • Conference Object
    Segmentation of Isolated Words Into Voiced-Unvoiced Sound Components by Kurtosis
    (Ieee, 2015) Uslu, Baran; Tora, Hakan
    This study presents a new approach to the segmentation of isolated words into their voiced/unvoiced parts. It is well known that voiced/unvoiced discrimination has an important role in speech synthesis and coding applications. The offered method makes this discrimination using the kurtosis values of the words. The performance of the proposed approach was tested on Turkish digit recordings from zero to nine. It has been observed that this approach segments the parts successfully in not only clean speech but also in noisy speech.
  • Conference Object
    Citation - WoS: 1
    Recognition of Hand-Sketched Digital Logic Gates
    (Ieee, 2015) Gul, Nuray; Tora, Hakan
    Hand-Sketched circuit recognition is a very useful tool in engineering area. Because most of the engineers prefer to design their circuits on the paper firstly. So, this can cause time wasting and some mistakes. In this study, which is based on the solving these kinds of problems, classification and recognition of the handwritten digital logic gates according to their complex and scalar FDs (Fourier Descriptors) is presented. Test results are obtained as 84.3 % accuracy rate for complex FDs, 98.6 % for scalar FDs. Then these results are compared and decided the optimum FDs type for this study.
  • Conference Object
    An Approach for Perceptual Similarity Detection Between Audios Independent of Genre Via Metadata Extraction and Correlation
    (Ieee, 2007) Komsu, Fatma; Tora, Hakan; Oeztoprak, Kasim; Tora, Hakan; Tora, Hakan; Airframe and Powerplant Maintenance; Airframe and Powerplant Maintenance
    This study presents an approach for perceptual similarity detection between audios independent of genre. The study is formed of three phases; signal pre-processing as the first phase, metadata extraction via various perceptually compatible features as the second phase, and correlation methodology for similarity identification as the third phase. The performance and relative importance of the selected features for perceptual similarity analysis are presented, as testing results. Moreover, relative importance of preprocessing is introduced. Using the proposed methodology, perceptual similarity detection between genre independent audios is achieved with a 96.85% performance. Contribution highly lies on the independency of genre.
  • Conference Object
    Performance Evaluation of Self Organizing Neural Networks for Clustering in Esm Systems
    (Ieee, 2014) Gencol, Kenan; Tora, Hakan
    Electronic Support Measures (ESM) system is an important function of electronic warfare which provides the real time projection of radar activities. Such systems may encounter with very high density pulse sequences and it is the main task of an ESM system to deinterleave these mixed pulse trains with high accuracy and minimum computation time. These systems heavily depend on time of arrival analysis and need efficient clustering algorithms to assist deinterleaving process in modern evolving environments. On the other hand, self organizing neural networks stand very promising for this type of radar pulse clustering. In this study, performances of self organizing neural networks that meet such clustering criteria are evaluated in detail and the results are presented.
  • Conference Object
    Citation - WoS: 1
    Lip Shape Based Emotion Identification
    (Ieee, 2016) Gul, Nuray; Tora, Hakan
    Emotion recognition systems have an important role to play in the human-computer interactive applications (HCI). These systems are using facial features of face images and they are verifying or identifying the emotions. In this study, emotion identification algorithms are improved by using just mouth region features of a face. Region of interest (mouth region) is detected by Viola-Jones algorithms from video frames which are including different emotional face expressions. Outer boundaries of lip shapes are extracted by manually and calculated the scalar Fourier Descriptors (FDs) of the boundaries. Classification and recognition of the emotions is presented according to scalar FDs of lip contours. Test results are obtained as 93.9 % accuracy rate for scalar FDs.
  • Conference Object
    Effect of Secret Image Transformation on the Steganography Process
    (Ieee, 2017) Buker, Mohamed; Tora, Hakan; Gokcay, Erhan
    Steganography is the art of hiding information in something else. It is favorable over encryption because encryption only hides the meaning of the information; whereas steganography hides the existence of the information. The existence of a hidden image decreases Peak Signal to Noise Ratio (PSNR) and increases Mean Square Error (MSE) values of the stego image. We propose an approach to improve PSNR and MSE values in stego images. In this method a transformation is applied to the secret image, concealed within another image, before embedding into the cover image. The effect of the transformation is tested with Least Significant Bit (LSB) insertion and Discrete Cosine Transformation (DCT) techniques. MSE and PSNR are calculated for both techniques with and without transformation. Results show a better MSE and PSNR values when a transformation is applied for LSB technique but no significant difference was shown in DCT technique.