Search Results

  • Conference Object
    Producing Synthetic Speech From Turkish Text Via a Single Sound Synthesizer IC
    (2010) Tora,H.; Cengizler,Ç.
    In this study, new speech sounds were created for Turkish letters from the allophones listed in the SpeakJet Magnevation complex sound synthesizer IC, which is intended for English, by selecting and pairing the most similar phonemes. Consequently, a Turkish text-to-speech synthesizer was implemented with a minimal set of external components and no physical modification to the IC. Pairing based on SAMPA showed that all Turkish phonemes can be obtained from English allophones with high compatibility. The text-to-speech and pairing algorithm presented here was developed to be embedded in a microcontroller and can vocalize arbitrary Turkish text with the SpeakJet IC.
  • Conference Object
    Citation - Scopus: 2
    Recognition of Hand-Sketched Digital Logic Gates
    (Institute of Electrical and Electronics Engineers Inc., 2015) Gül,N.; Tora,H.
    Hand-sketched circuit recognition is a very useful tool in engineering, because most engineers prefer to design their circuits on paper first, which can waste time and introduce mistakes. To address these problems, this study presents the classification and recognition of handwritten digital logic gates according to their complex and scalar Fourier descriptors (FDs). Test results show an accuracy of 84.3% for complex FDs and 98.6% for scalar FDs. These results are then compared to determine the optimum FD type for this study. © 2015 IEEE.
  • Conference Object
    Citation - Scopus: 2
    Real Time Infrared Image Enhancement
    (2012) Akdeniz,N.; Tora,H.
    This study evaluates the implementation of Balanced Contrast Limited Adaptive Histogram Equalization (BCLAHE) for infrared (IR) images on an embedded platform. The aim was to achieve real-time performance for the operator display target application. The system configured for this aim is the OMAP3530, a dual-processor media application device that consists of an ARM and a DSP processor. The system is configured so that hardware resources are used efficiently, and various performance improvement techniques are investigated. Performance analysis is carried out on IR images with different dynamic ranges. © 2012 IEEE.
  • Conference Object
    An Approach for Perceptual Similarity Detection Between Audios Independent of Genre Via Metadata Extraction and Correlation
    (2007) Komsu,F.; Öztoprak,K.; Tora,H.
    This study presents an approach for perceptual similarity detection between audios independent of genre. The study consists of three phases: signal pre-processing, metadata extraction via various perceptually compatible features, and a correlation methodology for similarity identification. The performance and relative importance of the selected features for perceptual similarity analysis are presented as test results. The relative importance of pre-processing is also reported. Using the proposed methodology, perceptual similarity detection between genre-independent audios is achieved with 96.85% performance. The main contribution lies in the independence of genre.
  • Conference Object
    Citation - Scopus: 11
    Hand Gesture Classification Using Inertial Based Sensors Via a Neural Network
    (Institute of Electrical and Electronics Engineers Inc., 2017) Akan,E.; Tora,H.; Uslu,B.
    In this study, a mobile phone equipped with four types of sensors, namely accelerometer, gyroscope, magnetometer, and orientation, is used for gesture classification. Without feature selection, the raw data from the sensor outputs are processed and fed into a Multi-Layer Perceptron classifier for recognition. The user-independent, single-user-dependent, and multiple-user-dependent cases are all examined. Accuracy values of 91.66% for the single-user-dependent case, 87.48% for the multiple-user-dependent case, and 60% for the user-independent case are obtained. In addition, the performance of each sensor is assessed separately, and the highest performance is achieved with the orientation sensor. © 2017 IEEE.
  • Conference Object
    Citation - Scopus: 1
    Higher Order Statistical Analysis of Turkish Phones
    (IEEE Computer Society, 2014) Tora,H.; Uslu,B.
    In this study, histograms of Turkish phones were examined using higher order cumulants. As is known, the phones constituting a language consist of vowels and consonants. They can also be grouped as voiced and unvoiced phones. It is observed that unvoiced phones show a Gaussian-like distribution and result in small values of skewness and kurtosis. On the other hand, vowels and voiced consonants lead to a non-Gaussian distribution. Voiced and unvoiced phones are thus related to their skewness and kurtosis values. It is empirically shown that higher order cumulants are likely to be a useful feature for describing Turkish phones. © 2014 IEEE.
  • Conference Object
    Citation - Scopus: 3
    Emotion Classification Using Hidden Layer Outputs
    (2012) Günler,M.A.; Tora,H.
    A neural network (NN) of the Multi-Layer Perceptron (MLP) type is a supervised learning model composed of artificial neurons. A multilayer NN is capable of solving nonlinear classification problems such as emotion identification from facial expressions, which is presented in this paper. The hidden layer outputs of the NN provide useful information about facial appearance. This study shows that, without fully training the NN, the hidden layer outputs can be used as features. An acceptable recognition rate is obtained by means of the hidden layer outputs. © 2012 IEEE.
  • Conference Object
    Citation - Scopus: 1
    Effect of Secret Image Transformation on the Steganography Process
    (Institute of Electrical and Electronics Engineers Inc., 2017) Buke,M.; Tora,H.; Gokcay,E.
    Steganography is the art of hiding information in something else. It is preferable to encryption because encryption only hides the meaning of the information, whereas steganography hides the existence of the information. The existence of a hidden image decreases the Peak Signal to Noise Ratio (PSNR) and increases the Mean Square Error (MSE) of the stego image. We propose an approach to improve PSNR and MSE values in stego images. In this method, a transformation is applied to the secret image, which is to be concealed within another image, before it is embedded into the cover image. The effect of the transformation is tested with the Least Significant Bit (LSB) insertion and Discrete Cosine Transform (DCT) techniques. MSE and PSNR are calculated for both techniques with and without the transformation. Results show better MSE and PSNR values when a transformation is applied for the LSB technique, but no significant difference was observed for the DCT technique. © 2017 IEEE.
  • Conference Object
    Recognition of Characters on Vehicle License Plates
    (2010) Tora,H.; Bora,K.
    In this study, a simple and effective method is proposed for segmenting alphanumeric and numeric characters on vehicle license plates and recognizing the segmented characters. The proposed approach is based on the template matching technique. The features used for matching are obtained by scanning the segmented characters from left to right, right to left, top to bottom, and bottom to top. The features extracted in this way reveal how a character's shape changes along its four sides. Character recognition is accomplished using this information. Experiments show that successful results are obtained. © 2010 IEEE.
  • Conference Object
    Segmentation of Isolated Words Into Voiced-Unvoiced Sound Components by Kurtosis
    (Institute of Electrical and Electronics Engineers Inc., 2015) Uslu,B.; Tora,H.
    This study presents a new approach to the segmentation of isolated words into their voiced/unvoiced parts. It is well known that voiced/unvoiced discrimination plays an important role in speech synthesis and coding applications. The proposed method makes this discrimination using the kurtosis values of the words. The performance of the proposed approach was tested on Turkish digit recordings from zero to nine. It has been observed that this approach segments the parts successfully not only in clean speech but also in noisy speech. © 2015 IEEE.
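
The kurtosis-based voiced/unvoiced discrimination described in the last abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the frame length, the threshold, the function names `excess_kurtosis` and `label_frames`, and the synthetic test signal are all assumptions made for illustration. It relies only on the observation, stated in the abstracts above, that unvoiced sounds are Gaussian-like (excess kurtosis near zero) while voiced sounds are not.

```python
import math
import random


def excess_kurtosis(samples):
    """Fourth standardized moment minus 3; close to 0 for Gaussian data."""
    n = len(samples)
    mean = sum(samples) / n
    m2 = sum((x - mean) ** 2 for x in samples) / n
    m4 = sum((x - mean) ** 4 for x in samples) / n
    if m2 == 0.0:
        return 0.0  # constant (e.g. silent) frame: treat as non-informative
    return m4 / (m2 * m2) - 3.0


def label_frames(signal, frame_len=2000, threshold=1.0):
    """Label non-overlapping frames; frame_len and threshold are assumed values."""
    labels = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        k = excess_kurtosis(signal[start:start + frame_len])
        # Gaussian-like frame (|kurtosis| small) -> unvoiced, otherwise voiced.
        labels.append("unvoiced" if abs(k) < threshold else "voiced")
    return labels


# Synthetic stand-ins (assumptions): Gaussian noise for an unvoiced segment,
# a pure tone for a strongly non-Gaussian, periodic "voiced" segment.
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(4000)]
tone = [math.sin(2 * math.pi * i / 100) for i in range(4000)]
print(label_frames(noise + tone))
```

A pure tone is only a crude proxy for voiced speech; on real recordings the threshold and frame length would need to be tuned, and the sign and size of the kurtosis of voiced frames may differ from this toy case.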