GCRIS :: Search

Search Results

Now showing 1 - 10 of 18

Citation - Scopus: 4
Convolution Neural Network (cnn) Based Automatic Sorting of Cherries
(Institute of Electrical and Electronics Engineers Inc., 2021) Park,H.; Khan,M.U.
Cherries are spring fruits enriched with nutrients, and are easily available in food markets around the world. Due to their excess demand, many enterprises solely focused on their processing. Cherries are especially susceptible to pathological-, physiological-diseases and structural degradation due to their soft outer skin. The post-harvest life of the fruit is limited by various characteristics. The agricultural industry has also been at the forefront to get benefits from the advanced machine learning tools. This study presents an image processing-based system for sorting cherries using the convolutional neural network (CNN). For this study, Prunus avium L cherries of export quality, available in Turkey, tagged as ‘0900 Ziraat’, are used. Surprisingly, there exists no dataset for these cherries; hence, we developed our dataset. Through the proposed approach based upon U-Net, the binary classification accuracy of 99% is achieved. Clear identification is demonstrated by the test results of varying mixture ratios of good and bad cherries. It can therefore be said that for cherry sorting and grading, U-Net can be applied as a reliable and promising machine learning tool. ©2021 IEEE
Makine Öğrenmesi Algoritmalarının Oyun Seviyelerinin Zorluklarının Belirlenmesinde Kullanılması
(2020) Şahbenderoğlu, Turan Ozan; Karakaya, Ziya
In game design, adjusting difficulty is one of the key aspects of financial success. However, this task is costly since it is time-consuming. In the literature, there are very limited studies according to determining the game difficulty. Instead, almost every study is about difficulty adjustment which skips the determining process. This thesis aims to develop a game environment to observe if the machine learning can determine the difficulty of a game and the game levels. For this purpose, a game with five different levels from easy to hard is developed in Unity Engine. A machine learning agent that uses reinforcement learning is also developed and each game level used as learning environment of the agent. In general, the learning process shows that the Cumulative Reward of the agents is decreased as levels become harder. The complexity of the game significantly decreases Cumulative Rewards. The results of this thesis have shown that those level difficulties of a game can be determined by comparing the reinforcement learning agent's performance on collecting rewards in the training area. In other words, machine learning algorithms have a big potential to support the game design phase of the game development process when it comes to determining the level of difficulties.
Citation - WoS: 2
Citation - Scopus: 3
Modeling of Kappa Factor Using Multivariate Adaptive Regression Splines: Application To the Western Türkiye Ground Motion Dataset
(Springer, 2024) Kurtulmus, Tevfik Ozgur; Yerlikaya-Ozkurt, Fatma; Askan, Aysegul
The recent seismic activity on Turkiye's west coast, especially in the Aegean Sea region, shows that this region requires further attention. The region has significant seismic hazards because of its location in an active tectonic regime of North-South extension with multiple basin structures on soft soil deposits. Recently, despite being 70 km from the earthquake source, the Samos event (with a moment magnitude of 7.0 on October 30, 2020) caused significant localized damage and collapse in the Izmir city center due to a combination of basin effects and structural susceptibility. Despite this activity, research on site characterization and site response modeling, such as local velocity models and kappa estimates, remains sparse in this region. Kappa values display regional characteristics, necessitating the use of local kappa estimations from previous earthquake data in region-specific applications. Kappa estimates are multivariate and incorporate several characteristics such as magnitude and distance. In this study, we assess and predict the trend in mean kappa values using three-component strong-ground motion data from accelerometer sites with known VS30 values throughout western Turkiye. Multiple linear regression (MLR) and multivariate adaptive regression splines (MARS) were used to build the prediction models. The effects of epicentral distance Repi, magnitude Mw, and site class (VS30) were investigated, and the contributions of each parameter were examined using a large dataset containing recent seismic activity. The models were evaluated using well-known statistical accuracy criteria for kappa assessment. In all performance measures, the MARS model outperforms the MLR model across the selected sites.
Citation - WoS: 6
Citation - Scopus: 11
Beyond Rouge: a Comprehensive Evaluation Metric for Abstractive Summarization Leveraging Similarity, Entailment, and Acceptability
(World Scientific Publ Co Pte Ltd, 2024) Briman, Mohammed Khalid Hilmi; Yıldız, Beytullah; Yildiz, Beytullah; Yıldız, Beytullah
A vast amount of textual information on the internet has amplified the importance of text summarization models. Abstractive summarization generates original words and sentences that may not exist in the source document to be summarized. Such abstractive models may suffer from shortcomings such as linguistic acceptability and hallucinations. Recall-Oriented Understudy for Gisting Evaluation (ROUGE) is a metric commonly used to evaluate abstractive summarization models. However, due to its n-gram-based approach, it ignores several critical linguistic aspects. In this work, we propose Similarity, Entailment, and Acceptability Score (SEAScore), an automatic evaluation metric for evaluating abstractive text summarization models using the power of state-of-the-art pre-trained language models. SEAScore comprises three language models (LMs) that extract meaningful linguistic features from candidate and reference summaries and a weighted sum aggregator that computes an evaluation score. Experimental results show that our LM-based SEAScore metric correlates better with human judgment than standard evaluation metrics such as ROUGE-N and BERTScore.
Citation - WoS: 11
Citation - Scopus: 17
Student Engagement Research Trends of Past 10 Years: a Machine Learning-Based Analysis of 42,000 Research Articles
(Springer, 2023) Gurcan, Fatih; Erdogdu, Fatih; Cagiltay, Nergiz Ercil; Cagiltay, Kursat
Student engagement is critical for both academic achievement and learner satisfaction because it promotes successful learning outcomes. Despite its importance in various learning environments, research into the trends and themes of student engagement is scarce. In this regard, topic modeling, a machine learning technique, allows for the analysis of large amounts of content in any field. Thus, topic modeling provides a systematic methodology for identifying research themes, trends, and application areas in a comprehensive framework. In the literature, there is a lack of topic modeling-based studies that analyze the holistic landscape of student engagement research. Such research is important for identifying wide-ranging topics and trends in the field and guiding researchers and educators. Therefore, this study aimed to analyze student engagement research using a topic modeling approach and to reveal research interests and trends with their temporal development, thereby addressing a lack of research in this area. To this end, this study analyzed 42,517 peer-reviewed journal articles published from 2010 to 2019 using machine learning techniques. According to our findings, two new dimensions, "Community Engagement" and "School Engagement", were identified in addition to the existing ones. It is also envisaged that the next period of research and applications in student engagement will focus on the motivation-oriented tools and methods, dimensions of student engagement, such as social and behavioral engagement, and specific learning contexts such as English as a Foreign Language "EFL" and Science, Technology, Engineering and Math "STEM".
Makine Öğrenmesinde, Farklı Veri Temizleme Tekniklerlerinin Sonuç Ölçevleri Üzerindeki Etkisinin İncelenmesi
(2022) Abbas, Israa Mustafa; Toker, Sacip
E-ticaret platformları ve çevrimiçi uygulamalar nedeniyle verilerin katlanarak büyümesi, veri analizi ve işlenmesi için büyük bir zorluk yarattı. Artık internetteki e-ticaret sitelerinin müşterilerinin satın aldıkları ürünler hakkında yorum yazmalarını sağlamak sık kullanılan bir uygulamadır. Bu incelemeler, bu ürünler hakkında değerli bilgi kaynakları sağlar. Bir ürün incelemesi, tüm çevrimiçi ürün şirketlerinde kullanılan duygusal analiz için önemli bir veri kaynağı içerir. Bu büyük miktarda veri etkisi büyük bir zorluk yaratır. Ancak, bu veri kümelerinin farklı veri sorunları vardır. Çoğu durumda, genellikle veriler yayınlanmadan önce çeşitli veri madenciliği teknikleri kullanılır. Mekansal olarak, görünmeyen verileri tahmin etmek için geçmiş ve etiketlenmiş veriler üzerinde eğitilen denetimli makine öğrenimi modellerinde, modelin daha önce öğrenmediği veriler. Bu tezde ayrıca makine öğrenmesinde deney çalışması tasarımına odaklandık. [1]. Bir sebep-sonuç ilişkisi bulmak için düzenli olarak Ronald Fisher'ın teorilerini [2] uygularız. Bu deneysel çalışma tasarımını uygulamak için, doğal dil işleme (NLP) yaklaşımı olan duygusal analiz ile denetimli makine öğrenmesi sınıflandırma algoritmalarını seçtik. Kuruluşların bir ürün veya hizmet hakkındaki görüşleri tanımlaması ve kategorilere ayırması için ortak bir yol. Duyguları ve öznel bilgileri elde etmek için metin madenciliği yapmak için veri madenciliği, makine öğrenimi ve yapay zeka kullanmayı içerir [3]. Bu çalışma, beş deney grubunun (yinelenen veri, noktalama işaretleri, durdurma sözcükleri, limmatezr, TF-IDF transform) etkisini analiz etmek ve bunları bir kontrol grubuyla karşılaştırmak (veri temizleme işlemi yapılmamış) için Multinominal Naïve Bays, Random Forest ve Lojistik Regresyon ile kurulmuştur. Uygulamalı. Deney grubunun üç modelin verimliliğine ve sınıflandırma oranına etkisini belirlemek ve ilginç gözlemleri açıklamak. Simülasyonlar, yirmi dört farklı kategoriden Amazon Product Review veri kümesinden rastgele seçilen 353 proje üzerinde çalıştırıldı. Böylece, veri seti Amazon.com'dan McAuley ve Leskovec [4][5] tarafından toplanmıştır. Metrik veri seti toplandıktan sonra analiz için SPSS yazılımı kullanılmıştır. Bu araştırma sorusunu ve kullanılan ölçeğin tanımlayıcı istatistiklerini incelemek için tekrarlı ölçüm ANOVA yapılmıştır. Analizin sonucu, veri temizlemenin makine öğrenimi modellerinin performansı üzerinde farklı bir etkisinin olduğunu göstermektedir. Aynı durumlarda rasgele ormanda olumlu, çok taraflı naif koylarda ve lojistik regresyonda olumsuz etkilenir. Diğer durumlarda, hiçbir etkisi olmadı. Genel olarak, deneysel sonuçlar Random Forest sınıflandırıcısının, Multinominal Naive Bayes sınıflandırıcısına ve Logistic Regression sınıflandırıcısına göre veri temizlemeye daha duyarlı olduğunu ve iki algoritmanın da temiz olmayan veri setinde yüksek bir sınıflandırma puanı elde ettiğini göstermiştir. Ayrıca, deney sonuçları, veri sorunları davranışının makine öğrenimi modelinde farklılık gösterdiğini gösterdi. Tüm makine öğrenimi algoritmalarında veri kalitesi sorunlarını alakasız veriler olarak kabul edemeyiz.
Kolektif Derin Öğrenme ve Transfer Öğrenme Yoluyla Mahsul ve Meyve Sınıflandırması
(2023) Daşkın, Zeynep Dilan; Khan, Muhammad Umer
Son yıllarda yapılan teknolojik gelişmeler, tarım sektörünü hızlı bir şekilde yeniden şekillendirerek, daha önce kullanılmakta olan geleneksel metotlarda devrim yaratıyor ve insanlık için daha sürdürülebilir ve üretken bir geleceğin yolunu açıyor. Tarım sektörü, makine öğrenmesi, sensor ve mekanizmaları kullanarak otomatikleşirken, aynı zamanda verimlilik artımı, kaynak yönetimi ve mahsul sağlığı açısından da köklü bir değişim yaşıyor. Bu tez çalışmasında, son teknoloji makine öğrenimi tabanlı görüntü işleme teknikleri ve algoritmaları kapsamlı bir şekilde araştırılmış ve analiz edilmiştir. Amaç, çeşitli mahsulleri, meyveleri ve sebzeleri doğru bir şekilde tespit eden, tanımlayan ve sınıflandıran sağlam metodolojiler geliştirmektir. Nihai hedef, gelişen tarımsal otomasyon endüstrisine önemli ölçüde katkıda bulunmak, süreçleri kolaylaştırmak ve tarım sektöründe verimliliği arttırmaktır. Gerçek yaşam koşullarına en yakın sonuçları elde etmek için, yazarların kendi oluşturduğu mahsul veri kümeleri bu araştırma boyunca önerilen algoritmalara entegre edilmiş ve kullanılmıştır. Bu çalışma sırasında kullanılan mevcut Evrişimsel Sinir Ağları algoritmaları AlexNet, GoogleNet, InceptionV3, SqueezeNet, DenseNet ve VGG-16'dır. Bu çalışmada, doğruluk, kayıp, F1-skoru, tahmin, kesinlik ve duyarlılık olmak üzere genel değerlendirme metriklerinin performansını yükseltmek için çeşitli gelişmiş algoritmalar araştırılmış ve incelenmiştir. Özellikle, tarımsal otomasyon sisteminin etkinliğini ve güvenilirliğini arttırmayı amaçlayan Kolektif Öğrenme ve Öğrenme Aktarımı adlı iki metot tanıtılmış ve kapsamlı bir şekilde analiz edilmiştir. Çalışmanın sonuçları, önerilen algoritmaların tarım endüstrisinde son derece etkili olduğunu ve istenen sonuçları benzersiz bir doğruluk ve hassasiyetle sunma yeteneklerini kanıtlamaktadır. Bu sonuçlar, bu algoritmaların operasyonel verimliliği önemli ölçüde arttırma, kaynak tahsisini optimize etme ve gelecek için tarımsal otomasyonda sürdürülebilir uygulamaları teşvik etme gibi potansiyellerini de göstermekte ve teyit etmektedir.
Citation - WoS: 16
Citation - Scopus: 19
Enhancement of Quality and Quantity of Woody Biomass Produced in Forests Using Machine Learning Algorithms
(Pergamon-elsevier Science Ltd, 2023) Peng, Wei; Sadaghiani, Omid Karimi
Forest is considered a significant source of woody biomass production. Sustainable production of wood, lower emittance of CO2 from burning, and lower amount of sulfur and heavy metals are the advantages of wood rather than fossil fuels. The quality and quantity of woody biomass production are a function of some operations including genetic modifications, high-quality forestry, evaluation, monitoring, storage, and transportation. Due to surveying numerous related works, it was found that there is a considerable reviewing gap in analyzing and collecting the applications of Machine Learning in the quality and quantity of woody biomass. To fill this gap in the current work, the above-mentioned operations are explained followed by the applications of Machine Learning algorithms. Conclusively, Machine Learning and Deep Learning can be employed in estimating main effective factors on trees growth, classification of seeds, trees, and regions, as well as providing decision-making tools for farmers or governors, evaluation of biomass, understanding the relation between the woody bimass internal structure and bio-fuel production, the ultimate and proximate analyses, prediction of wood contents and dimensions, determination of the proportion of mixed woody materials, monitoring for early disease identifi-cation and classification, classifying trees diseases, estimating evapotranspiration, collecting information about forest regions and its quality, nitrogen concentration in trees, choosing viable storage sites for storage depots and improving the solution, classifying different filling levels in silage, estimating acetic acid synthesis and aerobic reactions in silage, determining crop quantity in silo, estimating the methane production, and monitoring and predicting water content, quality and quantity of stored biomass, forecasting the demand, path way and on-time performance predicting, truck traffic predicting, and behavioral analysis and facility planning.
Citation - WoS: 12
Citation - Scopus: 20
A Hybrid Approach for Predicting Customers' Individual Purchase Behavior
(Emerald Group Publishing Ltd, 2017) Peker, Serhat; Kocyigit, Altan; Eren, P. Erhan
Purpose - Predicting customers' purchase behaviors is a challenging task. The literature has introduced the individual-level and the segment-based predictive modeling approaches for this purpose. Each method has its own advantages and drawbacks, and performs in certain cases. The purpose of this paper is to propose a hybrid approach which predicts customers' individual purchase behaviors and reduces the limitations of these two methods by combining the advantages of them. Design/methodology/approach - The proposed hybrid approach is established based on individual-level and segment-based approaches and utilizes the historical transactional data and predictive algorithms to generate predictions. The effectiveness of the proposed approach is experimentally evaluated in the domain of supermarket shopping by using real-world data and using five popular machine learning classification algorithms including logistic regression, decision trees, support vector machines, neural networks and random forests. Findings - A comparison of results shows that the proposed hybrid approach substantially outperforms the individual-level and the segment-based approaches in terms of prediction coverage while maintaining roughly comparable prediction accuracy to the individual-level method. Moreover, the experimental results demonstrate that logistic regression performs better than the other classifiers in predicting customer purchase behavior. Practical implications - The study concludes that the proposed approach would be beneficial for enterprises in terms of designing customized services and one-to-one marketing strategies. Originality/value - This study is the first attempt to adopt a hybrid approach combining individual-level and segment-based approaches to predict customers' individual purchase behaviors.
Citation - WoS: 3
Citation - Scopus: 5
Real-Time Anomaly Detection System Within the Scope of Smart Factories
(Springer, 2023) Bayraktar, Cihan; Karakaya, Ziya; Gokcen, Hadi
Anomaly detection is the process of identifying patterns that move differently from normal in a certain order. This process is considered one of the necessary measures for the safety of intelligent production systems. This study proposes a real-time anomaly detection system capable of using and analyzing data in smart production systems consisting of interconnected devices. Synthetic data were preferred in the study because it has difficulties such as high cost and a long time to obtain real anomaly data naturally for learning and testing processes. In order to obtain the necessary synthetic data, a simulation was developed by taking the popcorn production systems as an example. Multi-class anomalies were defined in the obtained data set, and the analysis performances were tested by creating learning models with AutoML libraries. In the field of production systems, while studies on anomaly detection generally focus on whether there is an anomaly in the system, it is aimed to determine which type of anomaly occurs in which device, together with the detection of anomaly by using multi-class tags in the data of this study. As a result of the tests, the Auto-Sklearn library presented the learning models with the highest performance on all data sets. As a result of the study, a real-time anomaly detection system was developed on dynamic data by using the obtained learning models.

Filters

Settings

Sort By

Results per page

Search Results