Kayıp Verilerin Tamamlanması için Bir Hibrit Model

Al-brge, Basma

Kayıp Verilerin Tamamlanması için Bir Hibrit Model

dc.contributor.advisor	Koyuncu, Murat
dc.contributor.author	Al-brge, Basma
dc.date.accessioned	2024-07-07T12:49:22Z
dc.date.available	2024-07-07T12:49:22Z
dc.date.issued	2019
dc.description.abstract	Eksik veriler neredeyse tüm ciddi istatistiksel analizlerde ortaya çıkmaktadır. İstatistiksel analizler, eksik verileri işlemek için, rastgele değerlendirme yaklaşımı gibi genellikle makul sonuçlar verebilecek bazı basit yaklaşımlar da dahil olmak üzere çeşitli yöntemlere sahiptir. Eksik veri değerlendirme süreci, doğru tamamlamalar yapabilmek için modellenmelidir. Veri setlerini ampirik uygulamalarda kullanmak bazı görevleri gerçekleştirmek için çok yaygındır, ancak veri setlerindeki eksik değerler veri setlerinden çıkarılmalı ya da veri madenciliğinin ön işleme aşamasında tahmin edilmelidir. Bu tezde, veri algılamasını iyileştirmek ve orijinal eksik değerlerle yüksek korelasyonlu veri üretmek için K-En Yakın Komşu (KNN) ile Tekil Değer Ayrıştırma (SVD) algoritmasını birleştiren bir karma yaklaşım kullanılmaktadır. Önerilen hibrit yöntemin test sonuçları, farklı kayıp değerlerin oranı için çeşitli alternatif yöntemlerin sonuçlarıyla karşılaştırılmış ve önerilen yöntemin performansı diğerlerinden daha iyi çıkmıştır. Ayrıca sonuçlar, önerilen modelin performansı hakkında bir fikir vermesi amacıyla literatürdeki raporlanan diğer sonuçlarla da karşılaştırılmıştır. Anahtar Kelimeler: Hibrit yaklaşım, Kayıp değerler, K-en yakın komşu, Tekil Değer Ayrışımı.
dc.description.abstract	Missing data arises in almost all serious statistical analyses. Statistical analyses have a variety of methods to handle missing data, including some relatively simple approaches that can often yield reasonable results such as the random imputation approach. The missing data imputation process must be modeled in order to perform imputations correctly. Using datasets in empirical applications is very common to perform some tasks; however, missing values in datasets should be extracted from the datasets or should be estimated before they are used for processing to produce correct association rules or clustering in the preprocessing stage of data mining and processing. In this thesis, a hybrid approach is used that combines K-Nearest Neighbor (KNN) with Singular Value Decomposition (SVD) algorithm to improve the data imputation and produce data with high correlation with original missing values. The test results of the proposed hybrid method are compared with the results of several alternative methods for different rate of missing values and the results of the proposed method yields better performance than the others. The results are also compared with the reported results in the literature to give an idea about its performance. Hybrid approach, Missing values, K-nearest Neighbour, Singular Value Decomposition.	en
dc.identifier.uri	https://hdl.handle.net/20.500.14411/5407
dc.language.iso	en
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Kayıp Verilerin Tamamlanması için Bir Hibrit Model
dc.title	A Hybrid Method for Missing Value Imputation	en_US
dc.type	Master Thesis
dspace.entity.type	Publication
gdc.coar.type	text::thesis::master thesis
gdc.description.department	Fen Bilimleri Enstitüsü / Bilgi Teknolojileri Ana Bilim Dalı
gdc.description.endpage	76
gdc.description.startpage	0
gdc.identifier.yoktezid	543640
gdc.virtual.author	Koyuncu, Murat
relation.isAuthorOfPublication	948643aa-7723-4c65-8da8-fcc884405cd1
relation.isAuthorOfPublication.latestForDiscovery	948643aa-7723-4c65-8da8-fcc884405cd1
relation.isOrgUnitOfPublication	cf0fb36c-0500-438e-b4cc-ad1d4ef25579
relation.isOrgUnitOfPublication	4abda634-67fd-417f-bee6-59c29fc99997
relation.isOrgUnitOfPublication	50be38c5-40c4-4d5f-b8e6-463e9514c6dd
relation.isOrgUnitOfPublication.latestForDiscovery	cf0fb36c-0500-438e-b4cc-ad1d4ef25579

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 543640 A hybrid method for missing value imputation.pdf
Size:: 4.05 MB
Format:: Adobe Portable Document Format

Download

Collections

Yüksek Lisans Tezleri