Show simple item record

dc.contributor.authorKursun, Olcay
dc.contributor.authorGumus, Ergun
dc.contributor.authorGörmez, Zeliha
dc.date.accessioned2021-03-03T15:39:59Z
dc.date.available2021-03-03T15:39:59Z
dc.identifier.citationGumus E., Görmez Z., Kursun O., "Multi objective SNP selection using pareto optimality", COMPUTATIONAL BIOLOGY AND CHEMISTRY, cilt.43, ss.23-28, 2013
dc.identifier.issn1476-9271
dc.identifier.othervv_1032021
dc.identifier.otherav_4022deae-bac4-4e2b-af69-0e937d7c4790
dc.identifier.urihttp://hdl.handle.net/20.500.12627/46886
dc.identifier.urihttps://doi.org/10.1016/j.compbiolchem.2012.12.006
dc.description.abstractBiomarker discovery is a challenging task of bioinformatics especially when targeting high dimensional problems such as SNP (single nucleotide polymorphism) datasets. Various types of feature selection methods can be applied to accomplish this task. Typically, using features versus class labels of samples in the training dataset, these methods aim at selecting feature subsets with maximal classification accuracies. Although finding such class-discriminative features is crucial, selection of relevant SNPs for maximizing other properties that exist in the nature of population genetics such as the correlation between genetic diversity and geographical distance of ethnic groups can also be equally important. In this work, a methodology using a multi objective optimization technique called Pareto Optimal is utilized for selecting SNP subsets offering both high classification accuracy and correlation between genomic and geographical distances. In this method, discriminatory power of an SNP is determined using mutual information and its contribution to the genomic-geographical correlation is estimated using its loadings on principal components. Combining these objectives, the proposed method identifies SNP subsets that can better discriminate ethnic groups than those obtained with sole mutual information and yield higher correlation than those obtained with sole principal components on the Human Genome Diversity Project (HGDP) SNP dataset. (C) 2012 Elsevier Ltd. All rights reserved.
dc.language.isoeng
dc.subjectMühendislik ve Teknoloji
dc.subjectBİLGİSAYAR BİLİMİ, İNTERDİSİPLİNER UYGULAMALAR
dc.subjectBİYOLOJİ
dc.subjectBiyoloji ve Biyokimya
dc.subjectTıp
dc.subjectYaşam Bilimleri (LIFE)
dc.subjectBilgisayar Bilimi
dc.subjectMühendislik, Bilişim ve Teknoloji (ENG)
dc.subjectSağlık Bilimleri
dc.subjectTemel Tıp Bilimleri
dc.subjectBiyokimya
dc.subjectTıbbi Biyoloji
dc.subjectBilgisayar Bilimleri
dc.subjectBilgisayar Grafiği
dc.subjectYaşam Bilimleri
dc.subjectTemel Bilimler
dc.titleMulti objective SNP selection using pareto optimality
dc.typeMakale
dc.relation.journalCOMPUTATIONAL BIOLOGY AND CHEMISTRY
dc.contributor.departmentİstanbul Üniversitesi , ,
dc.identifier.volume43
dc.identifier.startpage23
dc.identifier.endpage28
dc.contributor.firstauthorID74443


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record