• Türkçe
    • English
  • Türkçe 
    • Türkçe
    • English
  • Giriş
Öğe Göster 
  •   Açık Erişim Ana Sayfası
  • Avesis
  • Dokümanı Olmayanlar
  • Makale
  • Öğe Göster
  •   Açık Erişim Ana Sayfası
  • Avesis
  • Dokümanı Olmayanlar
  • Makale
  • Öğe Göster
JavaScript is disabled for your browser. Some features of this site may not work without it.

A method for combining mutual information and canonical correlation analysis: Predictive Mutual Information and its use in feature selection

Tarih
2012
Yazar
Kursun, Olcay
Sakar, C. Okan
Üst veri
Tüm öğe kaydını göster
Özet
Feature selection is a critical step in many artificial intelligence and pattern recognition problems. Shannon's Mutual Information (MI) is a classical and widely used measure of dependence measure that serves as a good feature selection algorithm. However, as it is a measure of mutual information in average, under-sampled classes (rare events) can be overlooked by this measure, which can cause critical false negatives (missing a relevant feature very predictive of some rare but important classes). Shannon's mutual information requires a well sampled database, which is not typical of many fields of modern science (such as biomedical), in which there are limited number of samples to learn from, or at least, not all the classes of the target function (such as certain phenotypes in biomedical) are well-sampled. On the other hand, Kernel Canonical Correlation Analysis (KCCA) is a nonlinear correlation measure effectively used to detect independence but its use for feature selection or ranking is limited due to the fact that its formulation is not intended to measure the amount of information (entropy) of the dependence. In this paper, we propose a hybrid measure of relevance, Predictive Mutual Information (PMI) based on MI, which also accounts for predictability of signals from each other in its calculation as in KCCA. We show that PMI has more improved feature detection capability than MI, especially in catching suspicious coincidences that are rare but potentially important not only for experimental studies but also for building computational models. We demonstrate the usefulness of PM!, and superiority over MI, on both toy and real datasets. (C) 2011 Elsevier Ltd. All rights reserved.
Bağlantı
http://hdl.handle.net/20.500.12627/147920
https://doi.org/10.1016/j.eswa.2011.09.020
Koleksiyonlar
  • Makale [92796]

Creative Commons Lisansı

İstanbul Üniversitesi Akademik Arşiv Sistemi (ilgili içerikte aksi belirtilmediği sürece) Creative Commons Alıntı-GayriTicari-Türetilemez 4.0 Uluslararası Lisansı ile lisanslanmıştır.

DSpace software copyright © 2002-2016  DuraSpace
İletişim | Geri Bildirim
Theme by 
Atmire NV
 

 


Hakkımızda
Açık Erişim PolitikasıVeri Giriş Rehberleriİletişim
sherpa/romeo
Dergi Adı/ISSN || Yayıncı

Exact phrase only All keywords Any

BaşlıkbaşlayaniçerenISSN

Göz at

Tüm DSpaceBölümler & KoleksiyonlarTarihe GöreYazara GöreBaşlığa GöreKonuya GöreTürlere GöreBu KoleksiyonTarihe GöreYazara GöreBaşlığa GöreKonuya GöreTürlere Göre

Hesabım

GirişKayıt

Creative Commons Lisansı

İstanbul Üniversitesi Akademik Arşiv Sistemi (ilgili içerikte aksi belirtilmediği sürece) Creative Commons Alıntı-GayriTicari-Türetilemez 4.0 Uluslararası Lisansı ile lisanslanmıştır.

DSpace software copyright © 2002-2016  DuraSpace
İletişim | Geri Bildirim
Theme by 
Atmire NV