There were some additional questions about using Chi-square tests for decision trees. I have found this excellent tutorial on the web: http://www.ics.uci.edu/~welling/teaching/273ASpring10/recitation4_decision_tree.pdf.
Some important point to keep in mind:
In class we only looked at one value out of two for the attribute, but in case there are more values it is easiest to simply compute the chi-square statistic for all values of the label, Y, and for all values of the attribute F. So, the statistic is now a double sum, one over Y-values and another one over possible F-values. It is important that you correct for this by selecting the correct number of degrees of freedom for the chi-square test, which is now given by: dof=(|F|-1)x(|Y|-1). For Y=2 and F=2, as before, we have dof=1. However, if one feature has many more F-values, it will need more dofs in the chi-square test and hence it will be automatically more penalized for having many choices. So, after you compute chi^2 using this double sum, you first check which feature has the smallest p-value. Then for the feature with the smallest p-value you ask if the null hypothesis is rejected (are the observed counts significantly different than what can be expected from random fluctuations around the expected values?). One usually rejects the null hypothesis for p<0.05. When you reject, you do not add any feature to the tree.
Tuesday, April 20, 2010
Subscribe to:
Post Comments (Atom)
well explained .Keep updating Artificial intelligence Online Trining
ReplyDeleteThanks for the informative article. This is one of the best resources I have found in quite some time. Nicely written and great info. I really cannot thank you enough for sharing.
ReplyDeleteDigital Marketing Training in Chennai
Digital Marketing Course in Chennai
FON PERDE MODELLERİ
ReplyDeleteSms Onay
Vodafone Mobil Ödeme Bozdurma
nft nasıl alınır
ankara evden eve nakliyat
trafik sigortası
DEDEKTOR
Kurma Website
aşk kitapları
beykoz vestel klima servisi
ReplyDeletekadıköy daikin klima servisi
kartal toshiba klima servisi
tuzla lg klima servisi
ataşehir arçelik klima servisi
maltepe samsung klima servisi
pendik vestel klima servisi
üsküdar toshiba klima servisi
beykoz beko klima servisi
en son çıkan perde modelleri
ReplyDeleteminecraft premium
yurtdışı kargo
özel ambulans
uc satın al
en son çıkan perde modelleri
lisans satın al
nft nasıl alınır