|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑 . p6 [; C. l* o$ f
煮酒正熟 发表于 2013-12-20 12:05 $ K4 W( g+ @% |% e( O$ @
基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... # ?; O: b' R+ y9 G) g4 M! _
( U3 W9 ? y; h+ [这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。 $ t4 B+ ?2 c% U! ~, a2 H+ f1 C
U4 g& y% [5 q% o9 r
结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)
6 j* f' |6 T- ` p& m# J I/ ?3 k& A x, Z& S/ m
R example:
$ D0 ~' o) c) n1 H
2 q* e$ t$ W( I- j7 `# F( X+ o> M<-as.table(rbind(c(1668,5173),c(287,930)))
2 G# v5 I: k1 _/ B# t) m( x> chisq.test(M)
0 e) W( i6 ~" A+ E% ^; h
/ y7 t. v: e/ l Pearson's Chi-squared test with Yates' continuity correction
& I6 u; M* W* Z- W# w6 F# C
1 m# W3 K9 i* @" bdata: M
9 C* J* S9 L$ r, h' C0 S% `X-squared = 0.3175, df = 1, p-value = 0.5731
& H/ w9 H5 j+ e
0 J" T2 d; J$ _3 T9 @, ?, ?6 qPython example:( Y3 Q, `; {* h" f. a8 f
. m# ~" L( l; d5 k>>> from scipy import stats7 |2 Z2 g7 F, V8 k$ d. l+ C5 y, r
>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])
% k- f- k$ u1 s: ^9 {(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],( u' }4 V ]& J- Q1 w& w5 N2 h
[ 295.26371308, 921.73628692]])) |
|