|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑 % D* C; n" F; J6 @& c) ^
煮酒正熟 发表于 2013-12-20 12:05 ![]()
6 {. U6 H* r) D" R基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... 5 z! p1 T) t5 ^9 }/ Q
5 d- Y4 u: }1 u+ t: s$ G
这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。 7 `" b) P* U- {2 j- D; v
3 j4 R5 L0 ?3 W9 c. b2 J6 U结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)3 y. j, o& }! d0 Q8 X3 b% z
: Y: r- i7 i8 l6 d
R example:: {. r v* Z4 M' G$ Y0 {! {
3 s3 Q2 x7 a( t8 I, W
> M<-as.table(rbind(c(1668,5173),c(287,930)))
5 {2 C1 U6 e4 O. F> chisq.test(M)6 P3 [# G8 X5 e E5 H
! }% K6 D' v1 g2 T6 ~ r, z }& m Pearson's Chi-squared test with Yates' continuity correction$ }8 b' ?% T& Z: H4 ]
1 ?% y, `/ Z: B( x. ]data: M# }8 d' _( D! c! F
X-squared = 0.3175, df = 1, p-value = 0.5731
$ Q! }, V2 {' G) k- k" s
2 g) h$ r/ `6 Y, d- [( Q. vPython example:; }! |7 h* U, ]% Q0 _2 q9 L
; s, i- T2 |% `1 k6 p
>>> from scipy import stats* R$ n* ]2 J4 h" F
>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])$ N" k) C9 V2 x2 y. q
(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],2 B W2 Z: w1 H% Q( t6 N
[ 295.26371308, 921.73628692]])) |
|