|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑 ( V1 g/ c! c+ B" B3 [
煮酒正熟 发表于 2013-12-20 12:05 7 x( ]2 W3 a4 ?+ _+ }) o
基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... & O: h1 `5 \. ?. X) k0 Q1 T2 L
1 d& }4 }' ?" x! U! O
这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。
* G; u& h1 z3 [. a) y) H! C% l3 C% ?' R8 B* v0 N& O4 V% A1 v
结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)' W+ A- k9 I5 N: w- a
) F. O' G- Q/ G. g& o6 ^2 F
R example:
' j8 B) N2 S4 `
; x3 H4 u W' ]# d) _' F> M<-as.table(rbind(c(1668,5173),c(287,930)))
1 w- P/ d) h8 W- d2 a> chisq.test(M)( u @ c8 g2 E$ `3 W; W8 x' O
8 v" q( _" O( X! t$ g Q
Pearson's Chi-squared test with Yates' continuity correction
+ T4 c" u4 E. v/ C/ j k a* s& O
. l* L3 \7 W+ gdata: M2 k" w- m! g) `, g8 s+ D
X-squared = 0.3175, df = 1, p-value = 0.5731! a" A6 E$ p& h1 \3 h
T7 x/ k7 p% n
Python example:
7 W7 K2 v- ^0 M
5 l4 E4 T; |* D' Z. X; ]2 c>>> from scipy import stats
0 l! k% M7 ]8 M x9 H# |! k8 Z>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])8 B: |/ m4 `! h }/ T
(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],
; {) ~9 |7 x5 Z2 [% c. l [ 295.26371308, 921.73628692]])) |
|