|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑
V9 G8 u+ L6 H- h煮酒正熟 发表于 2013-12-20 12:05 ) x. V, z; h# x/ W5 C' E0 @
基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... 5 b7 e5 u" |, @4 S) U
* `8 E: C1 A8 n$ R9 |% U6 {$ [
这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。
! l' s$ d2 A; R$ i! L
, i3 V4 i, z+ @- H% k1 ^- p0 ^结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)" Q' M* `1 H" v" y
f0 g) _6 E; o p A' {) b
R example:* [1 g7 v. H- s- n; F% X3 _
; R9 t/ ]3 l4 n; b> M<-as.table(rbind(c(1668,5173),c(287,930)))
/ o6 q J! E5 d6 r% y z" L+ ]> chisq.test(M); h) k& m. E4 Q0 I% M" P0 K* D/ {$ E
; Q! `: v6 Q& b; a% r) g Pearson's Chi-squared test with Yates' continuity correction
. [/ M( t5 ]2 `4 i% P1 H& r7 E& w u& m) U
data: M
) Q7 U/ W4 c. j& HX-squared = 0.3175, df = 1, p-value = 0.5731. v& R+ F# Y0 Q' q& {& y
6 d2 N" Y+ o( e7 v: U7 ^- {; }7 mPython example:
3 Z7 B, @8 a( X$ S: e5 x# H. I6 ]3 `2 ^' x) T# Q0 H, c
>>> from scipy import stats
/ Q4 V% m w$ E6 Y6 H>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])
) e( ]* N* y# o( n7 X! [ k(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],
6 z0 g, C3 d: Y8 X6 m [ 295.26371308, 921.73628692]])) |
|