|
|
本帖最后由 Menuett 于 2013-12-22 15:59 编辑
+ W( ^( M W3 r2 a1 X煮酒正熟 发表于 2013-12-20 12:05 ![]()
9 q/ M' K. F' K# Q基本可以说是显著的。总的来说,在商界做统计学分析,95%信心水平是用得最多的,当95%上不显著时,都会去 ... . C+ P. r0 m% M/ X! `
1 a; T/ O8 ]" V/ J+ a这个其实是一种binomial response,应该用Contigency Table或者Logisitic Regression(In case there are cofactors)来做。只记比率丢弃了Number of trial的信息(6841和1217个客户)。 " o. z9 ]+ p" I' p: v, X
0 o8 X# d K, [* i! J4 @& g* d
结果p=0.5731。 远远不显著。要在alpha level 0.05的水平上检验出76.42%和75.62%的区别,即使实验组和对照组各自样本大小相同,各自尚需44735个样本(At power level 80%)。see: Statistical Methods for Rates and Proportions by Joseph L. Fleiss (1981)% K& Y' P- E2 Q
2 e# B& u& ?6 Z0 H% T; t, ~R example:4 Y% N+ e% @2 ]' Y& A
: ~ |1 e: {3 J> M<-as.table(rbind(c(1668,5173),c(287,930)))$ I8 Z+ @& K" F- l r# J9 x
> chisq.test(M)
, H1 E7 G6 ?- s0 ~5 M' @& }: h, N4 ]% ~
Pearson's Chi-squared test with Yates' continuity correction8 z5 O% N, c: C& X; j- T% ~0 J
8 j" _( v! F0 cdata: M/ y7 C6 Z4 t/ D9 H8 D+ B; U) J, F. `. l
X-squared = 0.3175, df = 1, p-value = 0.5731* P! m* f5 V" ~; Q
- u. M- Q5 c6 P& S% R: ]3 zPython example:
& j9 Y3 R3 _( `
+ P; `. S7 I |; B- B/ b>>> from scipy import stats
2 e. X# J! n; P7 n! X- m3 K" \( C>>> stats.chi2_contingency([[6841-5173,5173],[1217-930,930]])6 S3 Z9 L7 X! U5 [1 W; A& N. v4 Z
(0.31748297614660292, 0.57312422493552839, 1, array([[ 1659.73628692, 5181.26371308],8 q4 C2 F# @" D: S9 I8 k8 X& J
[ 295.26371308, 921.73628692]])) |
|