References

Adichie, J.N., 1975. On the use of ranks for testing the coincidence of several regression lines. Annals of Statistics, 3: 521-527.

Agresti, A., 2002. Categorical data analysis, 2nd ed. Hoboken: Wiley.

Agresti, A., 2007. An introduction to categorical data analysis, 2nd ed. Hoboken: Wiley.

Armitage, P., 1955. Tests for linear trends in proportions and frequencies. Biometrics, 11: 375-386.

Baichwal, V.R., & Sugden, B., 1989. The multiple membrane-spanning segments of the BNLF-1 oncogene from Epstein-Barr virus are required for transformation. Oncogene, 4: 67-74.

Barnard, G.A., 1947. Significance tests for 2 × 2 tables. Biometrika, 34: 123-138.

Benjamini, Y., & Hochberg, Y., 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B, 57: 289-300.

Bilger, A., Bennett, L.M., Carabeo, R.A., Chiaverotti, T.A., Dvorak, C., Liss, K.M., Schadewald, S.A., Pitot, H.C., & Drinkwater, N.R., 2004. A potent modifier of liver cancer risk on distal mouse chromosome 1: linkage analysis and characterization of congenic lines. Genetics, 167: 859-866.

Casagrande, J.T., Pike, M.C., & Smith, P.G., 1978. An improved approximate formula for calculating sample sizes for comparing two binomial distributions. Biometrics, 34: 483-486.

Churchill, G.A., & Doerge, R.W., 1994. Empirical threshold values for quantitative trait mapping. Genetics, 138: 963-971.

Cochran, W.G., 1954. Some methods for strengthening the common chi-square tests. Biometrics, 10: 417-451.

Conover, W.J., 1999. Practical nonparametric statistics, 3rd ed. New York: Wiley.

Conover, W.J., Johnson, M.E., & Johnson, M.M., 1981. A comparative study of tests for homogeneity of variances, with applications to the outer continental shelf bidding data. Technometrics, 23: 351-361.

Cox, D.R., & Oakes, D., 1984. Analysis of survival data. Boca Raton: Chapman & Hall/CRC.

Drinkwater, N.R., & Klotz, J.H., 1981. Statistical methods for the analysis of tumor multiplicity data. Cancer Research, 41: 113-119.

Dupont, W.D., 1988. Power calculations for matched case-control studies. Biometrics, 44: 1157-1168.

Efron, B., & Tibshirani, R.J., 1993. An introduction to the bootstrap. Boca Raton: Chapman & Hall/CRC.

Efron, B., 1979. Bootstrap methods: another look at the jackknife. Annals of Statistics, 7: 1-26.

Efron, B., 1981. Censored data and the bootstrap. Journal of the American Statistical Association, 76: 312-319.

Fang, Z., Du, R., & Cui, X., 2012. Uniform approximation is more appropriate for Wilcoxon rank-sum test in gene set analysis. PLoS One, 7: e31505.

Fisher, R.A., 1973. Statistical methods for research workers, 14th ed. New York: Hafner.

Hodges, J.L., Ramsey, P.H., & Wechsler, S., 1990. Improved significance probabilities of the Wilcoxon test. Journal of Educational Statistics, 15: 249-265.

Hollander, M., Wolfe, D.A., & Chicken, E., 2013. Nonparametric statistical methods, 3rd ed. New York: Wiley.

Jonckheere, A.R., 1954. A distribution-free k-sample test against ordered alternatives. Biometrika, 41: 133-145.

Kaplan, E.L., & Meier, P., 1958. Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53: 457-481.

Kempthorne, O., 1979. In dispraise of the exact test: reactions. Journal of Statistical Planning and Inference, 3: 199-213.

Kendall, M.G., 1938. A new measure of rank correlation. Biometrika, 30: 81-93.

Kruskal, W.H., & Wallis, W.A., 1952. Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 47: 583-621.

Lee, E.T., & Wang, J.W., 2003. Statistical methods for survival data analysis, 3rd ed. Hoboken: Wiley.

Lee, G.H., & Drinkwater, N.R., 1995. Hepatocarcinogenesis in BXH recombinant inbred strains of mice: analysis of diverse phenotypic effects of the hepatocarcinogen sensitivity loci. Molecular Carcinogenesis, 14: 190-197.

Lehmann, E.L., 1993. The Fisher, Neyman-Pearson theories of testing hypotheses: one theory or two? Journal of the American Statistical Association, 88: 1242-1249.

Lehmann, E.L., 1998. Nonparametrics: statistical methods based on ranks, 1st ed. revised. Upper Saddle River: Prentice Hall.

Lystig, T.C., 2003. Adjusted P values for genome-wide scans. Genetics, 164: 1683-1687.

Manly, B.F.J., 2001. Randomization, bootstrap, and Monte Carlo methods in biology, 2nd ed. Boca Raton: Chapman & Hall/CRC.

Mann, H.B., & Whitney, D.R., 1947. On a test of whether one of two random variables is stochastically larger than the other. Annals of Mathematical Statistics, 18: 50-60.

Mantel, N., & Haenszel, W., 1959. Statistical aspects of the analysis of data from retrospective studies of disease. Journal of the National Cancer Institute, 22: 719-748.

Martín Andrés, A., & Silva Mato, A., 1994. Choosing the optimal unconditioned test for comparing two independent proportions. Computational Statistics & Data Analysis, 17: 555-574.

Martín Andrés, A., Silva Mato, A., Tapia García, J.M., & Sánchez Quevedo, M.J., 2004. Comparing the asymptotic power of exact tests in 2 × 2 tables. Computational Statistics & Data Analysis, 47: 745-756.

McNemar, Q., 1947. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika, 12: 153-157.

Miller, R., 1968. Jackknifing variances. Annals of Mathematical Statistics, 39: 567-582.

Perneger, T.V., 1998. What's wrong with Bonferroni adjustments. British Medical Journal, 316: 1236-1238.

Peto, R., & Peto, J., 1972. Asymptotically efficient rank invariant test procedures. Journal of the Royal Statistical Society, Series A, 135: 185-207.

Radelet, M.L., & Pierce, G.L., 1991. Choosing those who will die: race and the death penalty in Florida. Florida Law Review, 43: 1-34.

Sen, P.K., 1969. On a class of rank order tests for the parallelism of several regression lines. Annals of Mathematical Statistics, 40: 1668-1683.

Smyth, G.K., 2004. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology, 3(1): Article 3.

Sokal, R.R., & Rohlf, F.J., 2012. Biometry: the principles and practice of statistics in biological research, 4th ed. New York: W.H. Freeman.

Spearman, C., 1904. The proof and measurement of association between two things. American Journal of Psychology, 15: 72-101.

Storer, B.E., & Kim, C., 1990. Exact properties of some exact test statistics for comparing two binomial proportions. Journal of the American Statistical Association, 85: 146-155.

Storey, J.D., 2002. A direct approach to false discovery rates. Journal of the Royal Statistical Society, Series B, 64: 479-498.

Storey, J.D., & Tibshirani, R., 2003. Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences, 100: 9440-9445.

Sugden, B., & Metzenberg, S., 1983. Characterization of an antigen whose cell surface expression is induced by infection with Epstein-Barr virus. Journal of Virology, 46: 800-807.

Suissa, S., & Shuster, J.J., 1985. Exact unconditional sample sizes for the 2 × 2 binomial trial. Journal of the Royal Statistical Society, Series A, 148: 317-327.

Tessier, C.R., Doyle, G.A., Clark, B.A., Pitot, H.C., & Ross, J., 2004. Mammary tumor induction in transgenic mice expressing an RNA-binding protein. Cancer Research, 64: 209-215.

Ury, H.K., 1976. A comparison of four procedures for multiple comparisons among means (pairwise contrasts) for arbitrary sample sizes. Technometrics, 18: 89-97.

Wilcoxon, F., 1945. Individual comparisons by ranking methods. Biometrics Bulletin, 1: 80-83.