Homework 7: t-tests and ANOVA

Objectives:

Explore null and alternate hypothesis concepts.
Explain relationship between outcomes of a test of hypothesis and interpretation of a diagnostic test result.
Evaluate error rates (Type I, Type II) and critical value, p-value.

Homework 7 expectations

Read through the entire homework before starting to answer a question. You are expected to have read the chapter and to have completed preceding homework. Answers are provided to odd numbered problems — turn in your work for even numbered problems.

How to work this homework

You may work together, but each of your must turn in your own report. Don’t “plagiarize” from each other. Do include in your report who you worked with.

What to turn in: A pdf file containing your R code, statistical results, and your answer to the questions. Use of RMarkdown recommended; however copy/paste into a word document is also acceptable.

Submit your work to CANVAS. Obey proper file naming formats.

Resources for this homework

Chapter 10. Mike’s Biostatistics Book

Chapter 12. Mike’s Biostatistics Book

Mike’s Workbook for Biostatistics: A quick look at R and R Commander, Part01 – Part10 and previous homework pages presented in this workbook.

Additional R commands and or code provided below.

Answers to selected problems

Questions

1. In an experiment, immortalized lung epithelial cells were exposed to dilute copper solutions for 30 minutes then washed with PBS. The comet assay was applied to these cells and for comparison, to cells without copper exposure but otherwise treated the same way (controls). Length of comet tails indicate DNA damage.

Make a box plot
Test for assumption of normality

Perform the t-test (not the Welch test), i.e., two-tailed hypothesis with assumption of equal variance.

Which cell group had the greater mean value, Copper-exposed or Control-exposed cells?
What are the assumptions necessary for you to use the independent sample t-test?
What does “two-sided” mean?
What was the null hypothesis?
Was this a one-tailed or two-tailed test of the null hypothesis?
What is the value of the test statistic?
How many degrees of freedom?
What is the critical value for this test?
What is the value of the lower limit of the 95% confidence interval?
What is the value of the lower limit of the 99% confidence interval?
True or False. If the null hypothesis is accepted, then zero is a value included in the 95% confidence interval.
Do you accept the null hypothesis? Explain your selection.

2. Microsoft Excel, LibreOffice Calc, and Google sheets spreadsheet software all include t-test functions and return the p-value. Consider two variables big (100, 110, 120, 100, 110, 210, 200) and small (0,1,1,2,0,1,0). (Note — these two groups are obviously very different, calculating a t-test on their difference is silly, just for this question.) If formatting is set to the default two decimal places for Number cell category, the p-value will return as “0.00.” How should you report the p-value in this case?

3. For the t-test, and in general for reporting of all statistical tests, what three numbers reported in the R output should you minimally report?

4. For the following abstract, please identify

Reference population? Type of sampling from population?
Type of study: Observational or experimental study?
Identify the names of the variables?
Identify data types for each variable
What is the main scientific hypothesis that was tested?
Identify the Treatment (Predictor) variables and the Outcome variables
Identify levels (groups) of Treatment or Predictor variables
What was(were) the sampling unit(s)?
How were subjects (sampling units) assigned to Treatment or Predictor variables?

Abstract01

Determining the costs of sexual ornaments is complicated by the fact that ornaments are often integrated with other, non-sexual traits, making it difficult to dissect the effect of ornaments independent of other aspects of the phenotype. Hybridization can produce reduced phenotypic integration, allowing one to evaluate performance across a broad range of multivariate trait values. Here we assess the relationship between morphology and performance in the swordtails Xiphophorus malinche and X. birchmanni, two naturally-hybridizing fish species that differ extensively in non-sexual as well as sexual traits. We took advantage of novel trait variation in hybrids to determine if sexual ornaments incur a cost in terms of locomotor ability. For both fast-start and endurance swimming, hybrids performed at least as well as the two parental species. The sexually-dimorphic sword did not impair swimming performance per se. Rather, the sword negatively affected performance only when paired with a sub-optimal body shape. Studies seeking to quantify the costs of ornaments should consider that covariance with non-sexual traits may create the spurious appearance of costs.

Citation: Johnson JB, Macedo DC, Passow CN, Rosenthal GG (2014) Sexual Ornaments, Body Morphology, and Swimming Performance in Naturally Hybridizing Swordtails (Teleostei: Xiphophorus). PLoS ONE 9(10): e109025. doi:10.1371/journal.pone.0109025

5. Test score on a school achievement test where students have received differing types of test preparation
(group A = individual instruction with tutor; group B = lecture; group C = computer). Example from Kirby 1993, p. 277.

Write out the null and alternate hypotheses.
Test assumption of normality
Conduct one-way ANOVA using GLM
Use and interpret appropriate posthoc test.

6. O’hia collected from three elevations, grown common garden. Height of plant recorded after several weeks of growth.

Write out the null and alternate hypotheses.
Test assumption of normality
Conduct one-way ANOVA using GLM
Use and interpret appropriate posthoc test.

R or Rcmdr commands

myData <- read.table(header=TRUE, sep="\t", text = "
insert your data table here
")

head(myData)

Test normality:

Rcmdr → Statistics → Summaries → Test for normality

(General) linear model:

Rcmdr → Statistics → Fit models → Linear model

Data

Comet data set (Dohm unpublished)

Treatment	CometTail
Control	17.856139
Control	16.52125
Control	14.925449
Control	14.029174
Control	13.332945
Control	8.811185
Control	14.701654
Control	9.261025
Control	21.779311
Control	6.180284
Control	9.201752
Control	5.54472
Control	6.717885
Control	2.625092
Control	7.191583
Control	5.392866
Control	11.284813
Control	15.441254
Control	17.857176
Control	4.250956
Copper	53.214287
Copper	38.92857
Copper	18.928572
Copper	30
Copper	28.928572
Copper	15.357142
Copper	17.857143
Copper	17.5
Copper	21.071428
Copper	29.285715
Copper	28.214285
Copper	16.785715
Copper	21.071428
Copper	37.5
Copper	38.214287
Copper	17.857143
Copper	29.642857
Copper	11.071428
Copper	35
Copper	49.285713

Kirby et al data set

Student	Instruction.method	test.score
1	A	94.4
2	A	75.7
3	A	88.1
4	A	108.1
5	A	94.8
6	A	130.6
7	A	121.1
8	A	82.9
9	A	112
10	A	85.2
11	A	98.7
12	A	50.1
13	A	86.1
14	A	99.8
15	A	121.8
16	B	95.3
17	B	117.2
18	B	97.9
19	B	82.7
20	B	105.3
21	B	85.2
22	B	86.7
23	B	104.8
24	B	67.9
25	B	106.1
26	C	98.1
27	C	101.2
28	C	120.1
29	C	77.5
30	C	124.7
31	C	136.1
32	C	132.6
33	C	130.5
34	C	130
35	C	138.8
36	C	105.8
37	C	70.4

Ohia

Site	Height
M-1	12.5567
M-1	13.2019
M-1	8.0699
M-1	6.0952
M-1	11.3879
M-1	12.2242
M-1	16.0147
M-1	19.7403
M-1	36.4824
M-1	13.1233
M-1	21.7725
M-1	14.2013
M-1	37.7629
M-1	2.8652
M-1	0.6456
M-1	29.623
M-1	10.5812
M-1	18.3046
M-1	19.0528
M-1	2.5693
M-2	45.0162
M-2	40.8404
M-2	27.1032
M-2	29.8036
M-2	63.8316
M-2	42.107
M-2	30.0322
M-2	34.0516
M-2	15.7664
M-2	35.1262
M-2	43.6988
M-2	26.7585
M-2	36.7895
M-2	30.9458
M-2	26.8465
M-2	40.3883
M-2	30.6555
M-2	19.9736
M-2	27.676
M-2	44.084
M-3	15.2646
M-3	19.6745
M-3	23.275
M-3	16.1161
M-3	16.8393
M-3	23.107
M-3	21.5322
M-3	13.4191
M-3	14.7273
M-3	18.4245

/MD