1. Summarize the data using summary statistics.
2. Is there a relationship between person's S and C? Put together a 2x3 contingency table reflecting the joint distribution of two factors. Set an appropriate null hypothesis and test it at 5% significance level. What is the P-value of the test?
3. Draw your conclusions after doing normal probability plots on
the weights and on the heights.
4. Fit a straight line to the scatterplot of weights vs heights. What
is your conclusion about the relationship between them? Give an appropriate
measure of dependence between the weight and height of a person.
5. Estimate from the data the population means for the weight and the height. What are the standard errors of these estimates?
6. Present the results of your analysis in a nice readable form.
data <- importData("your_file.txt",colNameRow=1) # Here your data is saved in SPLUS as data, the first row of your ASCII file data is used to name the columns # <- (and also _) assigns a name to an object
data.females <- importData("your_file.txt", colNameRow=1, filter="sex = 1") # Here only the data for females is imported
data.females
data[,n] # Here the n:th column is selected data[n,] # Here the n:th row is selected heigth <- data[,4] # Here a vector called height is created (the 4th column is saved as a vector called height)
table(x,y) # Here x and y are the variables you want to tabulate
crosstabs(~x+y)
plot(x,y, xlab="The label of x-axis", ylab="The label of y-axis") title("A scatter plot of x and y") #It is possible to add lines and points to the plot (z and w are the cordinates or the lines or the points) lines(z,w) points(z,w)
hist(x)
qqnorm(x)
mean(x) var(x) # The sample variance stdev(x) # The sample standard deviation median(x) min(x) max(x)
cor(x,y)
chisq.test(x,y) # Here x and y are the variables whose relationship you are interested in
lm(y ~ x) # Here x is the independent variable and y is the dependent variable