(1)The survey data set in R contains contains the responses of 237 Statistics I
ID: 3304662 • Letter: #
Question
(1)The survey data set in R contains contains the responses of 237 Statistics I students at the University of Adelaide to a number of questions. The codebook for select variables in this data can be found in the Appendix.
(a) Define the data type for all variables listed in the codebook.
(b) What variable(s) would it be appropriate to create a scatterplot of?
(c) What variable(s) would it be appropriate to create a bar chart of?
(d) Describe one interesting relationship in this data and what type of graphic you would create to examine this relationship. You can sketch a picture if you think it would help.
(2)
Rosiglitazone is the active ingredient in the controversial type 2 diabetes medicine Avandia and hasbeen linked to an increased risk of serious cardiovascular problems such as stroke, heart failure, and death. A common alternative treatment is pioglitazone, the active ingredient in a diabetes medicine called Actos. Data collected as part of a nationwide retrospective observational study of 227,571 Medicare beneficiaries aged 65 years or older are summarized in the contingency table below.
Use the information in this table to answer the following questions.
(a) How many patients on Pioglitazone had cardiovascular problems? How many patients on Rosiglitazonehad problems? Can we conclude from these numbers that the rate of cardiovascular problems for those on a Pioglitazone treatment is higher.
(b) Do the data suggest that diabetic patients who are taking rosiglitazone are more likely to have cardiovascular problems than those on pioglitazone? Justify your answer.
(c) What proportion of all patients had cardiovascular problems?
Explanation / Answer
a) The data type is Nominal data type for all variables listed in the codebook.
b)Exer how often the student exercise,Smoke how often the student smoke,Height height of the student in centimeter, Age age of the student in year these variables would be appropriate to create a scatterplot.
c)
Exer how often the student exercise,Smoke how often the student smoke,Height height of the student in centimeter, Age age of the student in year these variables would be appropriate to create a barchart.
d) One interesting relationship in this data is height of the student in centimeter and age of the student in year i.e. height is depend on the age.graphic to examine this relationship is correlation plot which plot the linear relationship betwwen height and age.
Q2)
a) 5386 patients on Pioglitazone had cardiovascular problems.
2593 patients on Rosiglitazonehad problems.
Yes we conclude from these numbers that the rate of cardiovascular problems for those on a Pioglitazone treatment is higher. as rate of cardiovascular problems with Rosiglitazone is 2593 / 67593 = 3.84% and rate of Pioglitazone treatment is 5386 / 159978 = 53.98 is more.
b)
No ,data doesnot suggest that suggest that diabetic patients who are taking rosiglitazone are more likely to have cardiovascular problems than those on pioglitazone because rate of cardiovascular problems with Rosiglitazone is 2593 / 67593 = 3.84% and rate of Pioglitazone treatment is 5386 / 159978 = 53.98
c) roportion of all patients had cardiovascular problems = (7979 / 227571 ) = 0.0351
Proportion of all patients had cardiovascular problems is 0.0351
Related Questions
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.