A study was conducted to determine if the consumption of more than 50 grams of sugar
per day by males over 30 year old leads to an increased risk of Type-2 diabetes. Over a
span of 30 years, the study attracted 11,790 male volunteers in the cities of Atlanta,
Brunswick, Savannah, and Valdosta who were older than 30 when they began the
study. Thirty years later it was determined that 2,790 had Type-2 diabetes. From these
results, it was determined that males consuming 50 or more grams of sugar per day
were 27% more likely to have Type-2 diabetes than those males avoiding such sugar
consumption habits.
Which of the following best describes who the target population is for this study? -
correct answer all males over 30 year olds
A study was conducted to determine if the consumption of more than 50 grams of sugar
per day by males over 30 year old leads to an increased risk of Type-2 diabetes. Over a
span of 30 years, the study attracted 11,790 male volunteers in the cities of Atlanta,
Brunswick, Savannah, and Valdosta who were older than 30 when they began the
study. Thirty years later it was determined that 2,790 had Type-2 diabetes. From these
results, it was determined that males consuming 50 or more grams of sugar per day
were 27% more likely to have Type-2 diabetes than those males avoiding such sugar
consumption habits.
Which of the following best describes the sample is for this study? - correct answer the
11,790 male volunteers who participated in the study
A Fulton county pollster obtains a list of all the residential addresses in Sandy Springs
and uses a computer random number generator to choose 725 of them. The pollster
visits each of the 725 households and interviews all the adults in each household about
their television viewing habits. What is the sampling method? - correct answer Clustered
sample
General Motors is estimating the costs of defects across all of their models of vehicles.
During this estimation process, a sample is chosen by separating all cars by models,
and selecting 100 of each type of model vehicle to test for defects. What is the sampling
method? - correct answer Stratified sampling
A relationship between a treatment and the outcome must indicate a causality. In
another word, a person tried a medicine and felt better, does it indicate the medicine
healed him/her? - correct answer No
Which of the following describes core concepts of Data Science?
- Data Science is about drawing useful conclusions from large and diverse data sets
through exploration, prediction, and inference.
, - Exploration involves identifying patterns in information.
- Prediction involves using information we know to make informed guesses about values
we wish we knew.
- Inference involves quantifying our degree of certainty: will the patterns that we found in
our data also appear in new observations? How accurate are our predictions? - correct
answer All of the above
Which of the following properly describes statistics, computing, and data science?
- Statistics is a central component of data science because statistics studies how to
make robust conclusions based on incomplete information.
- Computing is also a central component of data science because programming allows
us to apply analysis techniques to the large and diverse data sets that arise in real-
world applications: not just numbers, but text, images, videos, and sensor reading.
- Data science includes statistics and computing, but it is more than the sum of them
because of the applications. - correct answer All of the above
Which of the following describes Python and Anaconda?
- Python is an interpreted high-level general-purpose programming language.
- Python is a popular programming language for both data science and application
development.
- Anaconda is a software solution that packages together the Python 3 language
interpreter, ipython libraries, and the Jupyter notebook environment. - correct answer All
of the above
Which of the following does NOT describe why ethics are important in data science? -
correct answer Data models and algorithms cannot be used to do good in the world
Which of the following correctly describes data science and statistics?
- The discipline of statistics has long addressed the same fundamental challenge as
data science: how to draw robust conclusions about the world using incomplete
information.
- One of the most important contributions of statistics is a consistent and precise
vocabulary for describing the relationship between observations and conclusions.