D204- Data Analytics Tools and Techniques# 34
Complete Questions & Answers.
What do open-source software tools and widely available analysis tools such as
spreadsheets help accomplish> - -Data democritization
-What is a feature of SQL? - -The basic language is the same across database services
It is used with structured data and unstructured data
-What is an example of unstructured data? - -Credit card numbers that include a credit
score
-data that doesn't follow a specific format
-data that doesn't currently fit in a structured relational database table
-EX) satellite images, photographs, video, radar images, text within word documents,
social media content like tweets and FB posts, along content.. etc.
-unstructured data files can be stored within a structured database, however, their
content still remains unstructured
-unstructured data a major source of the "variety" in big data
-Which tool should a researcher use to conduct a univariate analysis on complex
statistical data? - -R
-Which statistical technique should be used to draw conclusions about an entire
population based on representative sample? - -Hypothesis testing?
-What is an example of random sampling of college students? - -Surveying students
chosen arbitrarily from around the entire college campus
-Which type of analysis would be used to predict a binary outcome based on a set of
independent variables? - -Regression
-Which type of data analysis is appropriate if the goal is to minimize the cost of a diet,
using a data set consisting of the following variables: protein content, fat content, and
cost per unit? - -Optimization
-Which technique can be used to determine the likelihood that a positive diagnostic test
result indicates whether the disease is actually present? - -Bayes theorem
-Which concept should be considered when choosing variables for inclusion in a linear
regression model? - -Feasibility of controlling the variables
-A neural network algorithm in machine learning endeavors to recognize underlying
relationships in a set of data. What does this process mimic? - -The way the human
brain operates
Complete Questions & Answers.
What do open-source software tools and widely available analysis tools such as
spreadsheets help accomplish> - -Data democritization
-What is a feature of SQL? - -The basic language is the same across database services
It is used with structured data and unstructured data
-What is an example of unstructured data? - -Credit card numbers that include a credit
score
-data that doesn't follow a specific format
-data that doesn't currently fit in a structured relational database table
-EX) satellite images, photographs, video, radar images, text within word documents,
social media content like tweets and FB posts, along content.. etc.
-unstructured data files can be stored within a structured database, however, their
content still remains unstructured
-unstructured data a major source of the "variety" in big data
-Which tool should a researcher use to conduct a univariate analysis on complex
statistical data? - -R
-Which statistical technique should be used to draw conclusions about an entire
population based on representative sample? - -Hypothesis testing?
-What is an example of random sampling of college students? - -Surveying students
chosen arbitrarily from around the entire college campus
-Which type of analysis would be used to predict a binary outcome based on a set of
independent variables? - -Regression
-Which type of data analysis is appropriate if the goal is to minimize the cost of a diet,
using a data set consisting of the following variables: protein content, fat content, and
cost per unit? - -Optimization
-Which technique can be used to determine the likelihood that a positive diagnostic test
result indicates whether the disease is actually present? - -Bayes theorem
-Which concept should be considered when choosing variables for inclusion in a linear
regression model? - -Feasibility of controlling the variables
-A neural network algorithm in machine learning endeavors to recognize underlying
relationships in a set of data. What does this process mimic? - -The way the human
brain operates