Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Exam (elaborations)

Business Intelligence, Analytics, Data Science, and AI, 5th Edition by Ramesh Sharda, Dursun Delen, Efraim Turban Chapter 5 Predictive Analytics I: Data Mining Process, Methods, and Algorithms

Rating
-
Sold
-
Pages
17
Grade
A+
Uploaded on
28-07-2024
Written in
2023/2024

Business Intelligence, Analytics, Data Science, and AI, 5th Edition by Ramesh Sharda, Dursun Delen, Efraim Turban Chapter 5 Predictive Analytics I: Data Mining Process, Methods, and Algorithms 1) Data mining is a way to develop intelligence (i.e., actionable information or knowledge) from textual data that an organization collects, organizes, and stores. Answer: FALSE Diff: 2 Page Ref: 279 2) The cost of data storage has plummeted recently, initiating the amount of data stored in electronic form grown at an explosive rate, making data mining feasible for more firms. Answer: TRUE Diff: 2 Page Ref: 284 3) Data mining can be very useful in detecting patterns such as credit card fraud but is of little help in improving sales. Answer: FALSE Diff: 1 Page Ref: 284 4) If using a mining analogy, "knowledge mining" would be a more appropriate term than "data mining." Answer: TRUE Diff: 2 Page Ref: 286 5) Consolidation of databases and other data repositories into a single location in the form of a data warehouse had no impact on increased popularity of data mining. Answer: FALSE Diff: 1 Page Ref: 284 6) Data mining requires specialized data analysts to ask ad hoc questions and obtain descriptive answers quickly from the varied data system. Answer: FALSE Diff: 2 Page Ref: 287 7) Confusion matrix can only be used for binary classification problems where there are two outcomes, such as true and false for the target variable. Answer: FALSE Diff: 1 Page Ref: 303 8) The value provided by area under the curve (AUC) ranges between -1 and 1 and is the true representation of the prediction accuracy. Answer: FALSE Diff: 2 Page Ref: 306 9) Bayesian classifiers use probability theory to build classification models based on past occurrences that are capable of placing a new instance into a most probable class. Answer: TRUE Diff: 2 Page Ref: 306 10) In data mining, classification models help in prediction of cases that belong to two or more categories. Answer: TRUE Diff: 2 Page Ref: 288 11) Statistics and data mining both look for data sets that are as large and as varied as possible. Answer: FALSE Diff: 2 Page Ref: 291 12) In CRISP-DM, testing and evaluation step is not focused on evaluating the accuracy of different models, rather it is about assessing whether the business problem is appropriately addressed. Answer: TRUE Diff: 1 Page Ref: 299 13) In KDD standardized process, data mining encompasses the whole process, not just one step. Answer: FALSE Diff: 2 Page Ref: 301 14) During classification in data mining, a false positive is an occurrence classified as positive by the algorithm while being negative in reality. Answer: TRUE Diff: 1 Page Ref: 302 15) k-fold cross-validation methodology can also be called rotation estimation. Answer: TRUE Diff: 2 Page Ref: 305 16) Bootstrapping is like the k-fold cross-validation where the k takes the value of 1. Answer: FALSE Diff: 3 Page Ref: 305 17) The area under the ROC curve is a graphical assessment technique where the true positive rate is plotted on the y-axis and the false negative rate is plotted on the x-axis. Answer: FALSE Diff: 3 Page Ref: 305 18) For classification type predictive modeling, confusion matrix is the primary source for accuracy estimation. Answer: TRUE Diff: 2 Page Ref: 303

Show more Read less
Institution
Course

Content preview

Business Intelligence, Analytics, Data Science,
and AI, 5th Edition by Ramesh Sharda, Dursun
Delen, Efraim Turban
Chapter 5 Predictive Analytics I: Data Mining Process, Methods, and Algorithms

1) Data mining is a way to develop intelligence (i.e., actionable information or knowledge) from
textual data that an organization collects, organizes, and stores.
Answer: FALSE
Diff: 2 Page Ref: 279

2) The cost of data storage has plummeted recently, initiating the amount of data stored in
electronic form grown at an explosive rate, making data mining feasible for more firms.
Answer: TRUE
Diff: 2 Page Ref: 284

3) Data mining can be very useful in detecting patterns such as credit card fraud but is of little
help in improving sales.
Answer: FALSE
Diff: 1 Page Ref: 284

4) If using a mining analogy, "knowledge mining" would be a more appropriate term than "data
mining."
Answer: TRUE
Diff: 2 Page Ref: 286

5) Consolidation of databases and other data repositories into a single location in the form of a
data warehouse had no impact on increased popularity of data mining.
Answer: FALSE
Diff: 1 Page Ref: 284

6) Data mining requires specialized data analysts to ask ad hoc questions and obtain descriptive
answers quickly from the varied data system.
Answer: FALSE
Diff: 2 Page Ref: 287

7) Confusion matrix can only be used for binary classification problems where there are two
outcomes, such as true and false for the target variable.
Answer: FALSE
Diff: 1 Page Ref: 303

8) The value provided by area under the curve (AUC) ranges between -1 and 1 and is the true
representation of the prediction accuracy.
Answer: FALSE
1
Copyright © 2024 Pearson Education Ltd.

,Diff: 2 Page Ref: 306

9) Bayesian classifiers use probability theory to build classification models based on past
occurrences that are capable of placing a new instance into a most probable class.
Answer: TRUE
Diff: 2 Page Ref: 306
10) In data mining, classification models help in prediction of cases that belong to two or more
categories.
Answer: TRUE
Diff: 2 Page Ref: 288

11) Statistics and data mining both look for data sets that are as large and as varied as possible.
Answer: FALSE
Diff: 2 Page Ref: 291

12) In CRISP-DM, testing and evaluation step is not focused on evaluating the accuracy of
different models, rather it is about assessing whether the business problem is appropriately
addressed.
Answer: TRUE
Diff: 1 Page Ref: 299

13) In KDD standardized process, data mining encompasses the whole process, not just one step.
Answer: FALSE
Diff: 2 Page Ref: 301

14) During classification in data mining, a false positive is an occurrence classified as positive by
the algorithm while being negative in reality.
Answer: TRUE
Diff: 1 Page Ref: 302

15) k-fold cross-validation methodology can also be called rotation estimation.
Answer: TRUE
Diff: 2 Page Ref: 305

16) Bootstrapping is like the k-fold cross-validation where the k takes the value of 1.
Answer: FALSE
Diff: 3 Page Ref: 305

17) The area under the ROC curve is a graphical assessment technique where the true positive
rate is plotted on the y-axis and the false negative rate is plotted on the x-axis.
Answer: FALSE
Diff: 3 Page Ref: 305

18) For classification type predictive modeling, confusion matrix is the primary source for
accuracy estimation.
Answer: TRUE
Diff: 2 Page Ref: 303

2
Copyright © 2024 Pearson Education Ltd.

, 19) In predictive model assessment, the model's ability to make reasonably accurate predictions,
given noisy data or data with missing and erroneous values is called scalability.
Answer: FALSE
Diff: 2 Page Ref: 303

20) Data that is collected, stored, and analyzed in data mining is often private and personal. This
type of data needs to go through a process of de-identification prior to applying data mining.
Answer: TRUE
Diff: 2 Page Ref: 322

21) Data mining provides instant, crystal-ball-like insights is nothing but a myth.
Answer: TRUE
Diff: 2 Page Ref: 324

22) K-means is the most popular algorithm used for association rule mining.
Answer: FALSE
Diff: 2 Page Ref: 312-313

23) In data mining, clustering is an exploratory unsupervised data analysis tool for solving
classification problems.
Answer: TRUE
Diff: 2 Page Ref: 310

24) The main idea in market basket analysis is to predict future customer demand patterns so that
an appropriate volume of merchandise would be produced or purchased.
Answer: FALSE
Diff: 2 Page Ref: 313

25) K in K-means clustering refers to the dimensionality of the input data (i.e., the number of
input variables) used in the analysis.
Answer: FALSE
Diff: 2 Page Ref: 312

26) Which of the following is not among the key reasons data mining is gaining attention in the
business world?
A) More intense competition at the global scale
B) Significant increase in mergers and acquisition in the marketplace
C) Recognition of the untapped value hidden in large data sources
D) The exponential increase in data processing and storage technologies
Answer: B
Diff: 2 Page Ref: 284

27) Amazon and others have used analytics to do all of the following, except?
A) Collect data about customers and competitors
B) Better understand their customers
C) Maximize their returns on investment

3
Copyright © 2024 Pearson Education Ltd.

Written for

Course

Document information

Uploaded on
July 28, 2024
Number of pages
17
Written in
2023/2024
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

$9.99
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF


Also available in package deal

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
AcademicSuperScores Chamberlain College Of Nursing
Follow You need to be logged in order to follow users or courses
Sold
296
Member since
3 year
Number of followers
37
Documents
7648
Last sold
2 hours ago
AcademicSuperScores

NURSING, ECONOMICS, MATHEMATICS, BIOLOGY AND HISTORY MATERIALS. BEST TUTORING, HOMEWORK HELP, EXAMS, TESTS AND STUDY GUIDE MATERIALS WITH GUARANTEE OF A+ I am a dedicated medical practitioner with diverse knowledge in matters Nursing and Mathematics. I also have an additional knowledge in Mathematics based courses (finance and economics)

4.5

155 reviews

5
124
4
9
3
11
2
5
1
6

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions