Data Mining Test 1
Study online at https://quizlet.com/_je120p
1. What is Data Mining?
The process of discovering interesting patterns and knowledge from large
amounts of data. The data sources can include databases, data warehouses, the
Web, other information repositories, or data that are streamed into the system
dynamically.
2. What are some Data Mining Functionalities?
Generalization, Association Rule Discovery and Correlation Analysis, Classification,
Cluster Analysis, and Outlier Analysis.
3. Nominal data type
Means "relating to names". Values are symbols or names of things, such as a
category, code, or state.
4. Binary data type
A nominal attribute with only two categories or states, 0 and 1. 0 typically
represents "absent", while 1 represents "present".
5. Ordinal data type
An attribute with possible values that have a meaningful order or ranking among
them, but the magnitude between successive values is not known. One example
is drink sizes at a fast food restaurant, such as "Small", "Medium",
and "Large".
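As a minimal sketch of what "meaningful order, unknown magnitude" allows in practice, the snippet below sorts and compares drink sizes through an explicit rank table. The `SIZE_RANK` mapping and the sample `orders` list are assumptions made for illustration; the rank numbers only encode order, so subtracting them would not be meaningful.

```python
# Hypothetical rank table: the integers encode order only, not magnitude.
SIZE_RANK = {"Small": 0, "Medium": 1, "Large": 2}

# Hypothetical sample of ordinal values.
orders = ["Large", "Small", "Medium", "Small"]

# Sorting and taking the maximum are valid because the values are ordered.
sorted_orders = sorted(orders, key=SIZE_RANK.__getitem__)
largest = max(orders, key=SIZE_RANK.__getitem__)

print(sorted_orders)
print(largest)
```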
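6. Interval-scaled data type
A numeric attribute is quantitative; that is, it is a measurable quantity,
represented in integer or real values. Numeric attributes can be interval-scaled
or ratio-scaled. Interval-scaled attributes are measured on a scale of equal-size
units with no true zero point, so differences between values are meaningful but
ratios are not. A temperature attribute in Celsius or Fahrenheit is interval-scaled.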
7. Ratio-scaled data type
A numeric attribute with an inherent zero-point. That is, if a measurement is
ratio-scaled, we can speak of a value as being a multiple (or ratio) of another
value. Examples of ratio-scaled attributes include count attributes such as years
of experience and number of words.
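The difference between interval-scaled and ratio-scaled attributes can be seen with temperature. The sketch below assumes the temperature is recorded in Celsius (interval-scaled, no true zero) and converts it to Kelvin (ratio-scaled, zero is absolute zero). The two readings are made-up values for illustration.

```python
def celsius_to_kelvin(c):
    """Convert a Celsius reading to Kelvin, which has a true zero point."""
    return c + 273.15

# Hypothetical readings in degrees Celsius.
a_c, b_c = 20.0, 10.0

# Differences are meaningful on an interval scale: both scales agree.
diff_c = a_c - b_c
diff_k = celsius_to_kelvin(a_c) - celsius_to_kelvin(b_c)

# Ratios are not: 20 C is not "twice as hot" as 10 C.
naive_ratio = a_c / b_c
kelvin_ratio = celsius_to_kelvin(a_c) / celsius_to_kelvin(b_c)

print(diff_c, round(diff_k, 2))
print(naive_ratio, round(kelvin_ratio, 3))
```

The Celsius ratio comes out as 2.0, while the ratio on the Kelvin scale is only about 1.035, which is why ratios are reserved for attributes with an inherent zero-point.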
8. Vector data type
9. Three types of measures
Distributive, Algebraic, Holistic
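The card only names the three types, so the following is a sketch under the usual textbook reading: a distributive measure can be computed per partition and then combined (sum, count), an algebraic measure is derived from a fixed number of distributive measures (mean = sum / count), and a holistic measure needs the whole data set at once (median). The `partitions` data is hypothetical.

```python
from statistics import median

# Hypothetical data set, split into three partitions.
partitions = [[4, 8, 15], [16, 23], [42]]

# Distributive: compute per partition, then combine the partial results.
total = sum(sum(p) for p in partitions)
count = sum(len(p) for p in partitions)

# Algebraic: derived from a fixed number of distributive measures.
mean = total / count

# Holistic: needs all values together; partial medians cannot be combined.
all_values = [v for p in partitions for v in p]
overall_median = median(all_values)

print(total, count, mean, overall_median)
```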
10. How is mid-range calculated?
(Min + Max) / 2: the sum of the smallest and largest values, divided by 2.
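A short sketch of the mid-range calculation follows. The function name `midrange` and the sample values are assumptions for illustration only.

```python
def midrange(values):
    """Return the average of the smallest and largest values."""
    if not values:
        raise ValueError("midrange is undefined for an empty collection")
    return (min(values) + max(values)) / 2

# Hypothetical sample data.
salaries = [30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110]

print(midrange(salaries))  # (30 + 110) / 2 = 70.0
```

Because only the two extreme values are used, a single outlier shifts the mid-range directly.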
11. What is the process of data discovery?
Data cleaning, data integration, data selection, data transformation, data mining,
pattern evaluation, knowledge presentation.
12. Provide an example of a predictive mining task
13. Provide an example of a descriptive mining task
14. What is the knowledge discovery process?
Data cleaning, data integration, data selection, data transformation, data mining,
pattern evaluation, knowledge presentation.
15. What is a data warehouse?
A repository of information collected from multiple sources, stored under a unified
schema, and usually residing at a single site.
16. Data selection
The step where data relevant to the analysis task are retrieved from the database.
17. Data transformation
Data are transformed and consolidated into forms appropriate for mining, for
example by performing summary or aggregation operations.
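As one possible illustration of an aggregation operation, the sketch below consolidates daily sales records into monthly totals. The `daily_sales` records and the choice of month as the grouping key are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical raw records: (ISO date, sale amount).
daily_sales = [
    ("2024-01-03", 120.0),
    ("2024-01-17", 80.0),
    ("2024-02-05", 200.0),
]

# Aggregate to one total per month, a coarser form suited to mining.
monthly_totals = defaultdict(float)
for date, amount in daily_sales:
    month = date[:7]  # "YYYY-MM" prefix of the ISO date
    monthly_totals[month] += amount

print(dict(monthly_totals))
```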
18. Data Mining (as a process)
An essential process where intelligent methods are applied to extract data patterns.
19. Pattern Evaluation
To identify the truly interesting patterns representing knowledge based on
interestingness measures.
20.