Geschreven door studenten die geslaagd zijn Direct beschikbaar na je betaling Online lezen of als PDF Verkeerd document? Gratis ruilen 4,6 TrustPilot
logo-home
Tentamen (uitwerkingen)

BADM 453 - EXAM 1 QUESTIONS ANSWERED CORRECTLY LATEST UPDATE 2026

Beoordeling
-
Verkocht
-
Pagina's
8
Cijfer
A+
Geüpload op
08-05-2026
Geschreven in
2025/2026

BADM 453 - EXAM 1 QUESTIONS ANSWERED CORRECTLY LATEST UPDATE 2026 Why is Data Mining Useful? - Answers -there has been an explosive growth of data (from terabytes to petabytes) -data collection and data availability make it useful -We are drowning in Data, but starving for knowledge! -We are Data Rich, but information poor What are major sources of data? - Answers -Business: Web, e-commerce, transactions, stocks -Science: remote sensing, bioinformatics, scientific simulation -society and everyone: news, digital camera, Youtube -Internet of Things Data Mining (4 qualities) -n/t -i -pr/un -po/us - Answers Knowledge Discovery from Data -Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amounts of data What are alternative names for Data Mining? - Answers -knowledge discovery in databases (KDD) -knowledge extraction -data/pattern analysis -information harvesting -business intelligence Knowledge Discovery from Data Process (KDD) (7 steps) - Answers -View from Typical Database Systems and Data warehousing communities Databases - Data Cleaning - Data Warehouse - Selection of Task Relevant Data - Data Mining - Pattern Evaluation - Knowledge -Data mining plays an essential role in the knowledge discovery process KDD Process: A Typical View from ML and Statistics 5 Steps - Answers Input Data - Data Pre-Processing - Data Mining - Post Processing - Pattern Information Knowledge Data Pre-Processing - Answers Data Integration, normalization, feature selection, dimension reduction Data Mining - Answers Pattern discovery, association & correlation, classification, clustering, outlier analysis Post- Processing - Answers Pattern evaluation, pattern selection, pattern interpretation, pattern visualization Data Mining in Business Intelligence 6 Steps - Answers Data Sources (Papers, files, web docs, science experiments, database systems) - Data preprocessing/ Integration, Data warehouses - Data Exploration (statistical summary, querying, and reporting) - Data Mining (information Discovery) - Data Presentation (Visualization Techniques) - Decision Making Data Scientists - Answers -exploring, asking questions, doing "what if" analysis, questioning existing assumptions and proccesses Business Analyst - Answers combine deep analytical skills with strong communication skills and strategic mind set to transform data into a competitive asset Data to be Mined - Answers Database data (extended relational, object oriented, hetergeneous, legacy), data warehouse, transactional data, stream, spatiotemporal, time-series, sequence, text and web, multi-media, graphs & social and information networks Knowledge to be Mined (or Data Mining Functions) - Answers -Characterization, discrimination, association, classification, clustering, trend/deviation, outlier analysis -descriptive v predictive data mining -multiple/integrated functions and mining at multiple levels Techniques Utilized - Answers Data-intensive, data warehouse (OLAP), machine learning, statistics, pattern recognition, visualization, high-performance Applications Adapted - Answers Retail, telecommunication, banking, fraud analysis, bio-data mining, stock market analysis, text mining, web mining What kind of Data can be mined? - Answers -Database-oriented data sets and applications like: 1.) relational database, SQL, trends, data patterns 2.) Data warehouse: organized around subjects, historical perspective, summarized 3.) Transactional db -Advanced data sets and advanced applications 1.) data streams and sensor data 2.) Time-series data, temporal data, sequence data 3.) structure data, graphs, social networks, multi-linked data 4.) object-relational databases 5.) heterogeneous db and legacy db 6.) Spatial data and spatiotemporal data 7.) Multimedia db 8.) Text db 9.) World Wide Web First Function of Data Mining: Generalization - Answers Information integration and data warehouse construction - data cleaning, transformation, integration, and multidimensional data model -Multidimensional concept description: characterization and discrimination - generalize, summarize, and contrast data characteristics (eg: dry v. wet region) Second Function of Data Mining: Association and Correlation Analysis - Answers -Frequent Patterns - What items are frequently purchased together in your Walmart? -Association, correlation v causality - A typical association rule -How to mine such patterns and rules efficiently in large dataset? Third Function of Data Mining: Classification - Answers -Classification and label prediction (construct models based on some training example, describe and distinguish classes or concepts for future prediction; predict class label) -Typical Methods (decision trees, naive Bayesian classification, support vector machines, neural networks, rule-based classification, pattern-based classification, logistic regression) -Typical Application (credit card fraud detection, direct marketing, classifying stars, diseases, web-pages Fourth Function of Data Mining: Cluster Analysis - Answers -Unsupervised learning (class label is unknown) -Group data to form categories (cluster) and cluster consumer to find patterns -Principle: Maximizing intra-class similarity and minimizing interclass similarity -Many methods and applications Fifth Function of Data Mining: Association and Correlation Analysis - Answers Outlier Analysis: -Outlier: data object that does not comply with general behavior of the data -Noise or Expection: one person's garbage could be another person's treasure -Methods: by product of clustering or regression analysis -useful in fraud detection, rare events analysis Evaluation of Knowledge - Answers -Are all mined knowledge interesting? -tremendous amount of "patterns" and knowledge -but may only fit certain dimensions -may not always be representative Mined Knowledge should be: - Answers -descriptive v predictive -coverage -typicality v novelty -accuracy -timeliness Types of Tech Used in Data Mining - Answers -Machine Learning -Pattern Recognition -Stats -Visualization -High Performance Computing -Database Tech -Algorithm -Applications Issues with Data Mining - Answers -Efficiency and Scalability (e&s of algorithms; parallel, distributed, stream, and incremental mining methods) -Diversity of Data Types (handling complex types of data; mining dynamic, networked, and global data repositories) -Data Mining and Society (social impact, privacy preserving, invisible) 5 Types of Data Mining Techniques - Answers -Outlier Analysis (used by -Association Analysis (used by -Classification (used by -Clustering (used by -Regression (used by Data Object - Answers an entity Data Objects - Answers described by attributes Database rows - Answers Database objects Database columns - Answers Attributes Attribute - Answers a data field representing a characteristic or feature 5 Types of Attributes (NBOIR) - Answers Nominal Binary Ordinal Numeric: quantitative (interval-scaled and Ratio scaled) Nominal - Answers categories or "names of things" (EG marital status, occupation, ID numbers , zip codes, hair color) Binary - Answers nominal attribute with 2 states (0 and 1) Symmetric Binary - Answers Both outcomes are equally significant (EG GENDER) Asymmetric Binary - Answers Outcomes are NOT equally significant (EG medical test -- positive or negative for color blindness) Convention is to assign 1 to more important (typically rarer) outcome (EG HIV positive) Ordinal - Answers Values have a meaningful order (ranking) but magnitude between successive values is not known (EG amount of pain from 1-10,, or rankings) Quantity (integer or real-valued) Interval - Answers A measure where the difference between two values is meaningful (EG difference bewteen 90-100 degrees is the same between 80-90 degrees) -NO true zero-pt-- can not multiple and divide (20 degrees is not twice as hot as 10 degrees) -EG (temperature in C or F or calendar years) Ratio - Answers All the properties of interval varaible but also has a clear defined zero-pt -We can speak of a values as being a multiple/ratio of another value (EG 10 miles = 2x as long as 5 miles) (EG distance, counts, monetary values) -Freq Dist -Median and Percentiles -Add/Subtract -Mean/S Dev/ S Error of Mean -Ratio/Coeff of Variation - Answers -NOIR -OIR -IR -IR -R Discrete Variable - Answers If a variable CANNOT take on any value between 2 specific values (EG head count, zipcodes etc) Continuous Variable - Answers If a variable can take on any value between 2 specific values (temp, height, weight) Why does Discrete v Continuous matter? - Answers Matters in probability distribution Positive Skew - Answers Mode Median Mean (Tail is on the right hand side) (Skews towards the left) Negative Skew - Answers Mean Median Mode (Tail is on the left hand side) (Skews towards the right) Disadvantage of using Mean - Answers affected by extreme values -Trimmed mean = chopping out extreme values (EG average salary) Disadvantage of using Median when Data set is large - Answers takes a long time to calculate -does not reflect the dispersion Measuring Dispersion of Data (5 ways) - Answers Quartiles Interquartile Range Five Number Summary BoxPlot Outlier Quartiles - Answers Q1 = 25th Percentile Q3 = 75th Percentile Interquartile Range - Answers Q3-Q1 Five Number Summary - Answers Min, Q1, median, Q3, Max BoxPlot - Answers Ends of the Box are quartiles, median is marked, add whiskers, and plot outliers individually Variance - Answers How far a set of data is spread out -STD ^2 Standard Deivation - Answers sq rt of vriance Normal Dist Curve - Answers 1st std = ~68% of measurements 2nd std = 95% 3rd std = 99.7% Graphic Displays of Basic Statistical Descriptions - Answers BoxPlot Histogram Quantile Plot Quantile-quantile (Q-Q) plot Scatter Plot Boxplot - Answers Graphc display of 5 num summary

Meer zien Lees minder
Instelling
BADM 453
Vak
BADM 453

Voorbeeld van de inhoud

BADM 453 - EXAM 1 QUESTIONS ANSWERED CORRECTLY LATEST UPDATE 2026

Why is Data Mining Useful? - Answers -there has been an explosive growth of data (from terabytes to
petabytes)

-data collection and data availability make it useful

-We are drowning in Data, but starving for knowledge!

-We are Data Rich, but information poor
What are major sources of data? - Answers -Business: Web, e-commerce, transactions, stocks

-Science: remote sensing, bioinformatics, scientific simulation

-society and everyone: news, digital camera, Youtube

-Internet of Things
Data Mining (4 qualities)
-n/t
-i
-pr/un
-po/us - Answers Knowledge Discovery from Data

-Extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) patterns or
knowledge from huge amounts of data
What are alternative names for Data Mining? - Answers -knowledge discovery in databases (KDD)
-knowledge extraction
-data/pattern analysis
-information harvesting
-business intelligence
Knowledge Discovery from Data Process (KDD)

(7 steps) - Answers -View from Typical Database Systems and Data warehousing communities

Databases -> Data Cleaning -> Data Warehouse -> Selection of Task Relevant Data -> Data Mining ->
Pattern Evaluation -> Knowledge

-Data mining plays an essential role in the knowledge discovery process
KDD Process: A Typical View from ML and Statistics

5 Steps - Answers Input Data -> Data Pre-Processing -> Data Mining -> Post Processing -> Pattern
Information Knowledge
Data Pre-Processing - Answers Data Integration, normalization, feature selection, dimension
reduction
Data Mining - Answers Pattern discovery, association & correlation, classification, clustering, outlier
analysis
Post- Processing - Answers Pattern evaluation, pattern selection, pattern interpretation, pattern
visualization
Data Mining in Business Intelligence

6 Steps - Answers Data Sources (Papers, files, web docs, science experiments, database systems) ->
Data preprocessing/ Integration, Data warehouses -> Data Exploration (statistical summary, querying,
and reporting) -> Data Mining (information Discovery) -> Data Presentation (Visualization Techniques)
-> Decision Making
Data Scientists - Answers -exploring, asking questions, doing "what if" analysis, questioning existing
assumptions and proccesses

, Business Analyst - Answers combine deep analytical skills with strong communication skills and
strategic mind set to transform data into a competitive asset
Data to be Mined - Answers Database data (extended relational, object oriented, hetergeneous,
legacy), data warehouse, transactional data, stream, spatiotemporal, time-series, sequence, text and
web, multi-media, graphs & social and information networks
Knowledge to be Mined (or Data Mining Functions) - Answers -Characterization, discrimination,
association, classification, clustering, trend/deviation, outlier analysis

-descriptive v predictive data mining
-multiple/integrated functions and mining at multiple levels
Techniques Utilized - Answers Data-intensive, data warehouse (OLAP), machine learning, statistics,
pattern recognition, visualization, high-performance
Applications Adapted - Answers Retail, telecommunication, banking, fraud analysis, bio-data mining,
stock market analysis, text mining, web mining
What kind of Data can be mined? - Answers -Database-oriented data sets and applications like:

1.) relational database, SQL, trends, data patterns
2.) Data warehouse: organized around subjects, historical perspective, summarized
3.) Transactional db

-Advanced data sets and advanced applications
1.) data streams and sensor data
2.) Time-series data, temporal data, sequence data
3.) structure data, graphs, social networks, multi-linked data
4.) object-relational databases
5.) heterogeneous db and legacy db
6.) Spatial data and spatiotemporal data
7.) Multimedia db
8.) Text db
9.) World Wide Web
First Function of Data Mining: Generalization - Answers Information integration and data warehouse
construction -> data cleaning, transformation, integration, and multidimensional data model

-Multidimensional concept description: characterization and discrimination -> generalize, summarize,
and contrast data characteristics (eg: dry v. wet region)
Second Function of Data Mining: Association and Correlation Analysis - Answers -Frequent Patterns ->
What items are frequently purchased together in your Walmart?

-Association, correlation v causality -> A typical association rule

-How to mine such patterns and rules efficiently in large dataset?
Third Function of Data Mining: Classification - Answers -Classification and label prediction (construct
models based on some training example, describe and distinguish classes or concepts for future
prediction; predict class label)

-Typical Methods (decision trees, naive Bayesian classification, support vector machines, neural
networks, rule-based classification, pattern-based classification, logistic regression)

-Typical Application (credit card fraud detection, direct marketing, classifying stars, diseases, web-
pages
Fourth Function of Data Mining: Cluster Analysis - Answers -Unsupervised
learning (class label is unknown)

-Group data to form categories (cluster) and cluster consumer to find patterns

-Principle: Maximizing intra-class similarity and minimizing interclass similarity

Geschreven voor

Instelling
BADM 453
Vak
BADM 453

Documentinformatie

Geüpload op
8 mei 2026
Aantal pagina's
8
Geschreven in
2025/2026
Type
Tentamen (uitwerkingen)
Bevat
Vragen en antwoorden

Onderwerpen

$11.49
Krijg toegang tot het volledige document:

Verkeerd document? Gratis ruilen Binnen 14 dagen na aankoop en voor het downloaden kun je een ander document kiezen. Je kunt het bedrag gewoon opnieuw besteden.
Geschreven door studenten die geslaagd zijn
Direct beschikbaar na je betaling
Online lezen of als PDF

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
joshuawesonga22 Liberty University
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
97
Lid sinds
1 jaar
Aantal volgers
1
Documenten
14118
Laatst verkocht
1 dag geleden
Tutor Wes

Hi there! I'm Tutor Wes, a dedicated tutor with a passion for sharing knowledge and helping others succeed academically. All my notes are carefully organized, detailed, and easy to understand. Whether you're preparing for exams, catching up on lectures, or looking for clear summaries, you'll find useful study materials here. Let’s succeed together!

3.9

9 beoordelingen

5
4
4
1
3
3
2
1
1
0

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Bezig met je bronvermelding?

Maak nauwkeurige citaten in APA, MLA en Harvard met onze gratis bronnengenerator.

Bezig met je bronvermelding?

Veelgestelde vragen