Introduction to Analytics Summary - HIR(B) 2026

Uploaded on: 15-05-2026

Written in: 2025/2026

This summary is based on the slides, supplemented with insights from the lectures and assignments for better understanding: all you need for the exam. The course was taught by Jochen De Weerdt. Lecture notes from Introduction to Analytics at KU Leuven covering the fundamentals of the data analytics process. Topics include the distinction between data analytics, machine learning, and AI; data types and structures; the analytics spectrum (descriptive, predictive, prescriptive, cognitive); and supervised vs. unsupervised learning approaches. Essential for understanding core analytics concepts and for exam preparation in the Bachelor handelsingenieur in beleidsinformatica program.

Introduction to Analytics
C1 - The Data Analytics Process
1. Introduction
Use of AI skyrockets
➔ More efficient
➔ More affordable
➔ More accessible
However, 95% of GenAI pilots fail
Can you create business value with AI?
➔ Real business value = in applications
o Which domains do people look at for simple implementations?
o Marketing, risk management, government, web, logistics, …
Data analytics
- Data contains value and knowledge
- Some claim data is the new oil (but I don’t agree)
- But to extract this knowledge, you need to be able to
o Store it
o Manage it
o Analyze it → this remains a big issue: data in itself is not valuable; you need to extract the
information from it in order to realize the value
- Data Mining ≈ Big Data ≈ Data Analytics ≈ Data Science ≈ Machine Learning ≈ Deep Learning ≈
Decision Science ≈ AI ?
AI = Artificial Intelligence = a field of computer science focused on building systems that perform tasks that
normally require human intelligence (for example, pattern recognition, learning, and generalization)
ML = Machine Learning = methods that learn patterns from data to make predictions or decisions, without
being explicitly programmed with rules
Data Analytics = the application of data analysis and machine learning to extract insights from data and
support decision-making
Statistics
- Explain relationships in data (does variable X influence Y?)
- Emphasis on assumptions, uncertainty, interpretability
- Often smaller data, parametric models
- Primary goal: explanation and understanding
ML / AI
- Predict outcomes or support decisions
- Emphasis on performance and generalization
- Often larger data, flexible models
➔ Statistics: explanation & inference
➔ ML: prediction & decision-making
Data science = umbrella term for statistics + ML + AI

Business perspective of analytics:
- Given (lots of) data, extracting useful patterns and models from data
o Instead of hand-coding, let the data speak
o To help predict something, explain something, decide something (and more?)
- Using
1. Data
2. An algorithm
3. A purpose
The resulting patterns and models should be
o Valid: hold on new data with some certainty (i.e. generalizable)
o Useful: it should be possible to act on the pattern (i.e. actionable)
o Unexpected: non-obvious to the system (i.e. interesting)
o Understandable: humans should be able to interpret the pattern (i.e. explainable)
1) Data
o Structured vs. unstructured? Tabular, relational, text, imagery, audio, …
o Two main approaches to deal with non-tabular data
▪ Making it tabular (“featurization”)
▪ Using models that can directly utilize data as-is (“deep learning”)
o A tabular data set (“structured data”)
▪ Instances (examples, rows, observations, customers, cases, …)
▪ Features (attributes, fields, variables, predictors, covariates, explanatory variables,
regressors, independent variables)
• Numeric (continuous)
• Categorical (discrete, factor), either nominal (binary as a special case) or ordinal
▪ Target (label, class, dependent variable, response variable) can also be present
• = the feature that you want to predict
• Numeric, categorical, …
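
The vocabulary above (instances, features, target) and the "featurization" route for non-tabular data can be sketched in a few lines of plain Python. This is only an illustrative sketch: the customer columns, the values, the `encode` and `featurize_text` helpers, and the three-word vocabulary are all made up for the example and do not come from the course material.

```python
from collections import Counter

# A tiny tabular data set: each dict is one instance (row).
# Column names and values are hypothetical, for illustration only.
customers = [
    {"age": 34, "plan": "basic", "satisfaction": "low", "churned": True},
    {"age": 51, "plan": "premium", "satisfaction": "high", "churned": False},
    {"age": 27, "plan": "basic", "satisfaction": "medium", "churned": False},
]

TARGET = "churned"  # the column we want to predict
FEATURES = [col for col in customers[0] if col != TARGET]

# Ordinal feature: the categories have a natural order, so map them to ranks.
SATISFACTION_RANK = {"low": 0, "medium": 1, "high": 2}


def encode(row):
    """Turn one instance into a purely numeric feature vector."""
    return [
        row["age"],                               # numeric (continuous)
        1 if row["plan"] == "premium" else 0,     # nominal, binary special case
        SATISFACTION_RANK[row["satisfaction"]],   # ordinal
    ]


def featurize_text(text, vocabulary):
    """Bag-of-words featurization: one count column per vocabulary word."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocabulary]


X = [encode(row) for row in customers]
y = [row[TARGET] for row in customers]

# Unstructured text made tabular: counts of a (hypothetical) vocabulary.
vocabulary = ["refund", "great", "slow"]
review_vector = featurize_text("Slow delivery and slow refund", vocabulary)

print(FEATURES)        # ['age', 'plan', 'satisfaction']
print(X)               # [[34, 0, 0], [51, 1, 2], [27, 0, 1]]
print(y)               # [True, False, False]
print(review_vector)   # [1, 0, 2]
```

The second route mentioned above (models that use the data as-is, i.e. deep learning) skips the hand-written `featurize_text` step and is not shown here.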

2) Algorithms
o Data analysis spectrum
▪ BI = Business intelligence = what is shown is decided upfront by humans → you design
what you want to see yourself
▪ AI / ML / analytics → you don’t design the output yourself or make assumptions; the algorithm
decides
1. Descriptive analytics = finding hidden structure in data (e.g. clustering, pattern
mining, …)
2. Predictive analytics = build models that predict what will happen (ML techniques
like classification, regression, forecasting, …)
3. Prescriptive analytics = build models that predict what you should do (decision
making, recommender systems, reinforcement learning, …)
4. Cognitive analytics = self-learning systems, cognitive computing, artificial
general intelligence
o 3 big types
▪ (Reinforcement learning: learn by interacting with an environment)
▪ Supervised learning: learn from labeled data → predictive analytics
• Key idea: learn a function that maps inputs X to a known target Y
• Need labels!
• 2 problem types
o Classification → target is categorical (e.g. binary, multiclass, ordinal, …)
o Regression → target is numerical (continuous) (e.g. absolute values,
changes (deltas), quantiles)
• Generalizability to “unseen” data (= data not previously used for the training)
o ML is all about generalizable correlation: the model learns patterns, not
causation (there is no proof that one particular variable influences
another)
o E.g. identifying pictures of tanks: the model focused on the clouds & weather
instead of the tanks themselves (a pattern that does not generalize)
• Example algorithm: decision tree learner
▪ Unsupervised learning: find structure in data → descriptive analytics
• Extract patterns from the data as is
o Clustering: construct groups over the data set
o Association/sequence/… rule mining: find rules of antecedents and
consequents that describe the data
o Anomaly detection: find outliers in the data set
o (Dimensionality reduction: from many variables to fewer)
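
The supervised-learning idea above (learn a mapping from inputs X to a known target Y, then check it on unseen data) can be illustrated with a deliberately small sketch. Instead of a full decision tree learner, it fits a "decision stump": a tree with a single split on one numeric feature. The synthetic data, the 10% label noise, the 150/50 split, and all function names are assumptions made for the example.

```python
import random


def make_data(n, seed):
    """Synthetic labeled data (hypothetical): one numeric feature x in [0, 10);
    the label is 1 when x > 6, with 10% of the labels flipped as noise."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.uniform(0, 10)
        label = 1 if x > 6 else 0
        if rng.random() < 0.1:
            label = 1 - label
        data.append((x, label))
    return data


def accuracy(threshold, data):
    """Share of instances for which 'predict 1 if x > threshold' is correct."""
    hits = sum((1 if x > threshold else 0) == label for x, label in data)
    return hits / len(data)


def fit_stump(train):
    """Learn the split: try every observed x as threshold, keep the best on train."""
    candidates = sorted(x for x, _ in train)
    return max(candidates, key=lambda t: accuracy(t, train))


data = make_data(200, seed=42)
train, test = data[:150], data[150:]   # hold out 50 instances as "unseen" data

threshold = fit_stump(train)
print(f"learned split: x > {threshold:.2f}")
print(f"train accuracy: {accuracy(threshold, train):.2f}")
print(f"test accuracy:  {accuracy(threshold, test):.2f}")
```

The test accuracy is the number that speaks to generalizability, because the held-out instances played no role in choosing the threshold. Note that the stump only captures a correlation between x and the label; it says nothing about whether x causes the label.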
3) Purpose
o Business question? Business problem?
o Types
▪ Exploratory: plots, distributions, quick charts, basic correlations – very visual
But who says you couldn’t build a supervised model to help here?
▪ Descriptive: unsupervised – clustering, association rules
Depends on which style of descriptions you want to get; very often you already
have some hypothesis going on
▪ Explanatory: unsupervised again?
Depending on target definition and model type used, a supervised model can be
used as an explanatory means with not much generalization power going forward
▪ Predictive: supervised for sure (right?)
Though in many cases unsupervised techniques can be used here as a
featurization or pre-processing step
▪ Prescriptive: “what should I do”
What-if analysis using a supervised model, or using good ole’ operations research
o ML isn’t the solution for every problem!
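
The remark above that unsupervised techniques can serve as a featurization or pre-processing step can be sketched with a minimal one-dimensional k-means. This is an illustrative sketch only: the spend figures, the starting centers, and the helper names `kmeans_1d` and `cluster_id` are made up for the example.

```python
def kmeans_1d(values, centers, iterations=10):
    """Plain k-means on one numeric feature, starting from the given centers."""
    centers = list(centers)
    for _ in range(iterations):
        # Assignment step: each value goes to its nearest center.
        groups = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Update step: move each center to the mean of its group
        # (an empty group keeps its old center).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers


def cluster_id(value, centers):
    """Index of the nearest center; usable as a categorical feature."""
    return min(range(len(centers)), key=lambda i: abs(value - centers[i]))


# Hypothetical yearly spend per customer; no labels are involved.
spend = [12, 15, 14, 18, 95, 102, 99, 110]

centers = kmeans_1d(spend, centers=[0, 50])
segments = [cluster_id(v, centers) for v in spend]

print(centers)    # [14.75, 101.5]
print(segments)   # [0, 0, 0, 0, 1, 1, 1, 1]
```

Used descriptively, the two segments are the result in themselves. Used as a pre-processing step, the segment index could be added as an extra column to the tabular data before training a supervised model.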

2. The data analytics process
KDD process = knowledge discovery in databases
➔ Linear process

CRISP-DM = cross-industry standard process for data mining
➔ not a linear process but an iterative one (you won't get it completely
right on the first try)
(SEMMA = Sample, Explore, Modify, Model & Assess)
(The drivetrain approach)

The real data analytics process: complicated, a lot of skipping & going back

Where does it go wrong?
- Misaligned objectives
o Data science teams often optimize model accuracy
o Business teams care about value, insight, and usability
o Accuracy is easy to measure, impact is not
➔ Collaborate with business teams
- Wrong project mindset
o Data science is often treated as an execution task (but that won’t guarantee delivery)
o In reality, it is an exploratory learning process
o Models, features, and parameters are discovered through iteration
➔ Data science teams need the freedom to learn what works as they go (not before they go!)
Managing data science:
- Data science is not a linear project
o Goals, data, and models evolve during the project
o You cannot fully specify requirements upfront
- Key management challenges
o Bridging business goals and technical metrics
o Supporting experimentation and iteration
o Moving from prototype to production reliably
➔ Managing data science requires processes and infrastructure, not just algorithms
MLOps = a set of techniques and practices used to design, build and deploy machine learning models in an
efficient, optimized, and organized manner
➔ How to serve/deliver your models?
➔ Integrated thinking across the entire chain
➔ Key focus: deployment
business problem → data engineering → ML model engineering → code engineering
➔ MLOps technologies:
o open source (TensorFlow, Airflow, Kubeflow, …)
o commercial (Databricks, Azure ML, …)

Written for

Institution: Katholieke Universiteit Leuven (KU Leuven)
Programme: Bachelor handelsingenieur in de beleidsinformatica
Course: Introduction to Analytics (D0H61A)

Document information

Uploaded on: 15 May 2026
Number of pages: 77
Written in: 2025/2026
Type: Summary

Seller: kaatcommeine
