Samenvatting

Full summary of Data Science for Business (ch 1-14) (Grade 8,5)

Name: Full summary of Data Science for Business (ch 1-14) (Grade 8,5)
SKU: doc_667379
Rating: 4.13 (8 reviews)
Author: hannah2501

Beoordeling

4,1

(8)

Verkocht

Pagina's

Geüpload op

10-03-2020

Geschreven in

2019/2020

Full summary of the book 'Data science for business' including graphs and pictures from the book!

Instelling

Vak

Voorbeeld van de inhoud

Summary of the book

Chapter 1 - Introduction

Chapter 2 – Business problems & data science solutions

Chapter 3 – Introduction to predictive modeling

Chapter 4 - Fitting a model to data

Chapter 5 – Overfitting and its avoidance

Chapter 6 – Similarity, neighbors and clusters

Chapter 7 – Decision analytic thinking I

Chapter 8 – Visualizing model performance

Chapter 9 – Evidence and probabilities

Chapter 10 – Representing and mining text

Chapter 11 – Decision analytic thinking II

Chapter 12 – Other data science tasks and techniques

Chapter 13 – Data science and business strategy

Chapter 14 – Conclusion

Chapter 1 – Introduction: Data-analytic Thinking

Data science = principles, processes and techniques for understanding phenomena via the
analysis of data
 Ultimate goal: improving decision making
Data-driven Decision making (DDD) = the practice of basing decisions on the analysis of
data, rather than purely on intuition
- Increases production (1 SD higher on the DDD scale equals 4-6% increase in
productivity)
- Higher return on assets, return on equity, asset utilization and market value

,2 types of decisions
1. Decisions for which discoveries need to be made within data
2. Decisions that repeat (especially at massive scale), so decision-making can benefit
from even small increases in decision-making accuracy
a. E.g. churn problems in big companies

Predictive model abstracts away most of the complexity of the world by focusing on a
particular set of indicators that correlate in some way with a quantity of interest

Data science supports data-driven decision making, but also overlaps with data-driven
decision making
 Business decisions are being made automatically by computer systems

Data engineering & processing critical to support data science, but are themselves more
general
 Many data processing skills, systems and technologies often mistaken as data
science

Difference data science vs. data processing
Data science = needs access to data and it often benefits from sophisticated data
engineering that data processing technologies may facilitate, but these technologies are not
data science technologies per se
Data processing = important for data-oriented business tasks that don’t involve extracting
knowledge or data-driven decision-making

Big data = datasets that are too large for traditional data processing systems and therefore
require new processing technologies
 Big data technologies expected to be used for implementing data mining techniques,
but more often used for supporting data mining techniques

Big data 1.0 = during web 1.0, businesses busied themselves with getting basic internet
technologies in place to they could establish web presence, build electronic capability and
improve efficiency of operations: firms are busying themselves with building capabilities to
process large data, largely in support of current operations
Big data 2.0 = Once firms have become capable of processing massive data in flexible
fashion, they begin asking what can I do now that I couldn’t do before or do better than I
could do before
 Implementation of social networking component and rise of the voice of the
individual consumer

Fundamental principle of data science: data and the capability to extract useful knowledge
from data, should be regarded as key strategic assets
 Too many businesses regard data analytics as pertaining mainly to realizing value
from some existing data, without checking if you have the appropriate analytical
talent

2

,Right talent & right data = complementary assets
If you don’t have the right data: buy it
Many firms nowadays exploit new & existing data resources for competitive advantage
 Data analytic projects reach into all business units: requires close interaction with
data scientists and business people

3

, Chapter 2 – Business problems and data science solutions

An individual = refers to an entity about which we have data (e.g. a consumer or business)

1.Classification and class probability estimation = attempts to predict, for each individual in
a population, which of a set of classes this individual belongs to (usually the classes are
mutually exclusive)
e.g. among all customers at MegaTelCo, which are most likely to respond to given offer
- Data mining produces a model that determines which class that individual belongs to
- Closely related task: scoring/ probability estimation = applies a score to individuals
representing the probability that the individual belongs to each of the classes
- Requires categorical (often binary) target

2.Regression (value estimation) = attempts to estimate or predict for each individual the
numerical value of some variable for that individual
e.g. how much will a given customer use the service?
- Related to classification but different: classification predicts whether something will
happen, regression predicts how much something will happen
- Requires numeric target

3.Similarity matching = attempts to identify similar individuals based on data known about
them (can be used to find similar entities)
e.g. IBM is interested in finding companies similar to their business customers
- Basis for one of the most popular methods for making product recommendations
(finding people who are similar to you in terms of the products they have liked/
purchased)

4.Clustering = attempts to group individuals in a population together by their similarity, but
not driven by any specific purpose
e.g. Do our customers form natural groups or segments?
- Useful in preliminary domain exploration to see which natural groups exist because
these groups in turn may suggest other data mining tasks/ approaches

5.Co-occurrence grouping = attempts to find associations between entities based on
transactions involving them (aka frequent itemset mining, association rule discovery,
market-basket analysis)
e.g. What items are commonly purchased together? Recommendation: customers who
bought X also bought Y
- While clustering looks at similarity between objects based on objects’ attributes, co-
occurrence grouping considers similarity of objects based on their appearing
together in transactions
- Co-occurrence of products is common type of grouping named market-basket
analysis (ch.12)

6.Profiling (behavior description) = attempts to characterize the typical behavior of an
individual, group or population
e.g. What is the typical cell phone usage of this customer segment?

4

Meld schending auteursrecht

Gekoppeld boek

Foster Provost, Tom Fawcett Data Science for Business

Uitgave:augustus 2013
ISBN:9781449361327
Druk:1

Geschreven voor

Instelling: Universiteit van Amsterdam (UvA)
Studie: Business Administration
Vak: Strategy analytics

Alle documenten voor dit vak (5)

Documentinformatie

Heel boek samengevat?: Ja
Geüpload op: 10 maart 2020
Bestand laatst geupdate op: 30 maart 2020
Aantal pagina's: 57
Geschreven in: 2019/2020
Type: SAMENVATTING

Onderwerpen

data science
summary

€10,98

Krijg toegang tot het volledige document:

Gekocht door 79 studenten

Geschreven door studenten die geslaagd zijn

Direct beschikbaar na je betaling

Online lezen of als PDF

Maak kennis met de verkoper

hannah2501

3,7

(32)

Beoordelingen van geverifieerde kopers

7 van 8 beoordelingen worden weergegeven

carlosjmorenos 191 · 2 beoordelingen

2 jaar geleden

shop Vwo · 53 beoordelingen

3 jaar geleden

taliaabdool · 2 beoordelingen

4 jaar geleden

bridgetharrell · 1 beoordeling

4 jaar geleden

tiesdelen Business Administration International Management

5 jaar geleden

thijsthijs Bedrijfskunde · 7 beoordelingen

5 jaar geleden

Goede samenvatting per hoofdstuk, redelijk uitgebreid. Voor diepgang is het soms wel nodig om ander onderzoek te doen!

raoulbouchrit Bedrijfseconomie · 13 beoordelingen

5 jaar geleden

4,1

8 beoordelingen

Betrouwbare reviews op Stuvia

Alle beoordelingen zijn geschreven door echte Stuvia-gebruikers na geverifieerde aankopen.

Maak kennis met de verkoper

hannah2501 Universiteit van Amsterdam

Bekijk profiel

Volgen

Verkocht

289

Lid sinds

10 jaar

Aantal volgers

229

Documenten

Laatst verkocht

2 maanden geleden

3,7

32 beoordelingen

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper hannah2501. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €10,98. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews) Afgelopen 30 dagen zijn er 50153 samenvattingen verkocht Opgericht in 2010, al 16 jaar dé plek om samenvattingen te kopen

Full summary of Data Science for Business (ch 1-14) (Grade 8,5)

Voorbeeld van de inhoud

Gekoppeld boek

Geschreven voor

Documentinformatie

Onderwerpen

Meer vakken binnen Universiteit van Amsterdam (UvA) > Business Administration

Beoordelingen van geverifieerde kopers

Maak kennis met de verkoper

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Niet tevreden? Kies een ander document

Betaal zoals je wilt, start meteen met leren

Bezig met je bronvermelding?

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?