Presentation

Designing Machine Learning Systems

Pages

299

Uploaded on

02-08-2024

Written in

2021/2022

"Machine learning systems are both complex and unique. Complex because they consist of many different components and involve many different stakeholders. Unique because they're data dependent, with data varying wildly from one use case to the next. In this book, you'll learn a holistic approach to designing ML systems that are reliable, scalable, maintainable, and adaptive to changing environments and business requirements. Author Chip Huyen, co-founder of Claypot AI, considers each design decision--such as how to process and create training data, which features to use, how often to retrain models, and what to monitor--in the context of how it can help your system as a whole achieve its objectives. The iterative framework in this book uses actual case studies backed by ample references."

Content preview

Chapter 1. Overview of Machine Learning Systems
In November 2016, Google announced that it had incorporated its multilingual neural machine
translation system into Google Translate, marking one of the first success stories of deep artificial
neural networks in production at scale.1 According to Google, with this update, the quality of
translation improved more in a single leap than they had seen in the previous 10 years combined.

This success of deep learning renewed the interest in machine learning (ML) at large. Since then,
more and more companies have turned toward ML for solutions to their most challenging problems.
In just five years, ML has found its way into almost every aspect of our lives: how we access
information, how we communicate, how we work, how we find love. The spread of ML has been
so rapid that it’s already hard to imagine life without it. Yet there are still many more use cases for
ML waiting to be explored in fields such as health care, transportation, farming, and even in
helping us understand the universe.2

Many people, when they hear “machine learning system,” think of just the ML algorithms being
used such as logistic regression or different types of neural networks. However, the algorithm is
only a small part of an ML system in production. The system also includes the business
requirements that gave birth to the ML project in the first place, the interface where users and
developers interact with your system, the data stack, and the logic for developing, monitoring, and
updating your models, as well as the infrastructure that enables the delivery of that logic. Figure 1-1
shows you the different components of an ML system and in which chapters of this book they
will be covered.

THE RELATIONSHIP BETWEEN MLOPS AND ML SYSTEMS DESIGN
Ops in MLOps comes from DevOps, short for Development and Operations. To operationalize something
means to bring it into production, which includes deploying, monitoring, and maintaining it. MLOps is a
set of tools and best practices for bringing ML into production.

ML systems design takes a system approach to MLOps, which means that it considers an ML system
holistically to ensure that all the components and their stakeholders can work together to satisfy the
specified objectives and requirements.

Figure 1-1. Different components of an ML system. “ML algorithms” is usually what people think of when they say machine learning, but it’s only
a small part of the entire system.

There are many excellent books about various ML algorithms. This book doesn’t cover any
specific algorithms in detail but rather helps readers understand the entire ML system as a whole.
In other words, this book’s goal is to provide you with a framework to develop a solution that best
works for your problem, regardless of which algorithm you might end up using. Algorithms might
become outdated quickly as new algorithms are constantly being developed, but the framework
proposed in this book should still work with new algorithms.

The first chapter of the book aims to give you an overview of what it takes to bring an ML model
to production. Before discussing how to develop an ML system, it’s important to ask the fundamental
question of when to use ML and when not to. We’ll cover some of the popular use cases of ML to
illustrate this point.

After the use cases, we’ll move on to the challenges of deploying ML systems, and we’ll do so by
comparing ML in production to ML in research as well as to traditional software. If you’ve been
in the trenches of developing applied ML systems, you might already be familiar with what’s
written in this chapter. However, if you have only had experience with ML in an academic setting,
this chapter will give an honest view of ML in the real world and set your first application up for
success.

When to Use Machine Learning
As its adoption in the industry quickly grows, ML has proven to be a powerful tool for a wide
range of problems. Despite an incredible amount of excitement and hype generated by people both
inside and outside the field, ML is not a magic tool that can solve all problems. Even for problems
that ML can solve, ML solutions might not be the optimal solutions. Before starting an ML project,
you might want to ask whether ML is necessary or cost-effective.3

To understand what ML can do, let’s examine what ML solutions generally do:

Machine learning is an approach to (1) learn (2) complex patterns from (3) existing data and use
these patterns to make (4) predictions on (5) unseen data.

We’ll look at each of the numbered keyphrases in the above framing to understand its implications
for the problems ML can solve:

1. Learn: the system has the capacity to learn

A relational database isn’t an ML system because it doesn’t have the capacity to learn. You
can explicitly state the relationship between two columns in a relational database, but it’s
unlikely to have the capacity to figure out the relationship between these two columns by
itself.

For an ML system to learn, there must be something for it to learn from. In most cases, ML
systems learn from data. In supervised learning, based on example input and output pairs,
ML systems learn how to generate outputs for arbitrary inputs. For example, if you want
to build an ML system to learn to predict the rental price for Airbnb listings, you need to
provide a dataset where each input is a listing with relevant characteristics (square footage,
number of rooms, neighborhood, amenities, rating of that listing, etc.) and the associated
output is the rental price of that listing. Once learned, this ML system should be able to
predict the price of a new listing given its characteristics.
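The supervised-learning setup described above can be sketched in a few lines. This is a minimal illustration, not a realistic pricing model: it assumes a single input feature (square footage), a handful of made-up listings, and ordinary least squares as the learning method. The names `fit_line` and `predict_price` and all the numbers are hypothetical.

```python
# Hypothetical training data: (square footage, nightly price in dollars).
# The values are invented for illustration only.
listings = [(500, 100.0), (800, 130.0), (1000, 150.0), (1200, 170.0)]


def fit_line(pairs):
    """Learn slope and intercept from (input, output) pairs by least squares."""
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_price(model, square_footage):
    """Generate an output for an input that was not in the training data."""
    slope, intercept = model
    return slope * square_footage + intercept


model = fit_line(listings)
new_listing_price = predict_price(model, 900)
print(f"Predicted price for a 900 sq ft listing: {new_listing_price:.2f}")
```

The point is the shape of the workflow: the relationship between input and output is never written down by hand; it is estimated from example pairs and then applied to a listing the system has not seen.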

2. Complex patterns: there are patterns to learn, and they are complex

ML solutions are only useful when there are patterns to learn. Sane people don’t invest
money into building an ML system to predict the next outcome of a fair die because there’s
no pattern in how these outcomes are generated.4 However, there are patterns in how stocks
are priced, and therefore companies have invested billions of dollars in building ML
systems to learn those patterns.
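The fair-die point can be checked with a small simulation. The sketch below assumes a deliberately simple "model" that memorizes, for each roll, the outcome that most often followed it in the training data; the function names, the seed, and the sample sizes are all hypothetical choices. Because successive rolls of a fair die are independent, the accuracy on held-out rolls should stay close to chance (1 in 6), however much data the model sees.

```python
import random
from collections import Counter, defaultdict


def train_next_roll_model(rolls):
    """For each outcome, remember which outcome most often followed it."""
    followers = defaultdict(Counter)
    for previous, following in zip(rolls, rolls[1:]):
        followers[previous][following] += 1
    return {prev: counts.most_common(1)[0][0] for prev, counts in followers.items()}


def accuracy(model, rolls):
    """Fraction of held-out rolls predicted correctly from the previous roll."""
    pairs = list(zip(rolls, rolls[1:]))
    hits = sum(1 for previous, following in pairs if model.get(previous) == following)
    return hits / len(pairs)


rng = random.Random(0)  # fixed seed so the run is repeatable
train_rolls = [rng.randint(1, 6) for _ in range(20000)]
test_rolls = [rng.randint(1, 6) for _ in range(20000)]

model = train_next_roll_model(train_rolls)
test_accuracy = accuracy(model, test_rolls)
print(f"Held-out accuracy: {test_accuracy:.3f} (chance is about {1 / 6:.3f})")
```

Any apparent regularities the model picks up in the training rolls are noise, so they do not carry over to new rolls.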

Whether a pattern exists might not be obvious, or if patterns exist, your dataset or ML
algorithms might not be sufficient to capture them. For example, there might be a pattern
in how Elon Musk’s tweets affect cryptocurrency prices. However, you wouldn’t know
until you’ve rigorously trained and evaluated your ML models on his tweets. Even if all
your models fail to make reasonable predictions of cryptocurrency prices, it doesn’t mean
there’s no pattern.

Consider a website like Airbnb with a lot of house listings; each listing comes with a zip
code. If you want to sort listings into the states they are located in, you wouldn’t need an

