Presentation

Machine Learning Production Systems

Rating

Sold

Pages

146

Uploaded on

02-08-2024

Written in

2021/2022

"Using machine learning for products, services, and critical business processes is quite different from using ML in an academic or research setting—especially for recent ML graduates and those moving from research to a commercial environment. Whether you currently work to create products and services that use ML, or would like to in the future, this practical book gives you a broad view of the entire field. Authors Robert Crowe, Hannes Hapke, Emily Caveness, and Di Zhu help you identify topics that you can dive into deeper, along with reference materials and tutorials that teach you the details. You'll learn the state of the art of machine learning engineering, including a wide range of topics such as modeling, deployment, and MLOps. You'll learn the basics and advanced aspects to understand the production ML lifecycle."

Show more Read less

Institution

Course

Content preview

,Brief Table of Contents (Not Yet Final)
Chapter 1: Collecting, Labeling and Validating Data (available)

Chapter 2: Feature Engineering and Selection (available)

Chapter 3: Data Journey and Data Storage (available)

Chapter 4: Advanced Labeling, Automation, and Data Preprocessing (available)

Chapter 5: Model Resource Management Techniques (available)

Chapter 6: High Performance Modeling (available)

Chapter 7: Model Analysis (available)

Chapter 8: Interoperability (available)

Chapter 9: Neural Architecture Search (available)

Chapter 10: Introduction to Model Serving (unavailable)

Chapter 11: Model Serving Patterns and Infrastructure (unavailable)

Chapter 12: Model Management and Delivery (unavailable)

Chapter 13: Model Monitoring and Logging (unavailable)

Chapter 14: Privacy and Legal Requirements (unavailable)

Chapter 15: Productionalizing Machine Learning Pipelines (unavailable)

Chapter 16: Classifying Unstructured Texts (unavailable)

Chapter 17: Image Classification (unavailable)

Chapter 1. Introduction to Machine Learning
Production Systems
A NOTE FOR EARLY RELEASE
READERS

,With Early Release ebooks, you get books in their earliest form—the author’s raw and unedited
content as they write—so you can take advantage of these technologies long before the official
release of these titles.

This will be the 1st chapter of the final book. Please note that the GitHub repo will be made active
later on.

If you have comments about how we might improve the content and/or examples in this book, or
if you notice missing material within this chapter, please reach out to the author
at .

The field of machine learning engineering is so vast that it can be easy to get lost in the different
steps that are necessary to get a model from an experiment into a production deployment. Over the
last few years, machine learning, novel machine learning concepts such as attention, and more
recently large language models (LLMs), have been in the news almost every day. However, very
little discussion has focused on production machine learning, which brings machine learning into
products and applications.

Production Machine Learning covers all areas of machine learning beyond simply training a
machine learning model. Production Machine Learning can be viewed as a combination of
machine learning development and modern software development practices. Machine learning
pipelines build the foundation for Production Machine Learning. Implementing and executing
machine learning pipelines are key aspects of production machine learning.

In this chapter, we will introduce the concept of Production Machine Learning. We’ll also
introduce what machine learning pipelines are, look at their benefits, and walk through the steps
of a machine learning pipeline.

What Is Production Machine Learning?
In an academic or research setting, modeling is relatively straightforward. Typically you have a
data set (often a standard data set that is supplied to you, already cleaned and labeled), and you’re
going to use that dataset to train your model and evaluate the results.

The result that you’re trying to achieve is simply a model that makes good predictions. You’ll
probably go through a few iterations to fully optimize the model, but once you’re satisfied with
the results then typically you’re done.

Production ML requires a lot more than just a model. We’ve found that a model is typically only
about 5% of the code that is required to put an ML application into production. Over their lifetimes
Production ML applications will be deployed, maintained, and improved, so that you can deliver
a consistent high-quality experience to your users.

Let’s look at some of the differences between machine learning modeling in a non-production
environment (typically research or academic), and machine learning in a production environment.

,  In an academic or research environment you’re typically using a static dataset. Production
ML uses real-world data, which is dynamic and usually shifting.
 The design priority for academic or research ML is usually the highest accuracy over the
entire training set. But the design priority for production ML is fast inference, fairness, and
good interpretability - as well as acceptable accuracy - and minimizing cost.
 Model training for research ML is based on a single optimal result, and the tuning and
training necessary to achieve it. Production ML requires continuous monitoring,
assessment, and retraining.
 Interpretability and fairness are very important for any ML modeling, but they are
absolutely crucial for production ML.
 And finally, while the main challenge of academic and research ML is finding and tuning
a high accuracy model, the main challenge for production ML is that accuracy plus
everything else - the entire system.

In a Production ML environment, you’re not just producing a single result, you’re developing a
product or service that is often a mission-critical part of your offering.

For example, in Production ML, if you’re doing supervised learning, then you need to make sure
that your labels are accurate. You also need to make sure that your training dataset has examples
which cover the same feature space as the requests that your model will receive. You also want to
reduce the dimensionality of your feature vector to optimize your system performance while
retaining or enhancing the predictive information in your data.

Throughout all of this you need to consider and measure the fairness of your data and model,
especially for rare conditions. In fields such as healthcare, for example, rare but important
conditions may be absolutely critical to success.

On top of all of that, you’re putting a piece of software into production. That requires a system
design that includes all of the things that are required for any production software deployment.
You need to consider:

 Data preprocessing methods
 Parallelized model training setups
 Repeatable model analysis
 Scalable model deployment

Your Production ML system needs to run automatically, so that you’re continuously monitoring
your model performance, ingesting new data, retraining as needed, and redeploying to maintain or
improve your performance.

And of course, in building an ML Production system, like any production system, you need to try
to do all of this at the minimum cost, while producing the maximum performance. It might seem
daunting, but the good news is that there are well-established tools and methodologies for doing
this.

Report Copyright Violation

Written for

Course: Machine Learning Production Systems

All documents for this subject (1)

Document information

Uploaded on: August 2, 2024
Number of pages: 146
Written in: 2021/2022
Type: PRESENTATION
Person: Unknown

Subjects

machine learning production systems

$4.99

Get access to the full document:

Written by students who passed

Immediately available after payment

Read online or as PDF

Get to know the seller

RobertCuong

Get to know the seller

RobertCuong Telecommunication

View profile

Sold

Member since

3 year

Number of followers

Documents

225

Last sold

GPON and WiFi

+ SDH solution based on Fujitsu/Alcatel/Huawei devices in deployment and troubleshoot + Switching and Routing network fundamental and advance + GPON solution with deep knowledge of PLOAM/OMCI, activation procedure. Analysis of Private/Public OMCI + WiFi solution with WiFi Management/Control/Data. WiFi bandsteering, WiFi mesh, and WiFi 6, 6E, 7, ...

0.0

0 reviews

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

Satisfaction guarantee: how does it work?

Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.

Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller RobertCuong. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy these notes for $4.99. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews) 47251 documents were sold in the last 30 days Founded in 2010, the go-to place to buy study notes for 16 years now

Machine Learning Production Systems

Content preview

Written for

Document information

Subjects

Get to know the seller

Why students choose Stuvia

Created by fellow students, verified by reviews

Didn't get what you expected? Choose another document

Pay as you like, start learning right away

Working on your references?

Frequently asked questions

What do I get when I buy this document?

Satisfaction guarantee: how does it work?

Who am I buying these notes from?

Will I be stuck with a subscription?

Can Stuvia be trusted?