Shop top categories that ship internationally
$16.55 with 79 percent savings
List Price: $79.99
FREE International Returns
No Import Charges & $8.95 Shipping to Canada Details

Shipping & Fee Details

Price $16.55
AmazonGlobal Shipping $8.95
Estimated Import Charges $0.00
Total $25.50

Delivery Wednesday, January 15
In Stock
$$16.55 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$16.55
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Ships from
Amazon.com
Amazon.com
Ships from
Amazon.com
Sold by
Amazon.com
Amazon.com
Sold by
Amazon.com
Returns
30-day refund/replacement
30-day refund/replacement
This item can be returned in its original condition for a full refund or replacement within 30 days of receipt.
Payment
Secure transaction
Your transaction is secure
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others. Learn more
Added to

Sorry, there was a problem.

There was an error retrieving your Wish Lists. Please try again.

Sorry, there was a problem.

List unavailable.
Kindle app logo image

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.

Read instantly on your browser with Kindle for Web.

Using your mobile phone camera - scan the code below and download the Kindle app.

QR code to download the Kindle App

Follow the authors

Something went wrong. Please try your request again later.

Data Science on AWS: Implementing End-to-End, Continuous AI and Machine Learning Pipelines 1st Edition

4.4 4.4 out of 5 stars 231 ratings

{"desktop_buybox_group_1":[{"displayPrice":"$16.55","priceAmount":16.55,"currencySymbol":"$","integerValue":"16","decimalSeparator":".","fractionalValue":"55","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"oi9z8TcPHdIpmo7kQZT9h%2Bmc4q4UMyM6nine3v6M3QBam%2FIhtyxDoJLEIJjK%2F3yfjEC4gxpxP5jW94CaH67KNjOKToGgN0e5drmnS8q28pr13rki4%2B4quAv%2BpVar8j1Ux8let1orLWwPprzXLHmwxA%3D%3D","locale":"en-US","buyingOptionType":"NEW","aapiBuyingOptionIndex":0}]}

Purchase options and add-ons

With this practical book, AI and machine learning practitioners will learn how to successfully build and deploy data science projects on Amazon Web Services. The Amazon AI and machine learning stack unifies data science, data engineering, and application development to help level up your skills. This guide shows you how to build and run pipelines in the cloud, then integrate the results into applications in minutes instead of days. Throughout the book, authors Chris Fregly and Antje Barth demonstrate how to reduce cost and improve performance.

  • Apply the Amazon AI and ML stack to real-world use cases for natural language processing, computer vision, fraud detection, conversational devices, and more
  • Use automated machine learning to implement a specific subset of use cases with SageMaker Autopilot
  • Dive deep into the complete model development lifecycle for a BERT-based NLP use case including data ingestion, analysis, model training, and deployment
  • Tie everything together into a repeatable machine learning operations pipeline
  • Explore real-time ML, anomaly detection, and streaming analytics on data streams with Amazon Kinesis and Managed Streaming for Apache Kafka
  • Learn security best practices for data science projects and workflows including identity and access management, authentication, authorization, and more

Frequently bought together

This item: Data Science on AWS: Implementing End-to-End, Continuous AI and Machine Learning Pipelines
$16.55
In Stock
Ships from and sold by Amazon.com.
+
$55.99
In Stock
Ships from and sold by Amazon.com.
Total price: $00
To see our price, add these items to your cart.
Details
Added to Cart
spCSRF_Treatment
Choose items to buy together.

From the brand


From the Publisher

aws, amazon web services, data science

Who Should Read This Book

This book is for anyone who uses data to make critical business decisions. The guidance here will help data analysts, data scientists, data engineers, ML engineers, research scientists, application developers, and DevOps engineers broaden their understanding of the modern data science stack and level up their skills in the cloud.

The Amazon AI and ML stack unifies data science, data engineering, and application development to help users level up their skills beyond their current roles. We show how to build and run pipelines in the cloud, then integrate the results into applications in minutes instead of days.

Ideally, and to get most out of this book, we suggest readers have the following knowledge:

  • Basic understanding of cloud computing
  • Basic programming skills with Python, R, Java/Scala, or SQL
  • Basic familiarity with data science tools such as Jupyter Notebook, pandas, NumPy, or scikit-learn

Overview of the Chapters

Chapter 1 provides an overview of the broad and deep Amazon AI and ML stack, an enormously powerful and diverse set of services, open source libraries, and infrastructure to use for data science projects of any complexity and scale.

Chapter 2 describes how to apply the Amazon AI and ML stack to real-world use cases for recommendations, computer vision, fraud detection, natural language understanding (NLU), conversational devices, cognitive search, customer support, industrial predictive maintenance, home automation, Internet of Things (IoT), healthcare, and quantum computing.

Chapter 3 demonstrates how to use AutoML to implement a specific subset of these use cases with SageMaker Autopilot.

Chapters 4–9 dive deep into the complete model development life cycle (MDLC) for a BERT-based NLP use case, including data ingestion and analysis, feature selection and engineering, model training and tuning, and model deployment with Amazon SageMaker, Amazon Athena, Amazon Redshift, Amazon EMR, TensorFlow, PyTorch, and serverless Apache Spark.

Chapter 10 ties everything together into repeatable pipelines using MLOps with SageMaker Pipelines, Kubeflow Pipelines, Apache Airflow, MLflow, and TFX.

Chapter 11 demonstrates real-time ML, anomaly detection, and streaming analytics on real-time data streams with Amazon Kinesis and Apache Kafka.

Chapter 12 presents a comprehensive set of security best practices for data science projects and workflows, including IAM, authentication, authorization, network isolation, data encryption at rest, post-quantum network encryption in transit, governance, and auditability.

Throughout the book, we provide tips to reduce cost and improve performance for data science projects on AWS.

Editorial Reviews

Review

"Wow--this book will help you to bring your data science projects from idea all the way
to production. Chris and Antje have covered all of the important concepts and the
key AWS services, with plenty of real-world examples to get you started
on your data science journey."
--Jeff Barr,
Vice President & Chief Evangelist,
Amazon Web Services

"It's very rare to find a book that comprehensively covers the full end-to-end process of
model development and deployment! If you're an ML practitioner, this book is a must!"
--Ramine Tinati,
Managing Director/Chief Data Scientist Applied Intelligence,
Accenture

"This book is a great resource for building scalable machine learning solutions on AWS
cloud. It includes best practices for all aspects of model building, including training,
deployment, security, interpretability, and MLOps."
--Geeta Chauhan,
AI/PyTorch Partner Engineering Head,
Facebook AI

"The landscape of tools on AWS for data scientists and engineers can be absolutely
overwhelming. Chris and Antje have done the community a service by providing a map
that practitioners can use to orient themselves, find the tools they need to get the
job done and build new systems that bring their ideas to life."
--Josh Wills,
Author, Advanced Analytics with Spark (O'Reilly)

"Successful data science teams know that data science isn't just modeling but needs a
disciplined approach to data and production deployment. We have an army of tools for all
of these at our disposal in major clouds like AWS. Practitioners will appreciate this
comprehensive, practical field guide that demonstrates not just how to apply
the tools but which ones to use and when."
--Sean Owen,
Principal Solutions Architect,
Databricks

From the Author

With this practical book, AI and machine learning (ML) practitioners will learn how
to successfully build and deploy data science projects on Amazon Web Services
(AWS). The Amazon AI and ML stack unifies data science, data engineering, and
application development to help level up your skills. This guide shows you how to
build and run pipelines in the cloud, then integrate the results into applications in
minutes instead of days. Throughout the book, authors Chris Fregly and Antje Barth
demonstrate how to reduce cost and improve performance.
* Apply the Amazon AI and ML stack to real-world use cases for natural language
processing, computer vision, fraud detection, conversational devices, and more.
* Use automated ML (AutoML) to implement a specific subset of use cases with
Amazon SageMaker Autopilot.
* Dive deep into the complete model development life cycle for a BERT-based natural
language processing (NLP) use case including data ingestion and analysis,
and more.
* Tie everything together into a repeatable ML operations (MLOps) pipeline.
* Explore real-time ML, anomaly detection, and streaming analytics on real-time
data streams with Amazon Kinesis and Amazon Managed Streaming for Apache
Kafka (Amazon MSK).
* Learn security best practices for data science projects and workflows, including
AWS Identity and Access Management (IAM), authentication, authorization, and
more.

Overview of the Chapters
Chapter 1 provides an overview of the broad and deep Amazon AI and ML stack, an
enormously powerful and diverse set of services, open source libraries, and infrastructure
to use for data science projects of any complexity and scale.
Chapter 2 describes how to apply the Amazon AI and ML stack to real-world use
cases for recommendations, computer vision, fraud detection, natural language
understanding (NLU), conversational devices, cognitive search, customer support,
industrial predictive maintenance, home automation, Internet of Things (IoT),
healthcare, and quantum computing.
Chapter 3 demonstrates how to use AutoML to implement a specific subset of these
use cases with SageMaker Autopilot.
Chapters 4-9 dive deep into the complete model development life cycle (MDLC) for a
BERT-based NLP use case, including data ingestion and analysis, feature selection
and engineering, model training and tuning, and model deployment with SageMaker,
Amazon Athena, Amazon Redshift, Amazon EMR, TensorFlow, PyTorch, and serverless
Apache Spark.
Chapter 10 ties everything together into repeatable pipelines using MLOps with Sage‐
Maker Pipelines, Kubeflow Pipelines, Apache Airflow, MLflow, and TFX.
Chapter 11 demonstrates real-time ML, anomaly detection, and streaming analytics
on real-time data streams with Amazon Kinesis and Apache Kafka.
Chapter 12 presents a comprehensive set of security best practices for data science
projects and workflows, including IAM, authentication, authorization, network isolation,
data encryption at rest, post-quantum network encryption in transit, governance,
and auditability.
Throughout the book, we provide tips to reduce cost and improve performance for
data science projects on AWS.

Who Should Read This Book
This book is for anyone who uses data to make critical business decisions. The guidance
here will help data analysts, data scientists, data engineers, ML engineers,
research scientists, application developers, and DevOps engineers broaden their
understanding of the modern data science stack and level up their skills in the cloud.
The Amazon AI and ML stack unifies data science, data engineering, and application
development to help users level up their skills beyond their current roles. We show
how to build and run pipelines in the cloud, then integrate the results into applications
in minutes instead of days.

Ideally, and to get most out of this book, we suggest readers have the following
knowledge:
* Basic understanding of cloud computing
* Basic programming skills with Python, R, Java/Scala, or SQL
* Basic familiarity with data science tools such as Jupyter Notebook, pandas,
NumPy, or scikit-learn

Product details

  • Publisher ‏ : ‎ O'Reilly Media; 1st edition (May 11, 2021)
  • Language ‏ : ‎ English
  • Paperback ‏ : ‎ 521 pages
  • ISBN-10 ‏ : ‎ 1492079391
  • ISBN-13 ‏ : ‎ 978-1492079392
  • Item Weight ‏ : ‎ 1.82 pounds
  • Dimensions ‏ : ‎ 7 x 1.05 x 9.19 inches
  • Customer Reviews:
    4.4 4.4 out of 5 stars 231 ratings

About the authors

Follow authors to get new release updates, plus improved recommendations.

Customer reviews

4.4 out of 5 stars
231 global ratings

Review this product

Share your thoughts with other customers
It’s not just a book you read once; it’s a reference guide
4 out of 5 stars
It’s not just a book you read once; it’s a reference guide
If you’re looking to learn how to build machine learning workflows using AWS, this book is a fantastic choice. It covers a wide range of AWS services like SageMaker, Lambda, and Step Functions, showing how to use them together to create powerful data science pipelines. The explanations are clear and easy to understand, even for topics that can be quite technical.What stands out most is how the book is organized. It starts with the basics and gradually moves to advanced topics, making it great for readers at all levels. The authors include step-by-step examples and practical projects, which make it easy to follow along and apply what you learn to real-world tasks.Another highlight is the focus on scalability and automation. These are essential when putting machine learning models into production, and the book goes beyond just teaching the tools—it also explains best practices for optimizing workflows, tracking performance, and keeping your models reliable.Whether you’re new to AWS or experienced in data science, this book has something for everyone. Highly recommended!
Thank you for your feedback
Sorry, there was an error
Sorry we couldn't load the review

Top reviews from the United States

  • Reviewed in the United States on March 22, 2023
    This book is loaded with lots of practical knowledge on how to use the ML services of AWS. This is not a dry cookbook, but also explains what and why. Of note, the book is closely tied to the "Practical Data Science on the AWS Cloud" course on Coursera. The authors are also part of the instructor team for this course. Everything in the course is in the book. But the book has more depth and additional material. Reading the section of the book really enriched the lectures and helped in working on the assignments. But the examples in the book and the assignments in the class are not the same. But it does help to have another example.

    Overall, I would recommend this book, and the course, to anyone who has a basic familiarity of ML concepts and needs to learn how to implement a MLOps pipeline in an AWS cloud environment.
    2 people found this helpful
    Report
  • Reviewed in the United States on July 7, 2024
    Got sagemaker n udemy classes!!

    AWS mls-c01 sagemaker Studio udemy classes and exams!!
  • Reviewed in the United States on December 2, 2024
    If you’re looking to learn how to build machine learning workflows using AWS, this book is a fantastic choice. It covers a wide range of AWS services like SageMaker, Lambda, and Step Functions, showing how to use them together to create powerful data science pipelines. The explanations are clear and easy to understand, even for topics that can be quite technical.

    What stands out most is how the book is organized. It starts with the basics and gradually moves to advanced topics, making it great for readers at all levels. The authors include step-by-step examples and practical projects, which make it easy to follow along and apply what you learn to real-world tasks.

    Another highlight is the focus on scalability and automation. These are essential when putting machine learning models into production, and the book goes beyond just teaching the tools—it also explains best practices for optimizing workflows, tracking performance, and keeping your models reliable.

    Whether you’re new to AWS or experienced in data science, this book has something for everyone. Highly recommended!
    Customer image
    4.0 out of 5 stars It’s not just a book you read once; it’s a reference guide
    Reviewed in the United States on December 2, 2024
    If you’re looking to learn how to build machine learning workflows using AWS, this book is a fantastic choice. It covers a wide range of AWS services like SageMaker, Lambda, and Step Functions, showing how to use them together to create powerful data science pipelines. The explanations are clear and easy to understand, even for topics that can be quite technical.

    What stands out most is how the book is organized. It starts with the basics and gradually moves to advanced topics, making it great for readers at all levels. The authors include step-by-step examples and practical projects, which make it easy to follow along and apply what you learn to real-world tasks.

    Another highlight is the focus on scalability and automation. These are essential when putting machine learning models into production, and the book goes beyond just teaching the tools—it also explains best practices for optimizing workflows, tracking performance, and keeping your models reliable.

    Whether you’re new to AWS or experienced in data science, this book has something for everyone. Highly recommended!
    Images in this review
    Customer image
    Customer image
  • Reviewed in the United States on May 1, 2021
    Very well written, this book covers many AWS services across the entire Amazon AI/ML data science stack. After clearly explaining the value proposition of doing data science in the cloud, the authors navigate the reader through an complete end-to-end machine learning pipeline using the latest in natural language processing techniques including BERT, HuggingFace transformers, and Amazon SageMaker. The authors demonstrate how to implement automated pipelines using TensorFlow, PyTorch, MXNet, Python, and even Java! This book has both technical depth and practical breadth. This book helped me prepare for - and complete - my AWS ML Specialty certification. It was a true delight to read!
    5 people found this helpful
    Report
  • Reviewed in the United States on April 5, 2022
    I like how the authors present the contents. There is a good balance between sample code and explanation.
    There are also many related insights such as Parquet format diagram, compression consideration, performance consideration, etc. The code repository is being actively maintained as well.
    One person found this helpful
    Report
  • Reviewed in the United States on May 26, 2021
    I have been following this book since it was in beta thanks to an Oreilly subscription. I have also attended a workshop put on by the authors. It has greatly helped my overall understanding of how to practically implement in AWS. You can use this as a framework to figure out how and what services to implement.
  • Reviewed in the United States on July 23, 2023
    Customer image
    3.0 out of 5 stars It just arrived but the front page and first pages where folded
    Reviewed in the United States on July 23, 2023

    Images in this review
    Customer image
    Customer image
  • Reviewed in the United States on August 15, 2021
    Loved the way the book is structured and very well written.
    Follows the end to end approach on performing machine learning, specifically on tools available on AWS for moving the ml models in production following the industry best practices.

Top reviews from other countries

Translate all reviews to English
  • Client de Amzn.
    5.0 out of 5 stars Buen material
    Reviewed in Mexico on May 23, 2024
    Lo usé para una prácticas de AWS en la escuela y me ayudó bastante.
    Mi gato también lo aprueba.
    Customer image
    Client de Amzn.
    5.0 out of 5 stars Buen material
    Reviewed in Mexico on May 23, 2024
    Lo usé para una prácticas de AWS en la escuela y me ayudó bastante.
    Mi gato también lo aprueba.
    Images in this review
    Customer image
    Customer image
  • Frank Morales
    5.0 out of 5 stars Outstanding Textbook.
    Reviewed in Canada on January 6, 2024
    Gook book, which provides good guidelines based on a vast amount of practical examples about how to use the AWS Ecosystem Toward data science correctly and accordingly.
  • Philippe Modard
    3.0 out of 5 stars Code not working
    Reviewed in Belgium on July 19, 2024
    The book is great to discover the different services and tools for AI/ML in AWS. But the code in the GitHub repository is very messy, and doesn't work anymore, which goes against the goal of the book.
  • JLB
    5.0 out of 5 stars Awesome
    Reviewed in Germany on April 26, 2023
    This book is great!
  • Ualter
    2.0 out of 5 stars I falls short, very expensive book for what gives you back
    Reviewed in Spain on August 13, 2022
    It is interesting, mainly for those that want to get used to some data science jargon, to get an overview of some processes, and tools. But for the price, it doesn't worth the money, it is very superficial, with too much high-level explanation. If the price of the book were much lower, maybe should deserve more stars, but for that price, it falls short.