Learning Spark (2nd ed.) by Jules S. Damji (ebook)

Book cover: Learning Spark, by Jules S. Damji (preview) — Cover image for book: Learning Spark (2nd ed.)

Book cover: Learning Spark, by Jules S. Damji — Cover image for book: Learning Spark (2nd ed.)

Gift

Pre-order now and we'll email you as soon as it's released.

DRM Free

Pre-order now and we'll email you as soon as it's released.

This title will be released on .

This eBook is no longer available for sale.

This eBook is not available in your country.

Data is bigger, arrives faster, and comes in a variety of formatsâ??and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark.

Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, youâ??ll be able to:

Learn Python, SQL, Scala, or Java high-level Structured APIsUnderstand Spark operations and SQL EngineInspect, tune, and debug Spark operations with Spark configurations and Spark UIConnect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or KafkaPerform analytics on batch and streaming data using Structured StreamingBuild reliable data pipelines with open source Delta Lake and SparkDevelop machine learning pipelines with MLlib and productionize models using MLflow

In The Press

About the Author

Read on Your Favourite Devices

to find out more

You Might be Interested in

Spark: The Definitive Guide

Bill Chambers

US$59.99

Machine Learning Design Patterns

Valliappa Lakshmanan

US$56.99

Learning SQL

Alan Beaulieu

US$56.99

Practical Time Series Analysis

Aileen Nielsen

US$67.99

Building Machine Learning Pipelines

Hannes Hapke