Scaling Python with Dask (Sixth Early Release)

Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libraries like NumPy, pandas, and scikit-learn. Author Holden Karau shows you how to run Dask computations on local systems and then scale to the cloud for heavier workloads. This practical book explains why Dask is popular among industry experts and academics and is used by organizations that include Walmart, Capital One, Harvard Medical School, and NASA. With this book, you'll learn about: what Dask is, where you can use it, and how it compares to other tools; batch data parallel processing; key distributed system concepts for Dask users; higher-level APIs and building blocks; and integrated libraries, such as scikit-learn, pandas, and PyTorch.

Author(s): Holden Karau and Mika Kimmins
Publisher: O'Reilly Media, Inc.
Year: 2023

Language: English
Commentary: early release, raw and unedited
Pages: 26