Bookbot

Data Science with Python and Dask

Valoración del libro

Más información sobre el libro

Dask is a native parallel analytics tool that integrates seamlessly with existing libraries like Pandas, NumPy, and Scikit-Learn, allowing you to work with large datasets using familiar tools. This guide shows how to leverage Dask for data projects without altering your workflow. The print book purchase includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications, with registration instructions provided inside. Efficient data pipelines are crucial for successful data science projects. Dask offers a flexible library for parallel computing in Python, enabling intuitive workflows for ingesting and analyzing large, distributed datasets. It features dynamic task scheduling and parallel collections that enhance the capabilities of NumPy, Pandas, and Scikit-Learn, allowing users to scale their code from a single laptop to a cluster of hundreds of machines effortlessly. The book teaches you to build scalable projects capable of handling massive datasets. You'll explore the Dask framework, analyze the NYC Parking Ticket database, and use DataFrames for streamlined processes. You'll also create machine learning models with Dask-ML, develop interactive visualizations, and build clusters using AWS and Docker. This resource is intended for data scientists and developers familiar with Python and the PyData stack. The author, Jesse Daniel, is an experienced Python developer and educator, leading a team of data scienti

Compra de libros

Data Science with Python and Dask, Jesse C. Daniel

Idioma
Publicado en
2019
product-detail.submit-box.info.binding
(Tapa blanda)
Te avisaremos por correo electrónico en cuanto lo localicemos.

Métodos de pago

3,6
Muy bueno
13 Valoraciones

Nos falta tu reseña aquí

Título
Data Science with Python and Dask
Idioma
Inglés
Publicado en
2019
Formato
Tapa blanda
Páginas
296
ISBN10
1617295604
ISBN13
9781617295607
Serie
Calificación
3,6 de 5
Descripción
Dask is a native parallel analytics tool that integrates seamlessly with existing libraries like Pandas, NumPy, and Scikit-Learn, allowing you to work with large datasets using familiar tools. This guide shows how to leverage Dask for data projects without altering your workflow. The print book purchase includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications, with registration instructions provided inside. Efficient data pipelines are crucial for successful data science projects. Dask offers a flexible library for parallel computing in Python, enabling intuitive workflows for ingesting and analyzing large, distributed datasets. It features dynamic task scheduling and parallel collections that enhance the capabilities of NumPy, Pandas, and Scikit-Learn, allowing users to scale their code from a single laptop to a cluster of hundreds of machines effortlessly. The book teaches you to build scalable projects capable of handling massive datasets. You'll explore the Dask framework, analyze the NYC Parking Ticket database, and use DataFrames for streamlined processes. You'll also create machine learning models with Dask-ML, develop interactive visualizations, and build clusters using AWS and Docker. This resource is intended for data scientists and developers familiar with Python and the PyData stack. The author, Jesse Daniel, is an experienced Python developer and educator, leading a team of data scienti