語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
Data engineering for machine learnin...
~
Narayanan, Pavan Kumar.
FindBook
Google Book
Amazon
博客來
Data engineering for machine learning pipelines = from Python libraries to ML pipelines and cloud platforms /
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
Data engineering for machine learning pipelines/ by Pavan Kumar Narayanan.
其他題名:
from Python libraries to ML pipelines and cloud platforms /
作者:
Narayanan, Pavan Kumar.
出版者:
Berkeley, CA :Apress : : 2024.,
面頁冊數:
xxv, 636 p. :ill., digital ;24 cm.
內容註:
Chapter 1: Data Manipulation and Analytics Using Pandas -- Chapter 2: Data Manipulation Using Polars and CuDF -- Chapter 3: Introduction to Data Validation -- Chapter 4: Data Validation Using Great Expectations -- Chapter 5: Introduction to API Design Using FastAPI -- Chapter 6: Introduction to Concurrency Programming Using Task -- Chapter 7: Dask ML -- Module 5: Data Pipelines in the Cloud -- Chapter 9: Introduction to Microsoft Azure -- Chapter 10: Introduction to Google Cloud -- Chapter 11: Introduction to Streaming Data -- Chapter 12: Introduction to Workflow Management Using Airflow -- Chapter 13: Introduction to Workflow Management Using Prefect.
Contained By:
Springer Nature eBook
標題:
Machine learning. -
電子資源:
https://doi.org/10.1007/979-8-8688-0602-5
ISBN:
9798868806025
Data engineering for machine learning pipelines = from Python libraries to ML pipelines and cloud platforms /
Narayanan, Pavan Kumar.
Data engineering for machine learning pipelines
from Python libraries to ML pipelines and cloud platforms /[electronic resource] :by Pavan Kumar Narayanan. - Berkeley, CA :Apress :2024. - xxv, 636 p. :ill., digital ;24 cm.
Chapter 1: Data Manipulation and Analytics Using Pandas -- Chapter 2: Data Manipulation Using Polars and CuDF -- Chapter 3: Introduction to Data Validation -- Chapter 4: Data Validation Using Great Expectations -- Chapter 5: Introduction to API Design Using FastAPI -- Chapter 6: Introduction to Concurrency Programming Using Task -- Chapter 7: Dask ML -- Module 5: Data Pipelines in the Cloud -- Chapter 9: Introduction to Microsoft Azure -- Chapter 10: Introduction to Google Cloud -- Chapter 11: Introduction to Streaming Data -- Chapter 12: Introduction to Workflow Management Using Airflow -- Chapter 13: Introduction to Workflow Management Using Prefect.
This book covers modern data engineering functions and important Python libraries, to help you develop state-of-the-art ML pipelines and integration code. The book begins by explaining data analytics and transformation, delving into the Pandas library, its capabilities, and nuances. It then explores emerging libraries such as Polars and CuDF, providing insights into GPU-based computing and cutting-edge data manipulation techniques. The text discusses the importance of data validation in engineering processes, introducing tools such as Great Expectations and Pandera to ensure data quality and reliability. The book delves into API design and development, with a specific focus on leveraging the power of FastAPI. It covers authentication, authorization, and real-world applications, enabling you to construct efficient and secure APIs using FastAPI. Also explored is concurrency in data engineering, examining Dask's capabilities from basic setup to crafting advanced machine learning pipelines. The book includes development and delivery of data engineering pipelines using leading cloud platforms such as AWS, Google Cloud, and Microsoft Azure. The concluding chapters concentrate on real-time and streaming data engineering pipelines, emphasizing Apache Kafka and workflow orchestration in data engineering. Workflow tools such as Airflow and Prefect are introduced to seamlessly manage and automate complex data workflows. What sets this book apart is its blend of theoretical knowledge and practical application, a structured path from basic to advanced concepts, and insights into using state-of-the-art tools. With this book, you gain access to cutting-edge techniques and insights that are reshaping the industry. This book is not just an educational tool. It is a career catalyst, and an investment in your future as a data engineering expert, poised to meet the challenges of today's data-driven world. What You Will Learn Elevate your data wrangling jobs by utilizing the power of both CPU and GPU computing, and learn to process data using Pandas 2.0, Polars, and CuDF at unprecedented speeds Design data validation pipelines, construct efficient data service APIs, develop real-time streaming pipelines and master the art of workflow orchestration to streamline your engineering projects Leverage concurrent programming to develop machine learning pipelines and get hands-on experience in development and deployment of machine learning pipelines across AWS, GCP, and Azure.
ISBN: 9798868806025
Standard No.: 10.1007/979-8-8688-0602-5doiSubjects--Topical Terms:
533906
Machine learning.
LC Class. No.: Q325.5
Dewey Class. No.: 006.31
Data engineering for machine learning pipelines = from Python libraries to ML pipelines and cloud platforms /
LDR
:04199nmm a2200325 a 4500
001
2375480
003
DE-He213
005
20240928131751.0
006
m d
007
cr nn 008maaau
008
241231s2024 cau s 0 eng d
020
$a
9798868806025
$q
(electronic bk.)
020
$a
9798868806018
$q
(paper)
024
7
$a
10.1007/979-8-8688-0602-5
$2
doi
035
$a
979-8-8688-0602-5
040
$a
GP
$c
GP
041
0
$a
eng
050
4
$a
Q325.5
072
7
$a
UYQ
$2
bicssc
072
7
$a
COM004000
$2
bisacsh
072
7
$a
UYQ
$2
thema
082
0 4
$a
006.31
$2
23
090
$a
Q325.5
$b
.N218 2024
100
1
$a
Narayanan, Pavan Kumar.
$3
3725023
245
1 0
$a
Data engineering for machine learning pipelines
$h
[electronic resource] :
$b
from Python libraries to ML pipelines and cloud platforms /
$c
by Pavan Kumar Narayanan.
260
$a
Berkeley, CA :
$b
Apress :
$b
Imprint: Apress,
$c
2024.
300
$a
xxv, 636 p. :
$b
ill., digital ;
$c
24 cm.
505
0
$a
Chapter 1: Data Manipulation and Analytics Using Pandas -- Chapter 2: Data Manipulation Using Polars and CuDF -- Chapter 3: Introduction to Data Validation -- Chapter 4: Data Validation Using Great Expectations -- Chapter 5: Introduction to API Design Using FastAPI -- Chapter 6: Introduction to Concurrency Programming Using Task -- Chapter 7: Dask ML -- Module 5: Data Pipelines in the Cloud -- Chapter 9: Introduction to Microsoft Azure -- Chapter 10: Introduction to Google Cloud -- Chapter 11: Introduction to Streaming Data -- Chapter 12: Introduction to Workflow Management Using Airflow -- Chapter 13: Introduction to Workflow Management Using Prefect.
520
$a
This book covers modern data engineering functions and important Python libraries, to help you develop state-of-the-art ML pipelines and integration code. The book begins by explaining data analytics and transformation, delving into the Pandas library, its capabilities, and nuances. It then explores emerging libraries such as Polars and CuDF, providing insights into GPU-based computing and cutting-edge data manipulation techniques. The text discusses the importance of data validation in engineering processes, introducing tools such as Great Expectations and Pandera to ensure data quality and reliability. The book delves into API design and development, with a specific focus on leveraging the power of FastAPI. It covers authentication, authorization, and real-world applications, enabling you to construct efficient and secure APIs using FastAPI. Also explored is concurrency in data engineering, examining Dask's capabilities from basic setup to crafting advanced machine learning pipelines. The book includes development and delivery of data engineering pipelines using leading cloud platforms such as AWS, Google Cloud, and Microsoft Azure. The concluding chapters concentrate on real-time and streaming data engineering pipelines, emphasizing Apache Kafka and workflow orchestration in data engineering. Workflow tools such as Airflow and Prefect are introduced to seamlessly manage and automate complex data workflows. What sets this book apart is its blend of theoretical knowledge and practical application, a structured path from basic to advanced concepts, and insights into using state-of-the-art tools. With this book, you gain access to cutting-edge techniques and insights that are reshaping the industry. This book is not just an educational tool. It is a career catalyst, and an investment in your future as a data engineering expert, poised to meet the challenges of today's data-driven world. What You Will Learn Elevate your data wrangling jobs by utilizing the power of both CPU and GPU computing, and learn to process data using Pandas 2.0, Polars, and CuDF at unprecedented speeds Design data validation pipelines, construct efficient data service APIs, develop real-time streaming pipelines and master the art of workflow orchestration to streamline your engineering projects Leverage concurrent programming to develop machine learning pipelines and get hands-on experience in development and deployment of machine learning pipelines across AWS, GCP, and Azure.
650
0
$a
Machine learning.
$3
533906
650
0
$a
Python (Computer program language)
$3
729789
650
0
$a
Cloud computing.
$3
1016782
650
1 4
$a
Artificial Intelligence.
$3
769149
650
2 4
$a
Machine Learning.
$3
3382522
650
2 4
$a
Python.
$3
3201289
710
2
$a
SpringerLink (Online service)
$3
836513
773
0
$t
Springer Nature eBook
856
4 0
$u
https://doi.org/10.1007/979-8-8688-0602-5
950
$a
Professional and Applied Computing (SpringerNature-12059)
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9495929
電子資源
11.線上閱覽_V
電子書
EB Q325.5
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入