Architecting Data and Machine Learning Platforms

Architecting Data and Machine Learning Platforms
Author: Marco Tranquillin
Publisher:
Total Pages: 0
Release: 2024-01-30
Genre:
ISBN: 9781098151614


Download Architecting Data and Machine Learning Platforms Book in PDF, Epub and Kindle

All cloud architects need to know how to build data platforms--the key to enabling businesses with data and delivering enterprise-wide intelligence in a fast and efficient way. This handbook is ideal for learning how to design, build, and modernize cloud native data and machine learning platforms using AWS, Azure, Google Cloud, or multicloud tools like Fivetran, dbt, Snowflake, and Databricks. Authors Marco Tranquillin, Valliappa Lakshmanan, and Firat Tekiner cover the entire data lifecycle in a cloud environment, from ingestion to activation, using real-world enterprise architectures. You'll learn how to transform and modernize familiar solutions, like data warehouses and data lakes, and you'll be able to leverage recent AI/ML patterns to get accurate and quicker insights to drive competitive advantage. This book shows you how to: Design a modern cloud native or hybrid data analytics and machine learning platform Accelerate data-led innovation by consolidating enterprise data in a data platform Democratize access to enterprise data and allow business teams to extract insights and build AI/ML capabilities Enable your business to make decisions in real time using streaming pipelines Move from a descriptive analytics approach to a more predictive and prescriptive one by building an MLOps platform Make your organization more effective in working with data analytics and machine learning in a cloud environment

Architecting Data and Machine Learning Platforms

Architecting Data and Machine Learning Platforms
Author: Marco Tranquillin
Publisher: "O'Reilly Media, Inc."
Total Pages: 361
Release: 2023-10-12
Genre: Computers
ISBN: 1098151585


Download Architecting Data and Machine Learning Platforms Book in PDF, Epub and Kindle

All cloud architects need to know how to build data platforms that enable businesses to make data-driven decisions and deliver enterprise-wide intelligence in a fast and efficient way. This handbook shows you how to design, build, and modernize cloud native data and machine learning platforms using AWS, Azure, Google Cloud, and multicloud tools like Snowflake and Databricks. Authors Marco Tranquillin, Valliappa Lakshmanan, and Firat Tekiner cover the entire data lifecycle from ingestion to activation in a cloud environment using real-world enterprise architectures. You'll learn how to transform, secure, and modernize familiar solutions like data warehouses and data lakes, and you'll be able to leverage recent AI/ML patterns to get accurate and quicker insights to drive competitive advantage. You'll learn how to: Design a modern and secure cloud native or hybrid data analytics and machine learning platform Accelerate data-led innovation by consolidating enterprise data in a governed, scalable, and resilient data platform Democratize access to enterprise data and govern how business teams extract insights and build AI/ML capabilities Enable your business to make decisions in real time using streaming pipelines Build an MLOps platform to move to a predictive and prescriptive analytics approach

Architecting Modern Data Platforms

Architecting Modern Data Platforms
Author: Jan Kunigk
Publisher: "O'Reilly Media, Inc."
Total Pages: 636
Release: 2018-12-05
Genre: Computers
ISBN: 1491969229


Download Architecting Modern Data Platforms Book in PDF, Epub and Kindle

There’s a lot of information about big data technologies, but splicing these technologies into an end-to-end enterprise data platform is a daunting task not widely covered. With this practical book, you’ll learn how to build big data infrastructure both on-premises and in the cloud and successfully architect a modern data platform. Ideal for enterprise architects, IT managers, application architects, and data engineers, this book shows you how to overcome the many challenges that emerge during Hadoop projects. You’ll explore the vast landscape of tools available in the Hadoop and big data realm in a thorough technical primer before diving into: Infrastructure: Look at all component layers in a modern data platform, from the server to the data center, to establish a solid foundation for data in your enterprise Platform: Understand aspects of deployment, operation, security, high availability, and disaster recovery, along with everything you need to know to integrate your platform with the rest of your enterprise IT Taking Hadoop to the cloud: Learn the important architectural aspects of running a big data platform in the cloud while maintaining enterprise security and high availability

The Machine Learning Solutions Architect Handbook

The Machine Learning Solutions Architect Handbook
Author: David Ping
Publisher: Packt Publishing Ltd
Total Pages: 442
Release: 2022-01-21
Genre: Computers
ISBN: 1801070415


Download The Machine Learning Solutions Architect Handbook Book in PDF, Epub and Kindle

Build highly secure and scalable machine learning platforms to support the fast-paced adoption of machine learning solutions Key Features Explore different ML tools and frameworks to solve large-scale machine learning challenges in the cloud Build an efficient data science environment for data exploration, model building, and model training Learn how to implement bias detection, privacy, and explainability in ML model development Book DescriptionWhen equipped with a highly scalable machine learning (ML) platform, organizations can quickly scale the delivery of ML products for faster business value realization. There is a huge demand for skilled ML solutions architects in different industries, and this handbook will help you master the design patterns, architectural considerations, and the latest technology insights you’ll need to become one. You’ll start by understanding ML fundamentals and how ML can be applied to solve real-world business problems. Once you've explored a few leading problem-solving ML algorithms, this book will help you tackle data management and get the most out of ML libraries such as TensorFlow and PyTorch. Using open source technology such as Kubernetes/Kubeflow to build a data science environment and ML pipelines will be covered next, before moving on to building an enterprise ML architecture using Amazon Web Services (AWS). You’ll also learn about security and governance considerations, advanced ML engineering techniques, and how to apply bias detection, explainability, and privacy in ML model development. By the end of this book, you’ll be able to design and build an ML platform to support common use cases and architecture patterns like a true professional. What you will learn Apply ML methodologies to solve business problems Design a practical enterprise ML platform architecture Implement MLOps for ML workflow automation Build an end-to-end data management architecture using AWS Train large-scale ML models and optimize model inference latency Create a business application using an AI service and a custom ML model Use AWS services to detect data and model bias and explain models Who this book is for This book is for data scientists, data engineers, cloud architects, and machine learning enthusiasts who want to become machine learning solutions architects. You’ll need basic knowledge of the Python programming language, AWS, linear algebra, probability, and networking concepts before you get started with this handbook.

Designing Cloud Data Platforms

Designing Cloud Data Platforms
Author: Danil Zburivsky
Publisher: Simon and Schuster
Total Pages: 334
Release: 2021-04-20
Genre: Computers
ISBN: 1617296449


Download Designing Cloud Data Platforms Book in PDF, Epub and Kindle

Centralized data warehouses, the long-time defacto standard for housing data for analytics, are rapidly giving way to multi-faceted cloud data platforms. Companies that embrace modern cloud data platforms benefit from an integrated view of their business using all of their data and can take advantage of advanced analytic practices to drive predictions and as yet unimagined data services. Designing Cloud Data Platforms is an hands-on guide to envisioning and designing a modern scalable data platform that takes full advantage of the flexibility of the cloud. As you read, you''ll learn the core components of a cloud data platform design, along with the role of key technologies like Spark and Kafka Streams. You''ll also explore setting up processes to manage cloud-based data, keep it secure, and using advanced analytic and BI tools to analyse it. about the technology Access to affordable, dependable, serverless cloud services has revolutionized the way organizations can approach data management, and companies both big and small are raring to migrate to the cloud. But without a properly designed data platform, data in the cloud can remain just as siloed and inaccessible as it is today for most organizations. Designing Cloud Data Platforms lays out the principles of a well-designed platform that uses the scalable resources of the public cloud to manage all of an organization''s data, and present it as useful business insights. about the book In Designing Cloud Data Platforms, you''ll learn how to integrate data from multiple sources into a single, cloud-based, modern data platform. Drawing on their real-world experiences designing cloud data platforms for dozens of organizations, cloud data experts Danil Zburivsky and Lynda Partner take you through a six-layer approach to creating cloud data platforms that maximizes flexibility and manageability and reduces costs. Starting with foundational principles, you''ll learn how to get data into your platform from different databases, files, and APIs, the essential practices for organizing and processing that raw data, and how to best take advantage of the services offered by major cloud vendors. As you progress past the basics you''ll take a deep dive into advanced topics to get the most out of your data platform, including real-time data management, machine learning analytics, schema management, and more. what''s inside The tools of different public cloud for implementing data platforms Best practices for managing structured and unstructured data sets Machine learning tools that can be used on top of the cloud Cost optimization techniques about the reader For data professionals familiar with the basics of cloud computing and distributed data processing systems like Hadoop and Spark. about the authors Danil Zburivsky has over 10 years experience designing and supporting large-scale data infrastructure for enterprises across the globe. Lynda Partner is the VP of Analytics-as-a-Service at Pythian, and has been on the business side of data for over 20 years.

Machine Learning Design Patterns

Machine Learning Design Patterns
Author: Valliappa Lakshmanan
Publisher: O'Reilly Media
Total Pages: 408
Release: 2020-10-15
Genre: Computers
ISBN: 1098115759


Download Machine Learning Design Patterns Book in PDF, Epub and Kindle

The design patterns in this book capture best practices and solutions to recurring problems in machine learning. The authors, three Google engineers, catalog proven methods to help data scientists tackle common problems throughout the ML process. These design patterns codify the experience of hundreds of experts into straightforward, approachable advice. In this book, you will find detailed explanations of 30 patterns for data and problem representation, operationalization, repeatability, reproducibility, flexibility, explainability, and fairness. Each pattern includes a description of the problem, a variety of potential solutions, and recommendations for choosing the best technique for your situation. You'll learn how to: Identify and mitigate common challenges when training, evaluating, and deploying ML models Represent data for different ML model types, including embeddings, feature crosses, and more Choose the right model type for specific problems Build a robust training loop that uses checkpoints, distribution strategy, and hyperparameter tuning Deploy scalable ML systems that you can retrain and update to reflect new data Interpret model predictions for stakeholders and ensure models are treating users fairly

The Machine Learning Solutions Architect Handbook

The Machine Learning Solutions Architect Handbook
Author: David Ping
Publisher: Packt Publishing Ltd
Total Pages: 603
Release: 2024-04-15
Genre: Computers
ISBN: 180512482X


Download The Machine Learning Solutions Architect Handbook Book in PDF, Epub and Kindle

Design, build, and secure scalable machine learning (ML) systems to solve real-world business problems with Python and AWS Purchase of the print or Kindle book includes a free PDF eBook Key Features Go in-depth into the ML lifecycle, from ideation and data management to deployment and scaling Apply risk management techniques in the ML lifecycle and design architectural patterns for various ML platforms and solutions Understand the generative AI lifecycle, its core technologies, and implementation risks Book DescriptionDavid Ping, Head of GenAI and ML Solution Architecture for global industries at AWS, provides expert insights and practical examples to help you become a proficient ML solutions architect, linking technical architecture to business-related skills. You'll learn about ML algorithms, cloud infrastructure, system design, MLOps , and how to apply ML to solve real-world business problems. David explains the generative AI project lifecycle and examines Retrieval Augmented Generation (RAG), an effective architecture pattern for generative AI applications. You’ll also learn about open-source technologies, such as Kubernetes/Kubeflow, for building a data science environment and ML pipelines before building an enterprise ML architecture using AWS. As well as ML risk management and the different stages of AI/ML adoption, the biggest new addition to the handbook is the deep exploration of generative AI. By the end of this book , you’ll have gained a comprehensive understanding of AI/ML across all key aspects, including business use cases, data science, real-world solution architecture, risk management, and governance. You’ll possess the skills to design and construct ML solutions that effectively cater to common use cases and follow established ML architecture patterns, enabling you to excel as a true professional in the field.What you will learn Apply ML methodologies to solve business problems across industries Design a practical enterprise ML platform architecture Gain an understanding of AI risk management frameworks and techniques Build an end-to-end data management architecture using AWS Train large-scale ML models and optimize model inference latency Create a business application using artificial intelligence services and custom models Dive into generative AI with use cases, architecture patterns, and RAG Who this book is for This book is for solutions architects working on ML projects, ML engineers transitioning to ML solution architect roles, and MLOps engineers. Additionally, data scientists and analysts who want to enhance their practical knowledge of ML systems engineering, as well as AI/ML product managers and risk officers who want to gain an understanding of ML solutions and AI risk management, will also find this book useful. A basic knowledge of Python, AWS, linear algebra, probability, and cloud infrastructure is required before you get started with this handbook.

Machine Learning with Microsoft Technologies

Machine Learning with Microsoft Technologies
Author: Leila Etaati
Publisher: Apress
Total Pages: 363
Release: 2019-06-12
Genre: Computers
ISBN: 1484236580


Download Machine Learning with Microsoft Technologies Book in PDF, Epub and Kindle

Know how to do machine learning with Microsoft technologies. This book teaches you to do predictive, descriptive, and prescriptive analyses with Microsoft Power BI, Azure Data Lake, SQL Server, Stream Analytics, Azure Databricks, HD Insight, and more. The ability to analyze massive amounts of real-time data and predict future behavior of an organization is critical to its long-term success. Data science, and more specifically machine learning (ML), is today’s game changer and should be a key building block in every company’s strategy. Managing a machine learning process from business understanding, data acquisition and cleaning, modeling, and deployment in each tool is a valuable skill set. Machine Learning with Microsoft Technologies is a demo-driven book that explains how to do machine learning with Microsoft technologies. You will gain valuable insight into designing the best architecture for development, sharing, and deploying a machine learning solution. This book simplifies the process of choosing the right architecture and tools for doing machine learning based on your specific infrastructure needs and requirements. Detailed content is provided on the main algorithms for supervised and unsupervised machine learning and examples show ML practices using both R and Python languages, the main languages inside Microsoft technologies. What You'll Learn Choose the right Microsoft product for your machine learning solutionCreate and manage Microsoft’s tool environments for development, testing, and production of a machine learning projectImplement and deploy supervised and unsupervised learning in Microsoft products Set up Microsoft Power BI, Azure Data Lake, SQL Server, Stream Analytics, Azure Databricks, and HD Insight to perform machine learning Set up a data science virtual machine and test-drive installed tools, such as Azure ML Workbench, Azure ML Server Developer, Anaconda Python, Jupyter Notebook, Power BI Desktop, Cognitive Services, machine learning and data analytics tools, and more Architect a machine learning solution factoring in all aspects of self service, enterprise, deployment, and sharing Who This Book Is For Data scientists, data analysts, developers, architects, and managers who want to leverage machine learning in their products, organization, and services, and make educated, cost-saving decisions about their ML architecture and tool set.

Data Lakehouse in Action

Data Lakehouse in Action
Author: Pradeep Menon
Publisher: Packt Publishing Ltd
Total Pages: 206
Release: 2022-03-17
Genre: Computers
ISBN: 1801815100


Download Data Lakehouse in Action Book in PDF, Epub and Kindle

Propose a new scalable data architecture paradigm, Data Lakehouse, that addresses the limitations of current data architecture patterns Key FeaturesUnderstand how data is ingested, stored, served, governed, and secured for enabling data analyticsExplore a practical way to implement Data Lakehouse using cloud computing platforms like AzureCombine multiple architectural patterns based on an organization's needs and maturity levelBook Description The Data Lakehouse architecture is a new paradigm that enables large-scale analytics. This book will guide you in developing data architecture in the right way to ensure your organization's success. The first part of the book discusses the different data architectural patterns used in the past and the need for a new architectural paradigm, as well as the drivers that have caused this change. It covers the principles that govern the target architecture, the components that form the Data Lakehouse architecture, and the rationale and need for those components. The second part deep dives into the different layers of Data Lakehouse. It covers various scenarios and components for data ingestion, storage, data processing, data serving, analytics, governance, and data security. The book's third part focuses on the practical implementation of the Data Lakehouse architecture in a cloud computing platform. It focuses on various ways to combine the Data Lakehouse pattern to realize macro-patterns, such as Data Mesh and Data Hub-Spoke, based on the organization's needs and maturity level. The frameworks introduced will be practical and organizations can readily benefit from their application. By the end of this book, you'll clearly understand how to implement the Data Lakehouse architecture pattern in a scalable, agile, and cost-effective manner. What you will learnUnderstand the evolution of the Data Architecture patterns for analyticsBecome well versed in the Data Lakehouse pattern and how it enables data analyticsFocus on methods to ingest, process, store, and govern data in a Data Lakehouse architectureLearn techniques to serve data and perform analytics in a Data Lakehouse architectureCover methods to secure the data in a Data Lakehouse architectureImplement Data Lakehouse in a cloud computing platform such as AzureCombine Data Lakehouse in a macro-architecture pattern such as Data MeshWho this book is for This book is for data architects, big data engineers, data strategists and practitioners, data stewards, and cloud computing practitioners looking to become well-versed with modern data architecture patterns to enable large-scale analytics. Basic knowledge of data architecture and familiarity with data warehousing concepts are required.

Azure Data and AI Architect Handbook

Azure Data and AI Architect Handbook
Author: Olivier Mertens
Publisher: Packt Publishing Ltd
Total Pages: 284
Release: 2023-07-31
Genre: Computers
ISBN: 1803230738


Download Azure Data and AI Architect Handbook Book in PDF, Epub and Kindle

Master core data architecture design concepts and Azure Data & AI services to gain a cloud data and AI architect’s perspective to developing end-to-end solutions Purchase of the print or Kindle book includes a free PDF eBook Key Features Translate and implement conceptual architectures with the right Azure services Inject artificial intelligence into data solutions for advanced analytics Leverage cloud computing and frameworks to drive data science workloads Book DescriptionWith data’s growing importance in businesses, the need for cloud data and AI architects has never been higher. The Azure Data and AI Architect Handbook is designed to assist any data professional or academic looking to advance their cloud data platform designing skills. This book will help you understand all the individual components of an end-to-end data architecture and how to piece them together into a scalable and robust solution. You’ll begin by getting to grips with core data architecture design concepts and Azure Data & AI services, before exploring cloud landing zones and best practices for building up an enterprise-scale data platform from scratch. Next, you’ll take a deep dive into various data domains such as data engineering, business intelligence, data science, and data governance. As you advance, you’ll cover topics ranging from learning different methods of ingesting data into the cloud to designing the right data warehousing solution, managing large-scale data transformations, extracting valuable insights, and learning how to leverage cloud computing to drive advanced analytical workloads. Finally, you’ll discover how to add data governance, compliance, and security to solutions. By the end of this book, you’ll have gained the expertise needed to become a well-rounded Azure Data & AI architect.What you will learn Design scalable and cost-effective cloud data platforms on Microsoft Azure Explore architectural design patterns with various use cases Determine the right data stores and data warehouse solutions Discover best practices for data orchestration and transformation Help end users to visualize data using interactive dashboarding Leverage OpenAI and custom ML models for advanced analytics Manage security, compliance, and governance for the data estate Who this book is forThis book is for anyone looking to elevate their skill set to the level of an architect. Data engineers, data scientists, business intelligence developers, and database administrators who want to learn how to design end-to-end data solutions and get a bird’s-eye view of the entire data platform will find this book useful. Although not required, basic knowledge of databases and data engineering workloads is recommended.