Training: Designing and Implementing a Data Science Solution on Azure (DP-100)

Ref. DP-100T01

Download PDF

Duration:

4

jours

Exam:

Optionnel

Level:

Intermédiaire

Funding:

Eligible

Home > Trainings > IT Pro > Analytics and Data Management > Designing and Implementing a Data Science Solution on Azure (DP-100)

Designing and Implementing a Data Science Solution on Azure (DP-100)

In this course, Designing and Implementing a Data Science Solution on Azure (DP-100), learn to master machine learning solutions on the cloud with Azure Machine Learning. Our training guides you through data ingestion, model training, deployment, and monitoring of solutions within the Microsoft Azure ecosystem.

Perfectly designed for those with existing knowledge of Python and machine learning, this training will enable you to optimize your projects using the best available tools and techniques. Discover how to transform your data into decisions with a clear strategy and advanced skills in Azure Machine Learning.

Participant Profiles

Data Specialist

Objectives

Use Azure services to develop, train, and deploy machine learning solutions
Design a data ingestion strategy
Train machine learning models
Deploy models in real-time or batch
Explore and utilize Azure Machine Learning resources
Run machine learning pipelines
Optimize models with hyperparameter tuning

Prerequisites

Have followed the course: Microsoft Azure Fundamentals (AZ-900)
Understanding of data science including how to prepare data, train models, and evaluate competing models to select the best one
How to program in the Python programming language and use the Python libraries: pandas, scikit-learn, matplotlib, and seaborn

Course Content

Module 1: Design a data ingestion strategy for machine learning projects

Identify your data source and format
Choose how to serve data to machine learning workflows
Design a data ingestion solution

Module 2: Design a machine learning model training solution

Identify machine learning tasks
Choose a service to train a machine learning model
Decide between compute options

Module 3: Design a model deployment solution

Understand how model will be consumed
Decide on real-time or batch deployment

Module 4: Explore Azure Machine Learning workspace resources and assets

Create an Azure Machine Learning workspace
Identify Azure Machine Learning resources
Identify Azure Machine Learning assets
Train models in the workspace

Module 5: Explore developer tools for workspace interaction

Explore the studio
Explore the Python SDK
Explore the CLI

Module 6: Make data available in Azure Machine Learning

Understand URIs
Create a datastore
Create a data asset

Module 7: Work with compute targets in Azure Machine Learning

Choose the appropriate compute target
Create and use a compute instance
Create and use a compute cluster

Module 8: Work with environments in Azure Machine Learning

Understand environments
Explore and use curated environments
Create and use custom environments

Module 9: Find the best classification model with Automated Machine Learning

Preprocess data and configure featurization
Run an Automated Machine Learning experiment
Evaluate and compare models

Module 10: Track model training in Jupyter notebooks with MLflow

Configure MLflow for model tracking in notebooks
Train and track models in notebooks

Module 11: Run a training script as a command job in Azure Machine Learning

Convert a notebook to a script
Run a script as a command job
Use parameters in a command job

Module 12: Track model training with MLflow in jobs

Track metrics with MLflow
View metrics and evaluate models

Module 13: Run pipelines in Azure Machine Learning

Create components
Create a pipeline
Run a pipeline job

Module 14: Perform hyperparameter tuning with Azure Machine Learning

Define a search space
Configure a sampling method
Configure early termination
Use a sweep job for hyperparameter tuning

Module 15: Deploy a model to a managed online endpoint

Explore managed online endpoints
Deploy your MLflow model to a managed online endpoint
Deploy a model to a managed online endpoint

Module 16: Deploy a model to a batch endpoint

Understand and create batch endpoints
Deploy your MLflow model to a batch endpoint
Deploy a custom model to a batch endpoint
Invoke and troubleshoot batch endpoints

Documentation

Access to Microsoft Learn (online learning content)

Lab / Exercises

Official Microsoft Labs

Exam

This course prepares you to the exam DP-100: Designing and Implementing a Data Science Solution on Azure
If you wish to take this exam, please select it when you add the course to your basket

Complementary Courses

Eligible Funding

ITTA is a partner of a continuing education fund dedicated to temporary workers. This fund can subsidize your training, provided that you are subject to the “Service Provision” collective labor agreement (CCT) and meet certain conditions, including having worked at least 88 hours in the past 12 months.

Additional Information

Designing and Implementing a Data Science Solution on Azure (DP-100)

The design and implementation of a data science solution on Azure (DP-100) is a crucial skill for data professionals looking to leverage Azure’s power for their machine learning and artificial intelligence projects. This training guides you through all the necessary steps to master Azure Machine Learning, from data ingestion to model deployment.

Introduction to Data Science on Azure

Azure is Microsoft’s cloud platform, widely used for developing and deploying data science solutions. It offers a range of tools and services that enable the efficient and scalable design, training, and deployment of machine learning models.

Designing a Data Ingestion Strategy

One of the first steps in any data science project is data ingestion. It is essential to choose the appropriate data source and determine the optimal format for your machine learning workflows. A well-designed ingestion solution ensures that data is easily accessible and usable for modeling.

Training Machine Learning Models

Identifying specific machine learning tasks and selecting the appropriate services for model training is crucial. Azure offers various services that facilitate this process, providing flexible computing options to meet the specific needs of each project.

Deploying Machine Learning Models

Deployment is a critical step in making models accessible and operational. Azure allows for either real-time or batch deployment, depending on your application’s needs. Understanding how the model is consumed helps in choosing the best deployment strategy.

Exploring Azure Machine Learning Resources and Assets

The Azure Machine Learning Workspace is where you can manage all aspects of your machine learning projects. The creation and management of resources and assets, such as models and datasets, are facilitated by Azure’s intuitive interface.

Using Development Tools to Interact with Azure

To effectively interact with the Azure Machine Learning workspace, it is important to master various development tools like the Azure Studio, Python SDK, and CLI interface. These tools allow for smooth management and interaction with your projects.

Data Availability in Azure Machine Learning

Creating URIs, data stores, and data assets ensures that your datasets are always available and well-organized for machine learning experiments. Efficient data management is essential for accurate and reproducible results.

Using Compute Targets in Azure

Azure offers various compute options, including compute instances and clusters, which allow for efficient management of the resources needed for model training. These compute targets are optimized to provide high performance and great flexibility.

Exploring and Using Environments in Azure Machine Learning

Environments in Azure Machine Learning play a crucial role in ensuring that experiments are reproducible and isolated. Using curated environments or creating custom environments as per your project’s needs can significantly enhance development efficiency.

Automated Machine Learning and Model Optimization

Automated Machine Learning enables data preprocessing, feature engineering, and experiment execution to find the best classification models. This approach helps automate and optimize the modeling process.

Tracking Model Training with MLflow

MLflow is a powerful tool for model tracking. Configuring MLflow to track experiments in Jupyter notebooks and jobs allows for effective management and evaluation of models, ensuring traceability and reproducibility of results.

Running and Tracking Training Scripts

Running training scripts as command jobs in Azure Machine Learning and using parameters for these jobs ensures great flexibility and efficient management of experiments.

Deploying Models to Endpoints

Deploying models to managed online or batch endpoints makes the models accessible for production applications. Azure facilitates this process by offering robust and scalable deployment options.

Hyperparameter Tuning and Pipeline Execution

Configuring a search space, using sampling methods, and hyperparameter tuning techniques optimize model performance. Additionally, creating and running pipelines in Azure Machine Learning automate and orchestrate the various stages of the machine learning workflow.

Mastering the design and implementation of a data science solution on Azure opens numerous opportunities in the field of machine learning and artificial intelligence. By taking this training, you will be equipped to pass the DP-100 exam successfully and fully exploit the capabilities of Microsoft Azure for your data science projects.

Frequently Asked Questions

What is the design and implementation of a data science solution on Azure?

It is the process of developing, training, deploying, and managing machine learning models using the tools and services provided by Azure.

How difficult is the DP-100 exam?

The DP-100 exam evaluates skills in designing and implementing data science solutions on Azure. With adequate preparation, including a thorough understanding of the modules mentioned, you can pass this exam successfully.

What is the purpose of Azure in data science?

Azure provides a powerful cloud infrastructure for developing and deploying data science solutions, facilitating data ingestion, model training, and production deployment efficiently and scalably.