Azure Blob Storage Performance Metrics

In this post, we discuss two important performance metrics for Azure Blob Storage: latency and bandwidth. Latency, in the context of Azure Blob Storage, is the amount of time an application must wait for an input/output request to complete. Before we discuss Azure Storage latency metrics, we must understand request…

Azure Storage Reserved Capacity

To optimize Azure Storage costs, Microsoft offers the option of reserved capacity on Azure Storage. Reserved capacity provides a fixed amount of storage capacity to customers on the Azure cloud for the period of the reservation. Reserved capacity is priced at a discount compared to normal capacity on Azure Storage. Reserved capacity is available for Block Blobs and…

ADE Q03: Azure Data Platform Advanced Quiz

Which of the following is NOT correct about Transparent Data Encryption (TDE)? A. TDE is a data encryption technology that keeps data encrypted while it is stored in the database. B. TDE is available for Azure Synapse Analytics and Azure SQL Database. C. TDE can be used to encrypt transaction log files. D. TDE is…

Storage Emulators for Azure

While developing an application, we may need to store and access data in an Azure Storage account. If the volume of reads and writes against Azure Storage is high, testing the application could result in huge costs. To give developers a local deployment and testing option without incurring costs, Azure Storage…

SBS01 : Importing data into Azure Synapse Analytics using PolyBase

In the previous post, we learnt the basics of PolyBase and how it makes data ingestion much faster. In this post, we look at the steps we need to perform to ingest data into Azure Synapse Analytics. Step 1: Create a database master key. Explanation: Creating a database master key…

PolyBase Introduction

As we have seen in the previous post, Azure Synapse Analytics has a Massively Parallel Processing (MPP) architecture with multiple compute nodes controlled by a control node. To take advantage of the MPP architecture, data ingestion must be parallelized. PolyBase optimizes data ingestion into PDW. Another advantage of PolyBase is that it supports…

Azure SQL Database Dynamic Data Masking

Dynamic data masking plays a significant role in data security in the context of Azure SQL Database. It is a way of restricting access to sensitive information to a specific set of users. Dynamic data masking is available for Azure SQL Database, Azure SQL Managed Instance, and Azure Synapse Analytics. Dynamic data masking can be implemented…

Passed the DP200 Exam – Implementing an Azure Data Solution

I am happy to have passed Microsoft’s DP-200 Implementing an Azure Data Solution exam today. This is one of the requirements for Microsoft’s Azure Data Engineer certification. Details about the skills measured in the exam can be found here. The exam had multiple sections, starting with a case study and questions around an organization planning…

Azure Databricks Clusters

An Azure Databricks cluster is a group of computation resources used to run data engineering and data science workloads. These workloads include ETL pipelines, streaming data processing, and machine learning. Based on usage, Azure Databricks clusters can be of two types: Interactive Cluster: This type of cluster can be used to work…