In this post, we are going to discuss about two important performance metrics for Azure Blob Storage, latency and bandwidth. Latency in the context of Azure blob storage is the amount of time an application must wait for an input/output request to be completed. Before we discuss Azure storage latency metrics, we must understand requestContinue reading “Azure Blob Storage Performance Metrics”
Monthly Archives: June 2020
Azure Storage Reserved Capacity
To optimize Azure Storage costs, Microsoft provides option of reserved capacity on Azure storage. Reserved storage provides a fixed amount of capacity to customers, on the Azure cloud for the period of reservation. Reserved capacity is priced at a discount compared to normal capacity on Azure storage. Reserved capacity is available for Block Blobs andContinue reading “Azure Storage Reserved Capacity”
ADE Q03: Azure Data Platform Advanced Quiz
Which of the following is NOT correct about Transparent Data Encryption (TDE)? A. TDE is a data encryption technology to keep data encrypted while it is stored in the DB B. TDE is available for Azure Synapse Analytics and Azure SQL Databases C. TDE can be used to encrypt transaction log files D. TDE isContinue reading “ADE Q03: Azure Data Platform Advanced Quiz”
Storage Emulators for Azure
While developing an application, we may need to store and access data in the Azure Storage account. If the amount of read and write access to the Azure Storage is high, this could result in huge costs for testing the application. To provide local deployment and testing option to developers without incurring costs, Azure storageContinue reading “Storage Emulators for Azure”
SBS01 : Importing data into Azure Synapse Analytics using PolyBase
In the previous post, we learnt the basics of Polybase and how it makes data ingestion much faster. In this post we are going to look at the steps that we need to perform to ingest data into Azure Synapse Analytics. Step 1 : Create a Database Master key Explanation: Creating a database master keyContinue reading “SBS01 : Importing data into Azure Synapse Analytics using PolyBase”
PolyBase Introduction
As we have seen in previous post, Azure Synapse Analytics has a Massively Parallel Processing (MPP) architecture with multiple compute nodes controlled by a control node. To take advantage of the MPP architecture, data ingestion must be parallelized. PolyBase optimizes the data ingestion into PDW. The other advantage that Polybase offers is that, it supportsContinue reading “PolyBase Introduction”
Azure SQL Database Dynamic Data Masking
Dynamic data masking plays a significant role in data security in the context of Azure SQL Database. It is way of restricting access of sensitive information to a specific set of users. Dynamic data masking is available for Azure SQL Database, Azure SQL Managed Instance and Azure Synapse Analytics. Dynamic data masking can be implementedContinue reading “Azure SQL Database Dynamic Data Masking”
Passed the DP200 Exam – Implementing an Azure Data Solution
I am happy to have passed Microsoft’s DP-200 Implementing an Azure Data Solution Exam today. This is the one of the requirements towards Microsoft’s Azure Data Engineer Certification. Details about the skills measured in the exam can be found here. The exam had multiple sections, starting with a case study and questionsaround an organization planningContinue reading “Passed the DP200 Exam – Implementing an Azure Data Solution”
Azure Databricks Clusters
An Azure Databricks Cluster is a grouping of computation resources which are used to run data engineering and data science workloads. These workloads include ETL pipelines, streaming data processing and machine learning. Based on the usage, Azure Databricks clusters can be of two types: Interactive Cluster: This type of cluster can be used to workContinue reading “Azure Databricks Clusters”
Azure Event Hubs Partitions
An Azure Event hub partition is an ordered sequence of events. It acts like a queue, when the newer events arrive, they are added to the end of the partition. Data is retained for a fixed amount of time in Azure event hub partitions. The retention time can be configured by the user and isContinue reading “Azure Event Hubs Partitions”