[go: up one dir, main page]

0% found this document useful (0 votes)
25 views9 pages

Akhil Data Engineer

Akhil Azmeera is an experienced Azure Data Engineer with over 5 years of expertise in designing and optimizing data pipelines using various Azure services and tools. He has a strong background in SQL optimization, real-time analytics, and automation, along with leadership experience in managing data projects and ensuring compliance with data governance standards. His technical skills include proficiency in cloud platforms, programming languages, big data technologies, and data visualization tools.

Uploaded by

goudsujeeth0
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views9 pages

Akhil Data Engineer

Akhil Azmeera is an experienced Azure Data Engineer with over 5 years of expertise in designing and optimizing data pipelines using various Azure services and tools. He has a strong background in SQL optimization, real-time analytics, and automation, along with leadership experience in managing data projects and ensuring compliance with data governance standards. His technical skills include proficiency in cloud platforms, programming languages, big data technologies, and data visualization tools.

Uploaded by

goudsujeeth0
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 9

AKHIL AZMEERA LinkedIn Profile

Email: azmeeraakhil19@gmail.com PH: (848) 260-9560


Azure Data Engineer

Professional Summary:

 Experienced Azure Data Engineer with over 5 years of hands-on expertise in designing, developing, and
optimizing scalable data pipelines using Azure Data Factory (ADF), Azure Synapse Analytics, Databricks,
Snowflake, Azure Data Lake, and Azure Cosmos DB.
 Proficient in SQL query optimization, including T-SQL, PL/SQL, and Snowflake SQL, with proven success in
performance tuning for high-volume datasets.
 Strong background in building real-time data analytics solutions using Azure Functions, Azure Event Hubs,
Kusto Query Language (KQL), and Power BI for operational insights and decision-making.
 Extensive experience in automating data workflows and integrating DevOps practices with tools like
Terraform, Azure DevOps, and CI/CD pipelines to streamline deployments and ensure high availability.
 Expert in data governance and security best practices, ensuring compliance with GDPR and CCPA
regulations in the Azure ecosystem.
 Skilled in big data technologies such as Azure Synapse, Databricks, Spark, and Azure Data Lake for
handling high-performance data processing, transformation, and storage.
 Proven ability to collaborate with cross-functional teams and communicate technical solutions to both
technical and non-technical stakeholders, delivering impactful, data-driven insights.
 Cost optimization experience using Azure Cost Management, enabling efficient cloud resource usage while
minimizing costs in large-scale data pipeline operations.
 Leadership experience in managing end-to-end data pipeline projects, focusing on performance
improvement, automation, cloud infrastructure, and data migration.
 Experienced in designing and implementing scalable ETL pipelines using Azure Data Factory (ADF),
Databricks, Synapse Analytics, and Azure Functions.
 Proficient in building and optimizing idempotent ETL pipelines with Azure Data Factory, Databricks, and
Snowflake for high-volume data processing and integration.
 Worked on machine learning model development for predictive maintenance, anomaly detection, and
optimization, using Azure Machine Learning and Python.
 Expertise in real-time data analytics and visualization using Power BI, KQL, and Azure-native tools,
ensuring accelerated query performance.
 Strong background in data governance, security, and compliance (GDPR, CCPA) within the Azure
ecosystem.
 Proficient in cost optimization strategies in cloud environments, including Azure Cost Management and
Azure Monitor for tracking resources and performance.
 Experienced in leveraging Azure Cosmos for building globally distributed, scalable, and low-latency cloud
solutions, integrating with services like Azure Functions, Data Factory, and Event Hubs for real-time data
processing.
 Adept in using SQL Server Management Studio (SSMS), Snowflake SQL, Azure SQL Database, and
Databricks for database management and query optimization.
 Experienced in managing large-scale data storage and processing with Azure Data Lake, Snowflake,
Synapse Analytics, and Azure Blob Storage.
 Collaborated with cross-functional teams to align technical solutions with business goals and to deliver end-
to-end data-driven solutions.
 Experienced in automating data workflows with Azure Logic Apps, Azure Data Factory, and Azure
Functions, leveraging Snowpipe for seamless data ingestion.
 Proficient in managing and optimizing cloud infrastructure using Terraform, Azure Resource Manager
(ARM) templates, and Kubernetes.
 Strong problem-solving skills with a focus on optimizing system performance, resource efficiency, and
ensuring high availability in ETL pipelines and data orchestration frameworks.
 Hands-on experience with Azure Data Explorer (ADX), Excel-based reporting, and cross-team collaboration
for UX-centric analytics solutions

Technical Skills:
 Cloud Platforms: Azure (ADF, Databricks, Synapse, Data Lake, Azure Functions, Azure Machine Learning, Azure
Cognitive Services, Azure Cosmos DB, Azure Blob Storage, Azure Monitor, Azure Cost Management)
 Programming Languages: Python, Scala, SQL, T-SQL, PL/SQL, PySpark, Spark-SQL
 Big Data Technologies: Spark, Hive, Delta Lake, Kafka, Azure Databricks, Azure Synapse Analytics, Snowflake, Azure
Data Lake Storage, Snowpipe
 ETL & Data Integration: Azure Data Factory (ADF), Informatica PowerCenter, SSIS
 DevOps & Source Control: Jenkins, Terraform, GitHub, SVN, Docker, Kubernetes, CI/CD pipelines
 Data Modeling & Processing: Snowflake Virtual Warehouses, Stream sets, Data Orchestration, HL7 and FHIR data
processing
 Visualization: Power BI
 Operating Systems: UNIX, Linux, Windows
 Security & Compliance: GDPR, CCPA, Data Governance
 Automation & Scripting: PowerShell, Python scripting
Certifications:
Microsoft Certified: Azure Data Engineer Associate.
Microsoft Certified: Power BI Data Analyst Associate
Professional Experience:
Centific Global Solutions
Sr. Data Engineer | Microsoft Mar 2024 – Present
Location: Remote

Project Overview:
Worked with Microsoft’s infrastructure team to design and implement a scalable and highly efficient data pipeline for
analyzing operational data from data centers. The project involved leveraging Azure cloud services, Snowflake, and ETL
processes to monitor, analyze, and optimize the performance of multiple global data centers. The solution supports
real-time data analytics, predictive maintenance, and resource optimization.

Responsibilities:

 Designed and implemented end-to-end data pipelines using Azure Data Factory (ADF), Databricks, Snowflake,
and Azure Synapse Analytics for large-scale data processing.

 Built scalable ETL pipelines using Azure Data Factory (ADF) and Databricks, ensuring efficient data
transformation and orchestration for large datasets.
 Developed idempotent ETL processes to ensure interrupted, incomplete, or failed pipelines could be rerun
without issues using ADF Dataflows and Pipelines.
 Created and optimized stored procedures in Snowflake SQL, PL/SQL, and T-SQL for efficient data processing
and transformation.
 Utilized Delta Lake for efficient data management and optimized query performance with features like Time
Travel and Schema Enforcement.
 Ensured compliance with GDPR, CCPA, and other data privacy regulations by implementing role-based access
controls (RBAC) and encryption in all data workflows.
 Built and managed automated CI/CD pipelines in Azure DevOps, enabling rapid deployment and version control
of data engineering solutions.

 Developed PowerShell scripts to automate Azure resource provisioning and manage pipeline deployments in
Azure DevOps, streamlining the deployment process.
 Automated multi-step cloud workflows using Azure Logic Apps and PowerShell scripting, enabling seamless
resource provisioning and approval processes, closely aligned with Power Automate capabilities.
 Built event-driven alerts and auto-escalation workflows using Azure Logic Apps, showcasing hands-on
experience with Microsoft’s low-code automation platform that parallels Power Automate workflow design.
 Familiar with Power Automate connectors and integration patterns through my experience building Azure
Logic Apps that interacted with Azure DevOps, SharePoint, and Outlook for real-time notifications and task
automation.
 Collaborated with teams leveraging Power Apps for operational dashboards, gaining exposure to the Power
Platform’s front-end tools and their seamless integration with Azure and Power BI.
 Proactively explored Power Automate templates and workflows to enhance dashboard-driven automation and
approval cycles, with readiness to build low-code automations as part of cross-org reporting solutions.
 Optimized Databricks performance by leveraging caching, partitioning strategies, and PySpark configurations to
reduce job runtimes by X%.
 Created and maintained a custom SQL deployment framework in Azure Data Factory (ADF) to ensure seamless
and repeatable deployments, reducing errors during pipeline execution.
 Collaborated with business teams to understand operational analytics requirements and translated them into
scalable data pipeline solutions.

 Implemented Snowpipe for automated and scalable data ingestion, ensuring continuous data flow with minimal
latency.
 Optimized SQL queries using T-SQL, PL/SQL, and Snowflake SQL for performance tuning, reducing processing
time by 30% for large datasets.
 Developed real-time analytics dashboards in Power BI to provide leadership with actionable insights for
operational decision-making.
 Leveraged Azure Cosmos DB to build scalable and globally distributed solutions for high-availability data
pipelines.
 Automated data ingestion processes using Snowpipe and Azure Functions, ensuring seamless, real-time data
flow with minimal latency.
 Optimized performance of Azure Data Lake by managing partitioning strategies and optimizing consistency
levels.
 Built automated data workflows using Azure Logic Apps, Terraform, and Databricks, streamlining deployment
and resource management.
 Implemented data validation and quality checks using PySpark and Snowflake, ensuring high-quality data in
pipelines.
 Developed complex KQL queries for Azure Data Explorer (ADX) and Azure Monitor, proactively monitoring
performance and flagging anomalies.
 Worked on integrating Azure Cosmos services within cloud-based data pipelines to support globally
distributed, highly available applications with multi-region write capabilities.
 Built scalable and event-driven data processing pipelines by combining Azure Cosmos with Azure Functions,
Event Hubs, and Azure Data Factory for near real-time analytics and ingestion of streaming data.
 Monitored and fine-tuned performance across Cosmos services, configuring consistency levels, throughput,
and partition strategies to meet SLAs for mission-critical enterprise applications.

 Utilized Stream sets in Snowflake to capture changes in data dimensions and maintain historical versions.
 Developed Kusto Query Language (KQL) queries to efficiently analyze large volumes of telemetry and log data
from IoT sensors and server logs, optimizing data center operations.
 Utilized KQL join, summarize, and render operators to correlate telemetry logs across distributed systems,
reducing incident response time by 30%.
 Developed alert rules and scheduled queries in Azure Monitor using KQL to proactively flag anomalies in
performance metrics.
 Built interactive Azure Data Explorer (ADX) dashboards for leadership teams using KQL queries and real-time
log streams.
 Ensured secure data manipulation by applying role-based access controls (RBAC) in Azure and configuring
diagnostic logs for compliance.
 Improved data reliability by implementing data validation and quality checks using PySpark and Snowflake
constraints in ADF pipelines.
 Designed and optimized complex SQL queries in SQL Server Management Studio (SSMS), Azure SQL, and
Snowflake for database management, data integrity checks, and query optimization.
 Created and maintained complex PySpark scripts in Azure Databricks for distributed data processing, machine
learning pipelines, and data transformation workflows.
 Developed ETL system tests for data pipelines, performing root cause analysis on failed data loads and
resolving production issues.
 Developed and optimized Power BI dashboards, integrating with Azure Analysis Services, Snowflake, Azure
Synapse, and SQL Server, reducing manual reporting efforts and improving decision-making efficiency.
 Implemented complex DAX queries to enhance Power BI dashboard performance, ensuring faster insights from
large-scale datasets.
 Integrated DevOps practices into the data pipeline lifecycle using Azure DevOps, implementing CI/CD pipelines
for automating deployments and ensuring high availability of data services.
 Monitored and debugged pipelines using Databricks Job Clusters, Azure Monitor, and Log Analytics, ensuring
consistent and error-free execution of workflows.
 Designed and built a highly efficient orchestrator for scheduling jobs, executing workflows, and performing
data quality checks across pipelines.
 Collaborated with cross-functional teams (Infrastructure, IT Security, DevOps, Data Science) to ensure
alignment with business goals and system performance requirements.
 Implemented advanced Python-based analytics for anomaly detection, predictive modeling, and performance
optimization in data center operations.
 Led the development of SQL-based reporting solutions, automating key performance metrics extraction and
improving reporting timelines.
 Utilized Excel for intermediate data wrangling and validation during reporting cycle, integrating tabular
exports into Power BI dashboards.
 Managed and optimized SQL Server databases, ensuring high availability, data integrity, and performance
tuning using tools like SSMS and SQL Profiler.

Environment: Azure Data Factory (ADF), Azure Synapse Analytics, Snowflake, Databricks, Delta Lake, Spark, Power
BI, Kusto Query Language (KQL), SQL Server Management Studio (SSMS), Azure Functions, Azure Data Lake Storage
(ADLS Gen2), Snowpipe, SQL Server, HL7, FHIR, IoT Data, Power BI Dashboards, Azure Monitor, Azure Logic Apps

Data Engineer
DataSycle, New Jersey Dec 2022 to Feb 2024
Responsibility:

 Optimized data pipeline performance using Azure Data Factory (ADF), Snowflake, and Azure Synapse Analytics
for efficient data transformation and loading.
 Designed and implemented scalable, automated ETL pipelines for data ingestion and transformation,
leveraging Azure Data Factory (ADF) and Snowflake.
 Created reusable metadata-driven pipelines using ADF, improving automation and reducing manual setup by
40%.
 Led data migration projects from on-prem databases to Azure Cloud services, including Azure SQL Database,
Azure Data Lake, and Snowflake, ensuring smooth transitions and process automation.
 Implemented advanced Python-based analytics for anomaly detection, predictive modeling, and performance
optimization in data center operations.
 Wrote PowerShell scripts to automate the deployment of cloud resources in Azure, significantly reducing
manual intervention and accelerating deployment times.
 Integrated Kafka with Azure Data Factory and Databricks for near-real-time data streaming, enabling faster
processing and anomaly detection.
 Improved data pipeline efficiency by refactoring Spark jobs and optimizing SQL queries, achieving up to X%
reduction in processing time.
 Worked with data scientists to integrate predictive analytics into the ETL pipelines, using Azure Machine
Learning to predict operational anomalies.
 Implemented data governance best practices in line with GDPR and CCPA, ensuring secure data access and
compliance across all data pipelines.
 Developed and optimized Power BI dashboards using data from Azure Data Lake and Snowflake, streamlining
reporting for operational decision-making.
 Managed real-time data pipelines using Azure Databricks and Azure Event Hub, enabling near real-time data
processing for anomaly detection.
 Utilized Snowpipe for automated and scalable data ingestion, reducing the time required to process and load
new data into Snowflake.
 Captured changes in data dimensions using StreamSets in Snowflake, automating data updates and
maintaining versioning.
 Collaborated with QA teams to implement automated data quality validation scripts in Snowflake, ensuring
consistent data transformation.
 Implemented security measures in data workflows, ensuring compliance with GDPR and CCPA regulations for
handling sensitive data.

 Designed and implemented self-healing ADF pipelines to auto-retry on transient failures, improving SLA
adherence by 25%.
 Created automated metadata-driven pipelines using parameterized ADF templates, reducing manual ETL setup
by 40%.
 Developed dynamic Power BI dashboards sourced from Azure Data Lake and Snowflake to deliver KPIs and drill-
down analytics.

 Developed Azure Logic Apps-based automation for orchestrating Azure resource provisioning and data
movement tasks, comparable to Power Automate’s low-code workflow patterns.
 Automated approval workflows using PowerShell scripting and Azure Logic Apps, reducing manual overhead
and enabling faster decision cycles across data engineering operations.
 Familiar with Power Automate integration concepts through automating cross-platform processes involving
Azure SQL, SharePoint, and Outlook, providing a strong foundation for rapid adoption of Power Automate.
 Collaborated on Power BI dashboard deployments integrated with Power Apps front-ends, supporting business
users in accessing self-service reporting and triggering automated data refresh processes.
 Collaborated with QA teams to implement automated data quality validation scripts for all new Snowflake
tables.
 Integrated structured and unstructured data sources into Azure Data Factory and Snowflake, building efficient
data ingestion and transformation pipelines.

 Built and maintained real-time data pipelines for processing sensor data using Azure Databricks, Azure Event
Hub, and Spark Streaming, enabling real-time data processing and anomaly detection.
 Utilized Snowpipe for automated and scalable data ingestion, addressing time-sensitive ETL requirements and
ensuring fast, consistent data load into Snowflake.
 Captured changes in data dimensions and maintained versioning with Stream sets in Snowflake, automating
data updates and scheduling processes with Snowflake Tasks.
 Implemented SQL stored procedures for optimized data transformations in Oracle PL/SQL, SQL Server T-SQL,
and Snowflake SQL, ensuring high-performance data handling.
 Built and tested ETL pipeline systems, conducted root cause analysis, and resolved production issues to ensure
the smooth running of data flow operations across environments.
 Automated reporting and data processing with Power BI and Azure SQL Database, streamlining the creation of
real-time data visualizations for operational decision-making.
 Led deployment of ETL systems using Azure Data Factory, ensuring scalable, automated processes for large-
scale data operations and analytics across healthcare, retail, and finance sectors.
 Developed and optimized data models in Snowflake, ensuring effective storage, retrieval, and access to large
datasets in line with business intelligence and analytics needs.
 Worked with HL7 and FHIR data formats, ensuring compliance with healthcare industry standards during data
ingestion and ETL process development.
 Automated ETL job scheduling, data quality checks, and task coordination using ADF pipelines, ensuring
seamless orchestration and monitoring of data workflows.
 Designed and built APIs using Snowflake and Azure Data Factory to automate data processing and enhance
system integration.
 Implemented security best practices for handling sensitive data, ensuring data privacy and regulatory
compliance in Azure-based environments.
 Maintained effective working relationships with cross-functional teams, managing data integration,
transformation, and analysis tasks, and aligning them with project goals.

Environment: Azure, Azure Data Factory (ADF), Snowflake, Azure Synapse Analytics, Azure Functions, Azure Blob
Storage, Azure Databricks, Azure Data Lake, Azure Monitor, Azure Cosmos DB, Azure Cognitive Services, Power BI, SQL,
Python, PL/SQL, T-SQL, Snowflake SQL, Apache Airflow, GitHub, SVN, Terraform, Azure Resource Manager (ARM),
Jenkins, Jira, TensorFlow, PyTorch, Scikit-learn, PowerShell, Informatica PowerCenter, SSIS, HL7, FHIR.

Data Engineer
Telka LLC, TX Jan 2022 to Dec 2022
Responsibilities:

 Collaborated with the Product Engineering and Data Governance teams to design, test, and support scalable
ETL workflows using Azure Data Factory (ADF), ensuring alignment with data requirements and business
objectives.
 Designed and implemented robust ETL strategies using Azure Data Factory (ADF) and Snowflake for high-
volume data processing.
 Automated infrastructure management and data migrations using Azure Resource Manager (ARM) templates,
Azure CLI, and Terraform, streamlining the process of provisioning infrastructure.
 Optimized SQL performance by improving query plans, partitioning strategies, and leveraging indexing to
minimize processing time in high-volume environments.
 Integrated machine learning models for predictive maintenance, anomaly detection, and real-time data
processing into the data pipeline workflows.
 Automated infrastructure provisioning and ETL pipeline deployments using PowerShell and Azure CLI,
enhancing deployment speed and reducing manual intervention.
 Designed and implemented real-time data pipelines using Kafka, Azure Databricks, and Spark Streaming for
high-frequency data ingestion and processing.
 Led the migration of on-prem SQL Server databases to Azure Data Lake and Snowflake, ensuring smooth data
transitions with minimal downtime.
 Enhanced ETL workflows by implementing dynamic partitioning and performance tuning, reducing ETL
processing time by X%.
 Built and optimized data ingestion frameworks using Azure Databricks, enhancing performance in both batch
and real-time workflows.
 Managed data migration from SQL Server to Azure Data Lake and Snowflake, ensuring smooth transitions and
process automation.

 Automated complex deployment and data integration workflows using Azure Logic Apps and PowerShell
scripting, laying a direct foundation for Power Automate-based automation.
 Designed end-to-end cloud automation pipelines with approval triggers and email notifications in Azure Logic
Apps, closely resembling Power Automate workflow designs.
 Familiar with the Microsoft Power Platform ecosystem, including Power BI and Power Apps integration
patterns, through experience building interactive dashboards and collaborating with app development teams.
 Developed process-driven automations using PowerShell scripting and Azure Logic Apps, enhancing
operational efficiency in a way similar to Power Automate’s low-code solutions.
 Developed complex SQL-based ETL processes using PL/SQL, T-SQL, and Snowflake SQL, optimizing data
transformation workflows.
 Integrated Kafka for building real-time data pipelines, enabling continuous data movement to Azure Synapse
Analytics and Azure Data Lake.
 Led data migration from on-prem SSIS to ADF and Azure Synapse, ensuring scalability and compatibility in the
cloud environment.
 Optimized performance of Spark Databricks clusters, ensuring high availability and reducing pipeline
processing time.
 Developed APIs for automating data processing and integration using Snowflake and Azure Data Factory.

 Developed and implemented robust ETL strategies for high-volume data processing, leveraging Azure Data
Factory (ADF) and Snowflake for efficient data transformation, load balancing, and exception handling.
 Collaborated with cross-functional teams, including engineering, data science, and operations, to ensure the
successful implementation of ETL workflows that meet business and technical requirements.
 Designed and implemented SQL-based ETL processes for high-performance data warehousing and reporting,
optimizing transformation workflows for speed and efficiency.
 Developed and deployed a large-scale data solution on Snowflake, ensuring scalability for processing over 100
datasets and optimizing storage and retrieval for enhanced performance.
 Utilized SQL Sentry and Azure Monitor to proactively monitor and resolve performance issues in ETL pipelines,
ensuring minimal downtime and optimal data flow efficiency.
 Led the integration of Kafka for building real-time data pipelines, ensuring continuous data movement into
Azure Data Lake and Azure Synapse Analytics for near real-time analytics.
 Monitored, troubleshot, and optimized Spark Databricks clusters, ensuring high availability and performance of
ETL pipelines and real-time data processing workflows.
 Automated data pipeline scheduling with Apache Airflow, creating Python-based scripts to orchestrate
workflows for data transformation and integration.
 Designed Source-to-Target Mappings (S2TM) for building core databases and optimizing ETL pipeline
processes, ensuring data consistency and high-quality transformation across environments.
 Performed data preprocessing and feature engineering tasks using Python, preparing datasets for machine
learning model training and ensuring data quality for analysis.
 Developed complex PL/SQL units for managing business-critical scenarios in data processing, ensuring high
availability and data integrity.
 Created SQL Server Agent Jobs and Airflow DAGs for orchestrating ETL workloads across Azure Synapse and
Snowflake.
 Migrated legacy ETL jobs from on-prem SSIS to ADF and Azure Synapse pipelines, ensuring compatibility and
scalability.
 Designed Azure Monitor-based alerts and health checks for production pipelines to proactively detect and
resolve failures.
 Integrated Power BI dataflows with Azure Data Lake to support self-service reporting and reduce reporting lead
times.

 Utilized Kubernetes and Docker for the CI/CD pipeline management, ensuring automated testing and
deployment of ETL code to production environments.

Environment: Azure Data Factory (ADF), Azure Synapse Analytics, Azure Data Lake, Snowflake, Kafka, Azure
Monitor, Azure Blob Storage, SQL Server, Python, PL/SQL, T-SQL, Apache Airflow, Azure Resource Manager (ARM),
Terraform, Azure CLI, Azure Databricks, Power BI, GitHub, SVN, Jenkins, Kubernetes, Docker, CI/CD, SQL Sentry,
Informatica PowerCenter, SSIS, SAP PLM, Apache Spark, Azure Functions.

Junior Data Engineer


Insight Technologies, Bengaluru, India May 2019 to July 2021
Responsibilities:

 Utilized Kubernetes and Docker for the CI/CD pipeline management, ensuring automated testing and
deployment of ETL code to production environments.
 Created PowerShell scripts to automate cloud resource management, reducing manual errors and improving
operational efficiency.
 Developed complex SQL queries to transform raw data into actionable insights, improving decision-making and
operational efficiency for stakeholders.
 Designed and implemented real-time data ingestion pipelines using Azure Databricks and Azure Event Hub,
reducing data latency to near real-time.
 Implemented robust data validation and quality checks in ADF pipelines, ensuring data integrity and accuracy
across all datasets.
 Collaborated with cross-functional teams (data scientists, business analysts, IT) to align data infrastructure with
business objectives, ensuring data-driven decision-making.

 Developed and implemented ETL pipelines using Azure Data Factory (ADF) to orchestrate data workflows,
ensuring efficient data processing, transformation, and integration with Snowflake for optimal data storage.
 Worked with Azure Synapse Analytics and Snowflake to integrate data from multiple sources, ensuring high-
quality, timely data for analysis, and optimizing data warehouses for fast query performance.
 Collaborated with cross-functional teams to align data infrastructure with business objectives, gathering
requirements to support data-driven initiatives and ensure business intelligence goals were met.
 Built and optimized data ingestion frameworks using Azure Databricks, enhancing ETL pipelines for improved
performance and processing large datasets across batch and real-time workflows.
 Participated in data migration projects from on-premises databases to Azure Cloud services, including Azure
SQL Database, Azure Data Lake, and Snowflake, ensuring smooth data transitions and process automation.
 Monitored and maintained ETL pipelines to ensure consistent data flow and reliability across platforms, utilizing
Azure Monitor and Azure Data Factory for performance optimization and troubleshooting.
 Assisted in data cleansing and transformation efforts by writing SQL and Python scripts to clean, filter, and
prepare data for downstream analysis, ensuring high data quality.
 Worked with Azure Blob Storage and Azure Data Lake to store and manage large datasets, adhering to security,
compliance, and data governance protocols to ensure data integrity.
 Gained experience with Azure Logic Apps and Azure Functions to automate workflows, integrate data sources,
and optimize business processes across various systems.
 Created Power BI dashboards from Azure Data Lake insights to assist business users in making evidence-based
decisions.
 Assisted in Excel-based data validation processes prior to ETL pipeline integration for downstream analytics.
 Supported data annotation and pre-processing for machine learning projects in Azure ML.
 Implemented data masking and encryption procedures for sensitive healthcare datasets to comply with
GDPR/CCPA regulations.

 Designed and created visual reports and dashboards using Power BI, empowering business users with
actionable insights and supporting data-driven decision-making.

Environment: Azure Data Factory (ADF), Snowflake, Azure Synapse Analytics, Azure Data Lake, Azure SQL Database,
Azure Functions, Power BI, SQL, Python, PySpark, Azure Blob Storage, Azure Monitor, Azure Logic Apps, Azure
Databricks, GitHub, SVN, Terraform, Apache Airflow, Jenkins.

Education:
Eastern Illinois University | Masters in Computer Science Illinois, USA | 08/2021 - 12/2022
Coursework: Python and Databases, Statistics, Data Analysis and Reporting (Data Visualizations, Reporting)
Malla Reddy College | Bachelors in Computer Science Hyderabad, India | 06/2015 - 05/2019

You might also like