Azure Data Engineer
Description
Company Profile: Founded in 1976, CGI is among the largest independent IT and business consulting services firms in the world. With 94,000 consultants and professionals across the globe, CGI delivers an end-to-end portfolio of capabilities, from strategic IT and business consulting to systems integration, managed IT and business process services and intellectual property solutions. CGI works with clients through a local relationship model complemented by a global delivery network that helps clients digitally transform their organizations and accelerate results. CGI Fiscal 2024 reported revenue is CA$14.68 billion and CGI shares are listed on the TSX (GIB.A) and the NYSE (GIB). Learn more at cgi.com.
Job Summary
We are looking for a hands-on Azure Data Engineer with 3–5 years of experience in building and supporting data pipelines on Microsoft Azure. The ideal candidate will have practical experience with Apache Airflow orchestration, Python development, and containerized deployments on Red Hat OpenShift (OCP). This role focuses on developing, operating, and enhancing data pipelines, supporting platform upgrades, and working closely with senior engineers and stakeholders to deliver reliable data solutions.
Key Responsibilities
Data Engineering
- Develop and maintain data ingestion and transformation pipelines on Azure.
- Support batch and near-real-time data processing use cases.
- Work with structured and semi-structured data sources.
- Assist in implementing data models for analytics and reporting.
Apache Airflow - Orchestration
- Develop and maintain Apache Airflow DAGs under the guidance of senior engineers.
- Implement scheduling, retries, SLAs, and dependency management.
- Support Airflow patching, upgrades, and environment validations.
- Troubleshoot DAG failures, performance issues, and data pipeline errors.
- Monitor workflows and ensure operational stability.
Python Development & Version Support
- Develop and enhance Python-based data pipelines and Airflow DAGs.
- Support Python version upgrades and dependency updates.
- Fix issues related to deprecated libraries and breaking changes.
- Write clean, reusable, and testable Python code.
Azure Data Platform
- Work with Azure Data Factory (ADF) for data ingestion and orchestration.
- Develop transformations using Azure Databricks (PySpark / Spark SQL).
- Use Azure Data Lake Storage Gen2 (ADLS) for data storage.
- Support analytical workloads using Azure Synapse Analytics.
- Follow security best practices using Azure Key Vault and RBAC.
OpenShift Container Platform (OCP)
- Support Apache Airflow deployments on Red Hat OpenShift (OCP).
- Work with containerized Airflow components (Scheduler, Webserver, Workers).
- Assist with configuration of Pods, ConfigMaps, and Secrets.
- Help troubleshoot Airflow issues in containerized environments.
DevOps & Operational Support
- Use Git-based version control for code management.
- Support CI/CD pipelines for data workloads.
- Assist with monitoring using Airflow logs and Azure Monitor.
- Participate in incident resolution and root cause analysis.
Required Skills & Qualifications
Core Skills
- 3–5 years of experience in Data Engineering.
- Hands-on experience with Microsoft Azure data services.
- Working experience with Apache Airflow.
- Strong proficiency in Python and SQL.
- Basic to intermediate experience with Spark / PySpark.
Mandatory Experience
- Hands-on development of Airflow DAGs.
- Exposure to Python version upgrades or dependency management.
- Experience supporting Airflow deployments on OpenShift (OCP) or Kubernetes-based platforms.
Skills:
Kubernetes