Senior Data Engineer
Description
Hiring for a next-generation technology company building sovereign, secure, and scalable digital platforms across AI, Web3.0, cloud, and health-tech.
We are looking for a hands-on Senior Data Engineer to build and maintain scalable data pipelines, lake house architecture, and analytics infrastructure. You will work with modern open-source technologies to enable data-driven insights and GenAI-powered analytics capabilities.
Key Responsibilities
Data Ingestion & ETL
• Build and maintain Airbyte data pipelines that ingest from multiple sources
• Configure connectors for databases (PostgreSQL, MySQL, MongoDB), APIs, and SaaS applications
• Implement CDC for real-time data replication, or use Kafka streaming
• Implement incremental and full-load sync strategies
• Ensure data validation and quality checks
Workflow Orchestration
• Develop and maintain Airflow DAGs for ETL workflows
• Implement error handling, retry logic, and monitoring
• Schedule complex multi-step data pipelines
• Integrate with databases, APIs, and external systems
Data Warehouse & Analytics
• Design and optimize ClickHouse schemas for OLAP
• Write advanced SQL queries and optimize performance
• Implement materialized views and aggregations
• Manage data partitioning and retention policies
• Build data marts for analytics use cases
Lake House Architecture
• Implement data lake house on object storage (Cloudian)
• Design data partitioning strategies (by date, region)
• Work with Parquet/ORC file formats
• Implement access control and data governance
BI & Dashboards
• Support Apache Superset and Power BI dashboard development
Skills