Hire Right Search Partners
Noida, India

Data/Analytics Lead

Description

Hiring for a next-generation technology company building sovereign, secure, and scalable digital platforms across AI, Web3.0, cloud, and health-tech.

Lead the design and implementation of a modern analytics platform using 100% open-source technologies. Build scalable data pipelines, a lakehouse architecture, and BI infrastructure from scratch. Manage a team of 4-6 data engineers.

Key Responsibilities

Leadership & Strategy

• Lead a team of 4-6 data engineers
• Define analytics platform architecture and roadmap
• Build the data platform from scratch using open-source tools
• Establish data governance and quality frameworks

Platform Architecture

• Design an end-to-end analytics platform on an open-source stack
• Build a data lakehouse on Cloudian object storage
• Architect ClickHouse for OLAP (sub-second queries)
• Design multi-source ingestion (APIs, databases, SaaS)
• Implement real-time and batch pipelines

Data Ingestion & ETL

• Build pipelines using Airbyte (CDC, connectors, validation)
• Orchestrate workflows with Apache Airflow (DAGs, monitoring, error handling)
• Design incremental/full-load strategies
• Stream data using Kafka

Data Warehouse & Analytics

• Optimize ClickHouse (materialized views, distributed tables)
• Design dimensional models (star/snowflake schemas)
• Build semantic layers for consistent metrics

BI Platform

• Deploy Apache Superset for self-service analytics
• Integrate Power BI (optional)
• Implement Redis caching for performance

Data Quality & Governance

• Implement quality frameworks (Great Expectations, Soda)
• Build data reconciliation automation
• Establish data lineage, catalog, and compliance

Performance & Automation

• Write Python automation tools and custom connectors
• Implement CI/CD for data pipelines
• Optimize costs and performance

Skills

ETL, Redis, Airflow, Apache, CI/CD, Power BI, Python, Superset, Compliance, GDPR, Data Warehouse, Kafka, AI, Snowflake
