LuxoftRemote

Senior Data Engineer (Spark, Databricks)

Description

Project Description: We are seeking a skilled, hands-on Data Engineer with demonstrated experience in Databricks, Apache Spark, Python, and real-time data streaming solutions.

Responsibilities:

  • Implement resource composition logic provided by the Product team into scalable and efficient PySpark code.
  • Develop, optimize, and maintain PySpark applications within the Databricks environment.
  • Utilize and maintain existing CI/CD pipelines in Databricks for automated build, testing, and deployment.
  • Write and maintain unit tests to ensure high code quality and reliability.
  • Perform data validation and quality checks to ensure compliance with defined business rules
  • Support and manage production deployments, ensuring system stability and performance.
  • Troubleshoot and resolve issues across development, testing, and production environments.
  • Collaborate with Product, QA, DevOps, and healthcare domain teams to ensure accurate implementation.
  • Maintain technical documentation for code, workflows, and deployment processes.
  • Pull from and push code to GitHub repositories, following version control and branching best practices.

Mandatory Skills Description:

  • 5+ years of experience in data engineering roles
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 4+ years of experience with Databricks data pipelines
  • Strong hands-on experience with PySpark and/or Apache Spark.
  • Experience working with CI/CD pipelines and DevOps best practice. Proficiency with CI/CD, GitHub (pull requests, branching, code reviews).
  • Experience writing unit tests and performing data validation
  • Experience supporting production systems and deployments
  • Strong analytical and troubleshooting skills.

Nice-to-Have Skills Description:

  • Knowledge of FHIR (Fast Healthcare Interoperability Resources) standards.
  • Experience with healthcare data integration projects.
  • Familiarity with REST APIs and FHIR server integrations.
  • Experience in Agile/Scrum development environments.

Skills

AgileData EngineeringApacheApache SparkGitHubDatabricksDevOpsComplianceRESTSparkPythonCI/CDScrum

Want AI to find more roles like this?

Upload your CV once. Get matched to relevant assignments automatically.

Try personalized matching