CitiusTechHybrid

Data Science/MLOps Engineer - Onsite Interview

Project-Based

Description

Job Description Data Science/MLOps Engineer Who we are: - At CitiusTech , we constantly strive to solve the industry's greatest challenges with technology, creativity, and agility. With over 8,500 healthcare technology professionals worldwide, CitiusTech powers healthcare digital innovation, business transformation, and industry-wide convergence for over 140 organizations through next-generation technologies, solutions, and products. We aim to accelerate the transition to a human-first, sustainable, and digital healthcare ecosystem with the world's leading Healthcare and life sciences organizations and our partners. Here is an opportunity for you to make a difference and collaborate with global leaders to shape the future of healthcare and positively impact human lives. Our vision: - To inspire new possibilities for the health ecosystem with technology and human ingenuity. What is in it for you? Build, deploy, and operationalize scalable AI-powered clinical NLP and machine learning solutions using deep learning, LLMs, and cloud-native big data platforms in healthcare environments. Responsibilities: - Analyze and process large volumes of unstructured clinical and healthcare text using advanced NLP, machine learning, and deep learning techniques Enhance and optimize existing AI/NLP workflows by deg and implementing state-of-the-art algorithms, including Large Language Models (LLMs) and agentic frameworks such as LangGraph, to improve performance, scalability, and usability Develop, maintain, and extend modular NLP components using Python and other relevant programming or scripting languages Perform comprehensive text pre-processing, data quality assessments, feature engineering, and validation of NLP model outputs Design, implement, and execute systematic testing frameworks, error-handling mechanisms, and model performance evaluation methodologies Build, deploy, and manage end-to-end ML pipelines following MLOps best practices, including versioning, monitoring, retraining, and CI/CD automation Create and maintain technical documentation, model documentation, testing reports, and user manuals Design and develop scalable data pipelines for extraction, transformation, and loading (ETL) from diverse data sources, including MCP servers Leverage SQL and AWS big data technologies such as EMR, Spark, and PySpark for large-scale data processing Collaborate closely with Engineering, Data, and Platform teams to design, deploy, and optimize robust and secure AI/NLP infrastructure Utilize AWS services for model development and deployment, including AWS Bedrock for generative AI applications Work with relational databases to manage structured and semi-structured data efficiently. Experience: - 6+ Years Location: - Houston, TX 3days/week Educational Qualifications: - Engineering Degree BE/ME/BTech/MTech/BSc/MSc. Technical certification in multiple technologies is desirable. Skills: - Mandatory skills Strong hands-on expertise in Natural Language Processing (NLP), machine learning, and deep learning Proficiency in Python for building and deploying NLP and ML solutions Experience working with Large Language Models (LLMs), prompt engineering, and agentic workflows (e.g., LangGraph or similar frameworks) Solid understanding of data pre-processing, normalization, feature extraction, and quality validation techniques Strong MLOps experience, including model versioning, pipeline orchestration, CI/CD, monitoring, performance tracking, and retraining strategies Experince to containerization and orchestration tools (Docker, Kubernetes) Proficiency in SQL for data querying, transformation, and analytics Practical experience with AWS big data and compute services (EMR, Spark, PySpark) Working knowledge of AWS services, including AWS Bedrock for generative AI use cases Experience with at least one relational database: PostgreSQL or MySQL Strong understanding of testing strategies, error analysis, and model validation techniques Good-to-Have Skills Prior experience with clinical, biomedical, or healthcare NLP use cases Familiarity with healthcare data standards, terminologies, or ontologies Experience deploying ML/NLP solutions in regulated or production healthcare environments Knowledge of distributed systems and cloud-native data architectures Experience with additional data stores, data warehouses, or NoSQL technologies Strong technical documentation and stakeholder communication skills Experience working in agile or cross-functional product development teams Life at CitiusTech We focus on building highly motivated engineering teams and thought leaders with an entrepreneurial mindset, centered on our core values of Passion, Respect, Openness, Unity, and Depth (PROUD) of knowledge . Our success lies in creating a fun, transparent, non-hierarchical, diverse work culture that focuses on continuous learning and work-life balance. Rated by our employees as the Great Place to Work for according to the Great Place to Work survey. We offer you a comprehensive set of benefits to ensure that you have a long and rewarding career with us. Our EVP Be You Be Awesome is our EVP and it reflects our continuing efforts to create CitiusTech as a great place to work where our employees can thrive, both personally and professionally. It encompasses the unique benefits and opportunities we offer to support your growth, well-being, and success throughout your journey with us and beyond. Together with our clients, we are solving some of the greatest healthcare challenges and positively impacting human lives. Welcome to the world of Faster Growth, Higher Learning, and Stronger Impact. Join CitiusTech. Be You. Be Awesome. To learn more about CitiusTech, visit and follow us on

Skills

ETLCI/CDNatural Language ProcessingDockerAWSPostgreSQLApache SparkData ScienceAIPythonKubernetesMLAgileDeep LearningMySQLNLPSparkSQLMachine Learning

Want AI to find more roles like this?

Upload your CV once. Get matched to relevant assignments automatically.

Try personalized matching