Software Developer/Engineer (LLM / Meta Llama 3 / Mistral / Mixtral / Python)
Description
Trigyn has a long-term contract opportunity for a Software Developer/Engineer with our direct client, a major utility services firm based in Philadelphia, Pennsylvania (Hybrid). Details on the role are listed below.

NOTE:
- Hybrid work. 3 days onsite (Philadelphia, PA).
- Only local candidates preferred.
- In-person interview is required.

Consultant Requirements - On-Prem LLM & Vector DB Implementation

Core Experience:
- Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
- Strong proficiency in Python for LLM inference, prompt engineering, and integration
- Experience with CPU-based inference, model quantization, and performance tuning

Vector Databases & RAG:
- Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or Pgvector
- Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
- Experience generating and managing embeddings and metadata filtering

Security & Governance:
- Understanding of data, air-gapped deployments, and enterprise security requirements
- Experience implementing access controls and audit logging

Nice to Have:
- Experience with LangChain or LlamaIndex
- Exposure to Rust, Go, or C++ for high-performance services
- Familiarity with Docker and Kubernetes for on-prem deployments
- Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
- Prior work in regulated or enterprise environments

Deliverables:
- Reference architecture and deployment guidance
- Working prototype (LLM + vector DB + RAG)
- Documentation and knowledge transfer to internal teams

For Immediate Response call, or send your resume to

TRIGYN TECHNOLOGIES, INC. is an EQUAL OPPORTUNITY EMPLOYER and has been in business for 35 years. TRIGYN is an ISO 9001:2015, ISO 27001:2013 (ISMS), ISO 20000:2018 and CMMI Level 5 certified company.