Our customer's product is an AI-powered platform that helps businesses make better decisions and work more efficiently. It uses advanced analytics and machine learning to analyze large amounts of data and provide useful insights and predictions. The platform is widely used in various industries, including healthcare, to optimize processes, improve customer experiences, and support innovation. It integrates easily with existing systems, making it easier for teams to make quick, data-driven decisions to deliver cutting-edge solutions.
Requirements
- Bachelor’s or Master’s degree in Computer Science or related field;
- 4+ years of hands-on experience with Machine Learning and production LLM systems;
- Strong ML fundamentals: transformers, prompt engineering, embeddings, vector search;
- Backend API experience with FastAPI, async patterns, and rate limiting;
- Experience with vector databases (Pinecone, Weaviate, Chroma) and hybrid search;
- Advanced Python skills: async/await, type hints, Pydantic, SOLID principles;
- MLOps experience: MLflow, model versioning, A/B testing; Langfuse preferred;
- NLP & computer vision experience: document understanding, OCR, GPT-4 Vision;
- Experience building feature pipelines, real-time & batch inference, and model serving;
- Familiarity with HuggingFace (required); LangChain / LlamaIndex preferred.
Nice to have skills:
- Understanding of DevOps, CI / CD including: Docker containerization, Azure DevOps pipelines or GitHub Actions, Kubernetes (nice to have);
- Data security including: Multi-tenant data isolation, Secure key management (Azure Key Vault), Audit trail implementation;
- Experience in designing on cloud platform including: Azure (strongly preferred): Azure OpenAI, Blob Storage, Key Vault, Container Registry, AWS or GCP;
- Experience in data engineering in Big Data systems including: Large-scale data processing, ETL/ELT pipelines.
Responsibilities:
- Design and build scalable backend systems, APIs, and microservices with FastAPI;
- Write high-quality backend code using Python, SQL, async/await, and solid OOP principles;
- Apply software best practices to ensure reliability, scalability, and on-time delivery;
- Implement dependency injection, layered architectures, and SOLID design patterns;
- Integrate Azure OpenAI (GPT-4, GPT-4 Vision) with robust retry and error handling;
- Build LLM observability with Langfuse (prompts, tokens, cost, latency);
- Develop prompt management with versioning, fallbacks, and cost optimization strategies;
- Orchestrate async workflows using Celery for complex pipelines;
- Design multi-tenant architectures with strict data isolation;
- Integrate third-party APIs (Veeva Vault, Adobe PDF Services, OCR);
- Troubleshoot systems using structured logging and distributed tracing;
- Document APIs and changes using OpenAPI/Swagger.
Benefits
- Awesome projects with an impact
- Udemy courses of your choice
- Team-buildings, events, marathons & charity activities to connect and recharge
- Workshops, trainings, expert knowledge-sharing that keep you growing
- Clear career path
- Absence days for work-life balance
- Flexible hours & work setup - work from anywhere and organize your day your way