Описание
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and production systems, supporting both internal and external use cases across various environments. The ideal candidate combines strong ML fundamentals with deep expertise in backend system design.
You’ll work in a highly collaborative environment, bridging research and engineering to deliver seamless experiences to our customers and accelerate innovation across the company. You will: Build and maintain fault-tolerant, high-performance systems for serving LLMs workloads at scale. Build an internal platform to empower LLM capability discovery.
Collaborate with researchers and engineers to integrate and optimize models for production and research use cases. Conduct architecture and design reviews to uphold best practices in system design and scalability. Develop monitoring and observability solutions to ensure system health and performance.
Lead projects end-to-end, from requirements gathering to implementation, in a cross-functional environment. Ideally you'd have: 5+ years of experience building large-scale, high-performance backend systems. , Python, Go, Rust, C++).
g. ) Experience with LLM capabilities and concepts such as reasoning, tool calling, prompt templates, etc. , Docker, Kubernetes).
Контакты работодателя (email/phone/telegram) скрыты из публичного превью —
отправьте резюме, чтобы мы связали вас напрямую.