Full-time | Posted June 04, 2026
Job Description

Help us push the boundaries of AI inference at NVIDIA, where your systems expertise shapes both the technology and the teams building on top of it! We're looking for a Senior Software Engineer to work at the frontier of large-scale LLM serving, partnering directly with some of the world's most technically demanding customers to unlock the full performance potential of NVIDIA's inference stack. In this role, you'll combine deep systems knowledge with hands-on customer engagement: profiling real deployments, benchmarking across GPU clusters, and turning insights into improvements that ripple across the open-source ecosystem.

Do you love digging into performance problems that don't have obvious answers, and want your work to have an impact far beyond a single codebase? We'd love to talk. Unlike traditional customer-facing engineering roles, this one expects you to go far deeper: contributing to vLLM, NVIDIA Dynamo, and the tooling that makes every engineer on your team more effective. <...
