Job Description
Job Overview
Join NVIDIA as a Senior Engineer and build cutting-edge AI inference systems that serve large-scale models with astounding efficiency. Focus on optimizing GPU performance and collaborating with top experts.
In this pivotal role, you will have the opportunity to architect high-performance inference stacks and optimize NVIDIA's GPU solutions for maximum productivity. Your expertise will be instrumental in achieving industry-leading benchmarks and implementing state-of-the-art GPU kernels within a collaborative, multi-cloud framework.
Leverage your skills in performance engineering at NVIDIA to drive AI innovation.
Key Responsibilities
- Develop and optimize features for vLLM with latest GPU tech
- Benchmark and profile GPU kernels for efficiency
- Create tools for inference benchmarking methodologies
- Lead orchestration of large-scale inference deployme...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application