Job Description
Mandatory Skills Description:
- Strong C/C++ programming skills
- Experience with compiler internals (LLVM, GCC, or any other compiler)
- Basic Python programming skills
- Experience in performance analysis
Project Description:
- Work on GPU support for OpenAI/Triton, a language and compiler for writing highly efficient custom deep-learning primitives. Collaborate with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.
Responsibilities:
- Develop new features for the OpenAI/Triton project on GPUs, and support and optimize the existing ones.
- Communicate with other developers, customers, and project managers.
- Implement tests, write project documentation, and verify the system with unit, component, and functional tests.