Job Description
Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.
Participation is project-based, not permanent employment.
What This Opportunity Involves
You design mathematics problems to challenge a frontier AI model. The problem must have an answer verifiable by code, and the problem has to require a specialized tool like Z3, cvc5, SageMath, Macaulay2, or others. NumPy or SymPy on their own won't cut it. Each problem runs inside a sealed Linux container with the tool pre-installed and a programmatic judge that grades the model's answer.
As an expert author, you:
Pick an anchor tool and design a problem that hinges on its usage Write a Python reference solution, supply input files optionally where needed Decide the numerical answer and how close the model needs to get to count as right Test the proble...
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.
Participation is project-based, not permanent employment.
What This Opportunity Involves
You design mathematics problems to challenge a frontier AI model. The problem must have an answer verifiable by code, and the problem has to require a specialized tool like Z3, cvc5, SageMath, Macaulay2, or others. NumPy or SymPy on their own won't cut it. Each problem runs inside a sealed Linux container with the tool pre-installed and a programmatic judge that grades the model's answer.
As an expert author, you:
Pick an anchor tool and design a problem that hinges on its usage Write a Python reference solution, supply input files optionally where needed Decide the numerical answer and how close the model needs to get to count as right Test the proble...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application