Full-time, posted June 23, 2026

Job Description

Job Summary

We are looking for a highly capable engineer/researcher to lead R&D on Small Language Models (SLMs) and Vision‑Language Models (VLMs) for edge, low‑latency, and cost‑efficient production scenarios. You will own the continuous pretraining, supervised instruction tuning (SFT), and compression/distillation pipelines, and work closely with platform teams to deliver reliable, measurable improvements in inference efficiency, tool‑use success rate, and overall model quality.

Key Responsibilities
  • SLM/VLM Training: Continuous Pretraining & Instruction Tuning (SFT)
      • Conduct continuous pretraining and SFT for SLMs and VLMs to improve task performance and domain adaptation.
      • Build reproducible training workflows in PyTorch, including data processing, training, evaluation, and model versioning.
  • Compression, Distillation & Edge/Low‑Latency Inference Optimization
      • Design and implement...
