Full-time Posted June 17, 2026
Apply Now

Job Description

Job Summary

We are looking for a highly capable engineer/researcher to lead the R&D of Small Language Models (SLMs) and Vision-Language Models (VLMs) for edge / low-latency and cost-efficient production scenarios. You will own the continuous pretraining, supervised instruction tuning (SFT), and compression/distillation pipelines, and work closely with platform teams to deliver reliable, measurable improvements in inference efficiency, tool‑use success rate, and overall model quality.

Key Responsibilities

1) SLM/VLM Training: Continuous Pretraining & Instruction Tuning (SFT)

  • Conduct continuous pretraining and SFT for SLMs and VLMs to improve task performance and domain adaptation.

  • Build reproducible training workflows in PyTorch , including data processing, training, evaluation, and model versioning.

2) Compression, Distillation & Edge/Low‑Latency Inference Optimization

  • ...

Apply for This Position

Ready to take the next step? Click the button below to submit your application.

Submit Application