Varsity portfolio draft picks

Research Engineer - Post Training

Kog

Paris, France
Posted on Jul 25, 2025

Location: Paris, France

Employment Type: Full-time

Location Type: Hybrid

Department: Engineering

About Kog:

Kog is a real-time AI startup, founded a little over a year ago, that aims to revolutionize how AI is used in digital experiences. The goal is to make AI faster, more efficient, and more intuitive.

We optimize at a very low level with our own solutions, and we seek out new ideas and implement them ambitiously.

Kog is based on two axes:

  • Hardcore GPU Engineering

  • Custom model architectures for speed

Our ultimate objective is a 10x speedup from GPU engineering and a 10x speedup from model architecture, for a combined 100x speedup!

About the Model Architecture team:

The team strives to deliver extremely low-latency inference models. Next-generation applications set new performance targets, and we build and deliver pipelines to meet these challenges.

As a post-training engineer, you'll be responsible for understanding the Product's needs and delivering high-quality models based on our architecture.

What you'll do:

  • You will translate business needs into quality standards. You will select among existing benchmarks and develop custom ones.

  • You'll design, implement, and validate the post-training recipe.

  • You'll highlight the capabilities and limitations of our custom architecture from a Research and Product perspective.

  • You'll help prioritize the next research topics, contributing to the continuous improvement of our models and product.

About you:

Must-have:

  • You have 2+ years of experience in fine-tuning LLMs to product needs.

  • You have experience in visualizing and understanding data.

  • You have strong communication skills and can convert real-world use cases into clear benchmark ideas and implementations.

  • You have strong coding skills in Python and PyTorch, and experience with a fine-tuning or experiment-tracking tool such as MLflow.

Nice-to-have:

  • You have a deep understanding of compression and fine-tuning algorithms and can pick the right one in any given scenario.

  • You have worked in an HPC environment (SLURM).

What we offer:

  • Competitive salary

  • Equity (BSPCE)

  • Elite technical challenges

  • World-class team (9 engineers, including 3 PhDs)

  • A creative environment where your goal is to push the limits

  • The equipment you need to perform

  • WeWork offices in the 13th district of Paris (near Station F)

  • After-work events during our Paris week

You can apply right below if you feel that you're up to the task!