Nebius

Senior ML Engineer (Token Factory) | JobSetuu

Czechia, Europe, Germany, Israel, Netherlands, UK
Full-Time

Posted 2 hours ago • Via jobicy.com

Description

Job Overview

  • Source: Jobicy

Job Description

About Nebius:

Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure.

Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI.

Listed on Nasdaq (NBIS) and headquartered in Amsterdam, we have a global footprint with R&D hubs across Europe, the UK, North America and Israel. Our team of 1,500+ includes hundreds of engineers with deep expertise across hardware, software and AI R&D.

The role

Token Factory is part of Nebius Cloud, one of the world's largest GPU clouds, running tens of thousands of GPUs. We are building a high-performance inference and fine-tuning platform designed to push foundation models to their hardware limits. Our mission is to maximise throughput, minimise latency, and optimise cost-per-token across that fleet.

Some directions we are currently working on, and which you can be a part of:

  • Inference Optimization: Identify LLM inference bottlenecks to drive production speedups, and squeeze maximum performance out of a wide range of LLM architectures at scale (e.g., GPT-OSS, Kimi K2.5, DeepSeek V3.1/V3.2, GLM-5).
  • Inference Engine Support: Implement novel speculative decoding architectures, optimise components of various LLM designs (dense/MoE, autoregressive/parallel), and contribute to open-source inference engines.
  • Low Precision Training & Inference: Design and productionise low-precision (FP8, NVFP4/MXFP4) training and inference pipelines with measurable gains in throughput and cost-efficiency.

We expect you to have:

  • A profound understanding of the theoretical foundations of machine learning and of the transformer architecture
  • Experience profiling GPU workloads using Nsight, PyTorch profiler, or similar tools
  • Understanding of the GPU memory hierarchy and compute/memory tradeoffs
  • Familiarity with important ideas in the LLM space, such as MHA, RoPE, KV-cache, Flash Attention, and quantisation
  • Understanding of the performance aspects of large neural network training (sharding strategies, custom kernels, hardware features, etc.)
  • Strong software engineering skills (we mostly use Python)
  • Deep experience with modern deep learning frameworks
  • Proficiency in contemporary software engineering approaches, including CI/CD, version control and unit testing
  • Strong communication and leadership abilities

Nice to have:

  • Experience working with open-source inference engines (vLLM, SGLang, TensorRT-LLM), including contributions
  • Experience with kernel languages or DSLs such as Triton, CuTe, CUTLASS, CUDA
  • A track record of building and delivering products (not necessarily ML-related) in a dynamic startup-like environment
  • Strong engineering skills, including experience in developing large distributed systems or high-load web services
  • Open-source projects that showcase your engineering prowess
  • Excellent command of the English language, alongside superior writing, articulation, and communication skills

Benefits & Perks:

  • Competitive compensation
  • Career growth and learning opportunities
  • Flexibility and ownership
  • Collaborative and innovative culture
  • Opportunity to work on impactful AI projects
  • International environment and talented teams

What's it like to work at Nebius:

Fast moving - Bold thinking - Constant growth - Meaningful impact - Trust and real ownership - Opportunity to shape the future of AI 

Equal Opportunity Statement:

Nebius is an equal opportunity employer. We are committed to fostering an inclusive and diverse workplace and to providing equal employment opportunities in all aspects of employment. We do not discriminate on the basis of race, color, religion, sex (including pregnancy), national origin, ancestry, age, disability, genetic information, marital status, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by applicable law.

Applicants must be authorized to work in the country in which they apply and will be required to provide proof of employment eligibility as a condition of hire. 

If you need accommodations during the application process, please let us know.

Expert Career Tips for Senior ML Engineer (Token Factory) Roles

To succeed in a competitive market as a Senior ML Engineer (Token Factory), you need more than just technical skills. Here are some expert strategies to elevate your profile:

  • Build a Strong Portfolio: For technical roles, a clean GitHub or a personal project site is essential. For non-technical roles, a case study portfolio demonstrating problem-solving and impact is equally valuable. Show, don't just tell, what you have achieved in your previous positions.
  • Master the Narrative: When interviewing, use the STAR method (Situation, Task, Action, Result) to structure your answers. Quantify your results wherever possible—mentioning "increased efficiency by 20%" is much more impactful than saying "improved efficiency."
  • Continuous Learning: The industry moves fast. Whether it's staying updated with the latest AI tools or mastering a new management methodology, continuous professional development is key. Consider obtaining industry-recognized certifications that align with Senior ML Engineer (Token Factory) requirements.
  • Networking: Connect with other professionals in similar roles. Join online communities, attend webinars, and engage in meaningful discussions on professional social networks. Often, the best opportunities come through referrals and community engagement.
  • Soft Skills Matter: Communication, empathy, and leadership are often the deciding factors between two equally qualified technical candidates. Cultivate these skills as they are universally valued across all industries and seniority levels.

Additionally, research the specific company's culture and values. Tailoring your application to show how you align with their mission can significantly increase your chances of moving forward in the process.

Salary & Compensation

Salary not disclosed; typically competitive for the role.

Work Arrangement

Type: On-Site

Standard business hours at the office.

Comprehensive Application Strategy & Hiring Process

Applying for a new role is a marathon, not a sprint. Follow this strategic approach to maximize your success rate:

1. Initial Research & Tailoring

Don't send the same resume to every employer. Spend at least 30 minutes researching the company. Look for recent news, their product roadmap, and their team structure. Modify your summary and core competencies to reflect the specific keywords found in the job description.

2. The Perfect Cover Letter

If the application allows for a cover letter, use it to tell a story that your resume cannot. Explain why you are passionate about this specific company and how your unique background makes you the perfect fit for the challenges they are currently facing.

3. Navigating the Multi-Stage Interview

Most modern hiring processes involve 3-5 stages. This typically includes a recruiter screen, a technical or skill-based assessment, a peer interview, and a final leadership round. Prepare for each stage differently: focus on enthusiasm and fit for the recruiter, technical depth for the assessment, and strategic vision for the leadership round.

4. Post-Interview Follow-Up

Always send a personalized thank-you note within 24 hours of each interview. Reference a specific topic discussed during the call to demonstrate your active listening and genuine interest in the role.

By following these steps, you demonstrate a high level of professionalism and attention to detail that sets you apart from the average applicant.

Typical Interview Process

  1. Resume screening
  2. HR call
  3. Skill interview
  4. Final manager interview
  5. Offer

Tip: Research the company's products and culture.

Global Market Intelligence & Relocation Insights

At JobSetuu, we specialize in helping talent navigate the global job market. Here is what you need to know about the current global landscape:

The demand for skilled professionals is increasingly borderless. For international roles, understanding the local cost of living, visa requirements (if applicable), and cultural nuances is vital. If this is a remote role, consider the time zone alignment and the asynchronous communication culture of the hiring organization.

Relocation Support: Many forward-thinking companies offer relocation packages that include moving stipends, temporary housing, and legal assistance with work permits. When evaluating an offer, look beyond the base salary—consider the total compensation package, including equity, bonuses, and healthcare benefits.

Work-Life Balance Trends: Hybrid and remote work have become standard in many regions. Research the local labor laws and common practices regarding work hours and vacation time to ensure the role aligns with your lifestyle goals.

Leveraging JobSetuu's tools can help you compare salaries across different cities and understand the "purchasing power" of your potential offer, ensuring you make an informed decision for your long-term career path.

Skills & Competency Roadmap for Professional Development

To remain competitive professionally, we recommend focusing on the following core competencies over the next 12-18 months:

  • Technical Mastery: Deepen your expertise in the core tools and languages relevant to your field. For developers, this might be cloud architecture; for marketers, it might be data-driven attribution modeling.
  • AI Augmentation: Learn how to leverage generative AI and automation tools to increase your productivity. Understanding how to integrate these technologies into your workflow is becoming a non-negotiable skill.
  • Leadership & Strategy: Even in individual contributor roles, the ability to think strategically and lead projects from inception to completion is highly valued. Focus on stakeholder management and high-level project planning.
  • Data Literacy: The ability to interpret data and use it to drive decisions is essential across all business functions. Familiarize yourself with data visualization and basic analytical concepts.

By investing in these areas, you not only prepare yourself for the role you are applying for today but also build a resilient foundation for the opportunities of tomorrow.
