NVIDIA Logo

NVIDIA

Senior Machine Learning Applications and Compiler Engineer

Reposted Yesterday
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
In-Office or Remote
Hiring Remotely in Cambridge, Cambridgeshire, England
Senior level
Design and implement compiler and runtime optimizations for large-scale inference, map neural network workloads onto NVIDIA hardware, benchmark and profile performance, prototype new compilation/runtime techniques, and collaborate with hardware and software teams to influence architecture and tools.
The summary above was generated by AI

NVIDIA is seeking engineers to develop algorithms and optimizations for our inference and compiler stack. You will work at the intersection of large-scale systems, compilers, and deep learning, crafting how neural network workloads map onto future NVIDIA platforms. This is your chance to be part of something outstandingly innovative!

 

What you’ll be doing:

  • Build, develop, and maintain high-performance runtime and compiler components, focusing on end-to-end inference optimization.

  • Define and implement mappings of large-scale inference workloads onto NVIDIA’s systems.

  • Extend and integrate with NVIDIA’s SW ecosystem, contributing to libraries, tooling, and interfaces that enable seamless deployment of models across platforms.

  • Benchmark, profile, and monitor key performance and efficiency metrics to ensure the compiler generates efficient mappings of neural network graphs to our inference hardware.

  • Collaborate closely with hardware architects and design teams to feedback software observations, influence future architectures, and codesign features that unlock new performance and efficiency points.

  • Prototype and evaluate new compilation and runtime techniques, including graph transformations, scheduling strategies, and memory/layout optimizations tailored to spatial processors.

  • Publish and present technical work on novel compilation approaches for inference and related spatial accelerators at top tier ML, compiler, and computer architecture venues.

 

What we need to see:

  • MS or PhD in Computer Science, Electrical/Computer Engineering, or related field, or equivalent experience, with 5 years of relevant experience.

  • Strong software engineering background with proficiency in systems level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures, algorithms, and concurrency.

  • Hands on experience with compiler or runtime development, including IR design, optimization passes, or code generation.

  • Experience with LLVM and/or MLIR, including building custom passes, dialects, or integrations.

  • Familiarity with deep learning frameworks such as TensorFlow and PyTorch, and experience working with portable graph formats such as ONNX.

  • Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial accelerators, or other domain specific processors.

  • Strong analytical and debugging skills, with experience using profiling, tracing, and benchmarking tools to drive performance improvements.

  • Excellent communication and collaboration skills, with the ability to work across hardware, systems, and software teams.

  • Ideal candidates will have direct experience with MLIR based compilers or other multilevel IR stacks, especially in the context of graph based deep learning workloads.

 

Ways to stand out from the crowd:

  • Prior work on spatial or dataflow architectures, including static scheduling, pipeline parallelism, or tensor parallelism at scale.

  • Contributions to opensource ML frameworks, compilers, or runtime systems, particularly in areas related to performance or scalability.

  • Demonstrated research impact, such as publications or presentations at conferences like PLDI, CGO, ASPLOS, ISCA, MICRO, MLSys, NeurIPS, or similar.

  • Experience with large-scale AI distributed inference or training systems, including performance modeling and capacity planning for multi rack deployments.

 

#LI-Hybrid

Top Skills

C
C++
Llvm
Mlir
Onnx
PyTorch
Rust
TensorFlow

Similar Jobs

2 Hours Ago
Remote or Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead a regional sales team (5+ AEs) to hit quota and grow accounts. Recruit and coach reps, manage forecasts and territory balance, drive pipeline and new-logo acquisition, negotiate enterprise deals, develop champions and C-level relationships, expand adoption, and enforce deal hygiene and value-based selling.
Top Skills: AIIt InfrastructureSaaSSalesforce (Sfdc)
2 Hours Ago
Remote or Hybrid
Staines, Surrey, England, GBR
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead on-time, on-budget delivery of ServiceNow engagements using NowCreate methodology. Manage governance, scope, timeline, risks, finances, stakeholders, and partner collaboration to drive customer outcomes and long-term success.
Top Skills: Servicenow,Nowcreate,Ai,Ai-Powered Tools
2 Hours Ago
Remote or Hybrid
Staines, Surrey, England, GBR
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead Expert Services delivery teams using the Now Create methodology to drive large, complex ServiceNow engagements. Own program governance, risk mitigation, stakeholder alignment, and mentor teams to accelerate time-to-value and ensure long-term customer success.
Top Skills: Servicenow,Now Create,Ai

What you need to know about the Belfast Tech Scene

If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account