NVIDIA-logo
NVIDIA
ยท
June 6, 2025
Apply Now
This job has closed.

Windows AI Engineering Intern - Fall 2025

Santa Clara, CA
Internship
Onsite
$18/hr - $71/hr
Intern
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. As a Windows AI Engineering Intern, you will develop inference runtimes and optimize GenAI pipelines while collaborating with various teams to enhance AI integration in applications for Windows.

Responsibilities

  • Partnering with NVIDIA software, research, architecture, and product teams, aligning strategies and technical needs for fostering the ecosystem of AI on a Windows RTX PC.
  • Performing in-depth analysis and optimization of AI models, AI frameworks, data processing pipelines, and inference backends to ensure the best performance on current and next-generation GPU architectures.
  • Identifying and implementing compute and memory optimizations across the full AI inference stack on RTX Windows PC.
  • Developing model compression and fine-tuning techniques to reduce resource consumption and improve performance, enabling efficient deployment and better user experience.
  • Designing and implementing an optimized framework for running AI NPCs in gaming applications as part of the NVIDIA ACE Platform.
  • Collaborating with Microsoft to drive advancements in APIs, AI frameworks, and platforms for developing and deploying AI inferencing applications.
  • Ensuring the effective deployment of directed tests through collaboration with the automation team, thereby ensuring the robustness of automated testing.

Qualification

Required

  • Pursuing BS, MS, or PhD in Computer Science, Software Engineering, Mathematics, or a related field.
  • Excellent C++ programming and debugging skills with a strong understanding of data structures and algorithms.
  • Strong analytical and problem-solving abilities, with the capacity to multitask effectively in a dynamic environment.

Preferred

  • Experience with AI inferencing pipelines and applications using ML/DL frameworks like PyTorch, ONNX Runtime preferred.
  • Understanding of modern techniques in Machine Learning, Deep Neural Networks, and Generative AI with relevant contributions to major open-source projects will be a plus.
  • Proficiency in lower-level system/GPU programming, CUDA, and developing high-performance systems.
  • Hands-on experience with building applications using graphics APIs like OpenGL, DirectX, Vulkan, etc.
  • Consistent track record of delivering end-to-end products with geographically distributed teams in multinational product companies.

Benefits

  • Intern benefits
NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI.
Glassdoor
4.6
Founded in 1993
Santa Clara, California, USA
10001+ employees
https://www.nvidia.com

Similar Job