NVIDIA Careers | Hiring DevOps & Automation Engineer in India (Bengaluru, Hyderabad, Pune, Mumbai, Remote) About NVIDIA

NVIDIA is at the forefront of AI, High-Performance Computing (HPC), and visualization technologies. Since the invention of the GPU, NVIDIA has transformed modern computing and continues to power groundbreaking advancements across industries—from scientific research and deep learning to autonomous driving and cloud computing.

Recognized globally as one of the most desirable employers in the technology industry, NVIDIA attracts some of the world’s brightest and most ambitious engineers. We thrive on innovation, creativity, and collaboration. If you’re looking for an opportunity to shape the future of AI and computing, this is the perfect place to accelerate your career.

We are expanding our Software Infrastructure Team in India and seeking a highly skilled DevOps & Automation Engineer to design, build, and enhance the infrastructure that supports large-scale GPU clusters. These systems—interconnected via NVLink and InfiniBand—are the backbone of today’s fastest HPC and AI workloads.


Why Join NVIDIA as a DevOps & Automation Engineer?

Working at NVIDIA means more than just building pipelines or managing infrastructure—it means being part of a mission-driven organization shaping the future of AI and high-performance computing. As a DevOps Engineer here, you will:

  • Work on cutting-edge infrastructure supporting next-generation AI research.
  • Collaborate with global engineering teams solving complex, large-scale challenges.
  • Contribute directly to GPU-powered cluster automation that drives innovation across industries.
  • Be part of a company consistently ranked as a “Great Place to Work” and known for its employee-centric culture.

This is not just another DevOps job. It’s an opportunity to engineer systems at a scale and complexity few organizations in the world can match.


Key Responsibilities

As a DevOps & Automation Engineer at NVIDIA, you will play a critical role in ensuring our systems remain scalable, reliable, and efficient. Your day-to-day responsibilities will include:

  1. CI/CD Pipeline Development & Management
    • Build and maintain Continuous Integration and Continuous Deployment pipelines that accelerate release cycles while ensuring reliability.
    • Enable modularized development by decoupling monolithic systems into scalable, loosely coupled components.
  2. Automation & Infrastructure Engineering
    • Design automation workflows for software release management, dependency handling, and system updates.
    • Implement infrastructure-as-code frameworks to simplify provisioning, scaling, and cluster management.
  3. GPU Cluster Management
    • Develop automation solutions for provisioning and maintaining GPU clusters connected with NVLink and InfiniBand.
    • Monitor performance, ensure high availability, and support workload scalability.
  4. Monitoring & Troubleshooting
    • Automate system health monitoring, logging, and alerting using tools such as Prometheus and Grafana.
    • Diagnose and resolve complex issues across distributed systems with minimal downtime.
  5. Cross-Functional Collaboration
    • Partner with global engineering teams to align on best practices, infrastructure standards, and release processes.
    • Support firmware/software rollouts while minimizing operational risks.

Skills & Qualifications

To succeed in this role, you should bring a blend of technical expertise, problem-solving skills, and collaborative mindset.

  • Education:
    • BS/MS in Computer Science, Computer Engineering, or related technical discipline, or equivalent hands-on experience.
  • Technical Skills:
    • 5+ years managing infrastructure in high-performance or distributed environments.
    • Expertise in Python, Ansible, and Shell scripting.
    • Hands-on experience with CI/CD tools (Jenkins, GitLab CI/CD, etc.) and infrastructure-as-code (Terraform, Ansible, Puppet, Chef).
    • Solid understanding of Linux, networking, and distributed systems design.
    • Ability to break down and refactor monolithic systems into modular, scalable architectures.
  • Soft Skills:
    • Strong problem-solving and analytical thinking.
    • Excellent cross-functional communication and collaboration abilities.
    • Passion for learning new technologies and applying them in real-world scenarios.

Preferred (Nice-to-Have) Skills

While not mandatory, the following skills will make you stand out:

  • Experience with cluster management tools like Slurm.
  • Familiarity with NVIDIA DGX systems and GPU-based clusters.
  • Understanding of observability tools (Prometheus, Grafana).
  • Proven record of leading DevOps process improvements and driving team-wide efficiency.

Career Growth at NVIDIA

At NVIDIA, you will be empowered to:

  • Grow Your Skills – Continuous learning opportunities, certifications, and exposure to cutting-edge AI infrastructure.
  • Advance Your Career – Clear career progression paths with opportunities to lead projects, mentor teams, and take on global responsibilities.
  • Make an Impact – Your work will directly support groundbreaking innovations in AI, autonomous driving, healthcare, robotics, and HPC research.

NVIDIA fosters a culture where engineers don’t just execute tasks—they innovate, collaborate, and shape the future of technology.


Work Locations

We are hiring across multiple locations in India, including:

  • Bengaluru
  • Hyderabad
  • Pune
  • Mumbai
  • Remote (India)

This flexibility ensures you can choose the work environment that best suits your lifestyle while staying connected with global teams.


About NVIDIA’s Culture

At NVIDIA, we believe that people are our greatest asset. Our inclusive, innovative, and collaborative culture has earned us a reputation as one of the world’s most desirable technology employers.

  • Innovation at Scale: Work on projects that redefine industries.
  • Collaboration Across Borders: Join global teams solving challenges that impact millions worldwide.
  • Employee Well-being: Competitive compensation, benefits, and work-life balance.
  • Recognition: NVIDIA is consistently named a Great Place to Work and ranks high among top employers worldwide.

How to Apply

If you’re passionate about automation, DevOps, and building infrastructure at scale, and want to be part of a team shaping the future of AI and HPC, we want to hear from you!

Apply Now for the role of DevOps & Automation Engineer at NVIDIA (JR2000937) and become part of one of the most innovative technology companies in the world.

🔗 Apply Here – NVIDIA Careers

 

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top