NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation driven by extraordinary technology and amazing people. NVIDIA is looking for a highly motivated Site Reliability Engineer (SRE) to join the team behind NVIDIA AIR, the Digital Twin for Data Center Simulation web application. NVIDIA AIR enables cloud-scale efficiency by creating identical replicas of real-world data center infrastructure deployments. To learn more, visit NVIDIA AIR.
What you'll be doing:
Design, deploy, and manage IaaS platforms with a focus on high availability and performance.
Automate infrastructure operations using tools like Terraform, Ansible, and Python.
Focus on efficiency by automating repetitive workflows.
Develop monitoring and observability tooling to detect and prevent outages using Prometheus, Grafana, ELK, etc.
Deploy and troubleshoot non-disruptive cloud operations with an emphasis on secure production infrastructure.
Manage deployment/upgrades for Operating Systems, Kubernetes (k8s) clusters, and other orchestration tools.
Provide day-to-day support for engineering activities with CI/CD tools like Git and Jenkins.
Implement and enforce best practices around infrastructure security, access control, and operational efficiency.
What we need to see:
BS degree in Computer Science, Software Engineering, or a related field (or equivalent experience).
3–5+ years of experience in a Site Reliability, DevOps, or Systems Engineering role.
Strong automation and scripting skills in Ansible, Python, and Shell Scripting.
Experience in IaaS environments, including deploying, configuring, and administering Linux-based bare metal servers.
Deep experience in infrastructure engineering, focused on managing and monitoring a highly available production infrastructure.
Skilled in observability practices, using Prometheus, Grafana, ELK/EFK, and integrated alerting systems.
Solid grasp of Linux internals and core networking concepts including NAT, DNS, DHCP, routing, and firewall configuration with iptables or nftables.
Experience with modern deployment architecture for non-disruptive cloud operations, including blue-green and canary rollouts.
Proficiency in Kubernetes, Docker, QEMU, and Libvirt.
Ways to stand out from the crowd:
Hands-on expertise with AWS, including deploying complex, load-balanced, and highly available workloads.
Proficiency in debugging network issues in both infrastructure and SDN.
Experience with performance tuning and benchmarking across storage, compute, or networking.
Experience implementing robust metrics collection and alerting infrastructure.
Familiarity with compliance standards such as FedRAMP, HIPAA, and SOC 2.
With competitive salaries and a generous benefits package (www.nvidiabenefits.com), we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
The base salary range is 120,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.