Back to the stack

[Remote] Network DevOps Engineer, RDMA Fabric Automation - Multiple Openings

Remote Worldwide Hiring now

Note: The job is a remote job and is open to candidates in USA. Vultr is a leading cloud infrastructure company focused on providing high-performance solutions for enterprises and AI innovators. They are seeking a Network DevOps Engineer to automate and operate RDMA-based Ethernet fabrics, ensuring reliable network performance at a global scale.

Responsibilities

  • Automate deployment and operations of large-scale RDMA (RoCEv2) Ethernet fabrics across Vultr data centers
  • Build Ansible and Python-based frameworks to provision, validate, and remediate underlay and overlay networks
  • Integrate network automation with Vultr’s source-of-truth systems (NetBox, OpsMill) for intent-driven configuration and validation
  • Develop telemetry ingestion and correlation pipelines (gNMI, Prometheus, Kafka, custom collectors) for real-time network health and performance metrics
  • Collaborate with platform, orchestration, and product engineering teams to optimize RDMA performance, PFC/ECN behavior, and path symmetry across fabrics
  • Implement CI/CD workflows for network configuration changes — validation, pre-checks, and rollbacks
  • Investigate complex network behaviors across layers — flow hashing, congestion domains, ECMP, and overlay interactions
  • Contribute to the design of next-generation GPU and AI interconnect fabrics, ensuring seamless integration into Vultr’s global network architecture

Skills

  • Solid understanding of modern data center networking: EVPN-VXLAN, BGP, MLAG, QoS, and traffic engineering
  • Deep familiarity with RoCEv2, RDMA transport tuning, ECN/PFC, and lossless Ethernet design
  • Strong experience with automation frameworks like Ansible, and languages like Python, Golang, Rust, or PHP
  • Comfort working with telemetry and monitoring stacks — Prometheus, Grafana, Loki, ELK, or similar
  • Previous experience integrating with NetBox, Nautobot, OpsMill or similar for topology and configuration source-of-truth
  • Familiarity with CI/CD systems (GitHub Actions, Jenkins, ArgoCD) for continuous delivery of network automation
  • Strong Linux networking background, including namespaces, netlink, and system-level debugging

Benefits

  • 100% company-paid insurance premiums for employee medical, dental and vision plans.
  • 401(k) plan that matches 100% up to 4%, with immediate vesting
  • Professional Development Reimbursement of $2,500 each year
  • 11 Holidays + Paid Time Off Accrual + Rollover Plan
  • Commitment matters to Vultr! Increased PTO at 3 year and 10 year anniversary + 1 month paid sabbatical every 5 years + Anniversary Bonus each year
  • $500 stipend for remote office setup in first year + $400 each following year
  • Internet reimbursement up to $75 per month
  • Gym membership reimbursement up to $50 per month
  • Company paid Wellable subscription

Company Overview

  • Vultr is an AI cloud infrastructure platform offering latest generation NVIDIA GPUs and AMD CPUs and GPUs across 32 worldwide regions It was founded in 2014, and is headquartered in West Palm Beach, Florida, USA, with a workforce of 201-500 employees. Its website is https://www.vultr.com.
  • Company H1B Sponsorship

  • Vultr has a track record of offering H1B sponsorships, with 1 in 2024. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job
    Apply for this role Opens the employer's application page — free, no JobStack account needed.

    More from the stack

    [Remote] Staff Software Engineer, Ads Measurement Products

    Remote Worldwide
    View role

    [Remote] Med Dir., Prov Perf & Clin Transf

    Remote Worldwide
    View role

    [Remote] Customer Success Support

    Remote Worldwide
    View role

    [Remote] Product, Platform & Enterprise Full Stack Sr/Staff Software Engineer (Remote - US)

    Remote Worldwide
    View role

    [Remote] Head of Marketing

    Remote Worldwide
    View role

    [Remote] Senior React Engineer (Full Stack / AI / Azure)

    Remote Worldwide
    View role

    [Remote] Full Stack Engineer

    Remote Worldwide
    View role

    [Remote] Senior Product Manager

    Remote Worldwide
    View role

    [Remote] Senior Platform Engineer

    Remote Worldwide
    View role

    [Remote] Front End Web Production Engineer

    Remote Worldwide
    View role

    Remote Chat Support Specialist – Live Customer Service Representative | Flexible Remote Position | No Experience Needed | Training Provided

    Remote Worldwide
    View role

    Principal Software Engineer - GitHub Actions

    Remote Worldwide
    View role

    Per Diem – RN, Triage Per

    Remote Worldwide
    View role

    Logistics Coordinator – Remote – GA, IN, NE, NV, PA, TX, FL

    Remote Worldwide
    View role

    Business Developer (Veneto)

    Remote Worldwide
    View role

    Sociology Adjunct - Online/Remote

    Remote Worldwide
    View role

    Analista de Qualidade de Software | Júnior (13126)

    Remote Worldwide
    View role

    Assistant Property Manager, Multifamily

    Remote Worldwide
    View role

    Value-based Care Registered Nurse (Per Diem)

    Remote Worldwide
    View role

    Customer Care Representative – Remote Pharmacy Support & Patient Wellness Advocate at arenaflex

    Remote Worldwide
    View role