itjobs.ca Logo
Branch logo

AI DevOps & Reliability Engineer

Branchabout 21 hours ago
Remote
CA$123,000 - CA$160,000/year
Senior Level
Full-Time

Top Benefits

Health And Wellness Programs
Paid Time Off
Retirement Planning Options

About the role

At Branch, we power every touchpoint with links that work and insights that prove it. From click to conversion, we make growth measurable. Our unparalleled attribution, backed by AI-enhanced linking, is trusted to deliver seamless experiences that increase ROI, decrease wasted spend, and eliminate siloed attribution. We bring the same rigor to how we build our team, by empowering our people to move fast, own outcomes, and build something that matters. We take pride in making meaningful investments in our team’s health, wealth, and growth so individuals can thrive as we scale. Our culture values smart, humble, and collaborative teammates who take accountability and drive results in an environment where their work truly moves the business forward. We are innovative, scaling with purpose, and led by seasoned leaders who know how to build enduring companies. Trusted by brands like Instacart, Western Union, NBCUniversal, ZocDoc, and Sephora, we’re big enough to matter, small enough for you to make a real impact. If you’re excited by the grit of building, rapid learning, and shaping the future of customer growth, you’ll find your place here. About The Group We're hiring an AI DevOps & Reliability Engineer to own how software ships and runs at Branch. The role has two areas: half central platform and standards work, half embedded with an engineering team. Centrally, you'll build and operate the delivery platform (CI/CD pipelines, deployment automation, environments) so teams can release safely, frequently, and on demand. Embedded, you'll work hands-on with an engineering team day-to-day on their infrastructure, deployment, and operational practices, mentoring them and building their capability over time. You'll also lead the adoption of AI in DevOps and SRE work at Branch. Bringing modern AI tooling (Claude Code, agentic workflows) into runbook generation, alerting, incident response, and operational tooling is a core part of this role, not a side project. It's a strategic direction we're committed to. As a lead, you'll work directly with engineering leadership to shape the operations and delivery roadmap across multiple milestones. What You'll DoDelivery & Release Engineering Design and expand deployment automation, advancing the org toward on-demand and continuous production releases. Establish release practices and standards: progressive delivery, rollback, release tracking, deployment inventory teams can trust. Extend automation deeper into production paths, reducing manual steps and release toil. Enable verification through automation: quality gates as code, build engineering supports our efforts. Pipelines & Guardrails Own CI/CD standards across teams: quality gates, automated checks, guardrails that catch problems before production. Build pipeline tooling that makes the safe path the easy path for engineers. Environments Design and build out dev, staging, and on-demand (ephemeral) environments that mirror production and spin up on request. Treat environment provisioning as a product: fast, reproducible, self-service. AI-Embedded Ops Bring AI tooling into operations: automated runbook generation, intelligent alerting, AI-assisted incident response, operational tooling. Help build an org-wide, AI-augmented ops practice and share patterns across teams. This is a core part of the role, aligned with Branch's broader AI direction. Infrastructure & GitOps Champion Infrastructure as Code (Terraform / CloudFormation) for provisioning, configuration, and lifecycle management. Drive GitOps-based delivery with Argo CD for secure, repeatable, scalable deployments across Kubernetes. Operational Reliability Bring a strong reliability foundation: alerting practices, on-call, runbooks, SLI/SLO definition, incident response. Partner with engineering teams on the operational practices that keep their services healthy at high volume. Operate and tune high-volume data infrastructure: streaming pipelines (Kafka) and SQL/NoSQL datastores under heavy production load. Strengthen team-level runbooks, operational readiness, and production hygiene; feed improvements back into the platform. Embedded Team Work Embed with an assigned engineering team day-to-day, working hands-on with them on infrastructure, deployment, and reliability work. Mentor team engineers on operational best practices, observability, and reliability. Help build the team's capability over time so good practices stick. Engineering Metrics Stand up DORA metrics (lead time, deployment frequency, change failure rate, MTTR) and use them to target real improvements. Make delivery and reliability health visible to teams and leadership. Leadership & Partnership Work with engineering leadership on the operations and delivery roadmap. Drive cross-team adoption of standards and tooling through collaboration and influence. What We're Looking For Hands-on experience adopting AI into DevOps and SRE practices (Claude Code, Cursor, agents, or similar) to improve automation, debugging, and operational efficiency. 7+ years in DevOps, platform, infrastructure, or related engineering roles, ideally in fast-scaling environments. Strong hands-on Kubernetes and AWS experience. Deep IaC experience (Terraform and/or CloudFormation) and the ability to set IaC standards for other teams. Proven CI/CD architecture experience: pipelines, quality gates, release automation. GitOps experience with Argo CD (or Flux) for Kubernetes delivery. Hands-on experience operating streaming infrastructure (Kafka) in production. Experience managing SQL and NoSQL datastores at high volume: performance, scaling, operational health. Solid scripting/automation skills (Python, Bash, or similar). Working knowledge of observability stacks: Prometheus, Grafana, PagerDuty (Loki / Alertmanager a plus). Familiarity with on-call, incident response, SLI/SLO definition, and runbooks, and the operational practices that support them. Strong collaborator and communicator. Comfortable working across teams, mentoring engineers, and driving alignment without authority. Nice to Have Progressive delivery (canary, blue/green) and feature-flag-driven release experience. Cost / efficiency awareness in cloud infrastructure. Broader data / streaming ecosystem exposure (Spark, schema management, CDC, etc.). What Success Looks Like Teams ship on demand — merge to prod in hours, no tickets, no waiting on you. Deploy frequency up a tier. Faster without breaking — lead time and MTTR down while change-failure rate holds flat. Platform does the work — safe path is the easy path; manual release steps trending to zero; envs self-service in minutes. AI is in the ops loop — runbooks, alerting, incident response AI-assisted; patterns other teams reuse unprompted. Capability sticks — embedded team owns its own deploy and reliability work after you rotate off. Health is visible — DORA metrics instrumented for teams and leadership; roadmap driven by data. This role is 100% remote in Canada. This role does not qualify for relocation or visa sponsorship. In accordance with applicable law, the following represents a reasonable estimated compensation range for this role: the estimated pay range for this role, if based in Canada is 123,000 CAD to 160,000 CAD. Please note that this information is provided for those hired in Canada only. Compensation for candidates outside of Canada will be based on the candidate’s specific work location. Actual compensation will be determined based on skills, experience, and geographic location and may be more or less than the amount shown above. This role additionally includes a 10% annual bonus tied to company goals. The salary range provided represents base compensation and does not include potential equity, which is available for qualifying positions. At Branch, we are committed to the well-being of our team by offering a comprehensive benefits package. From health and wellness programs to paid time off and retirement planning options, we provide a range of benefits for qualified employees. For detailed information on the benefits specific to your position, please consult with your recruiter. Branch is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status. If you think you'd be a good fit for this role, we'd love for you to apply! At Branch, we strive to create an inclusive culture that encourages people from all walks of life to bring their unique, diverse perspectives to work. We aim every day to build an environment that empowers us all to do the best work of our careers, and we can't wait to show you what we have to offer! A little bit about us: Branch is the leading provider of engagement and performance mobile SaaS solutions for growth-focused teams, trusted to maximize the value of their evolving digital strategies. The Branch platform provides a seamless experience across paid and organic, on all channels and platforms, online and offline, to eliminate friction and drive valuable action at the moments of highest intent. With Branch, businesses gain accurate mobile measurement and insights into user interactions, enabling them to drive conversions, engagement, and more intelligent marketing spend. Branch is an award-winning employer headquartered in Mountain View, CA. World-class brands like Instacart, Western Union, NBCUniversal, Zocdoc and Sephora acquire users, retain customers and drive more conversions with Branch. Candidate Privacy Information: For more information on the data that Branch will collect through your application, and how we use, share, delete, and retain that information as part of our recruitment and employment efforts, please see our HR Privacy Policy.

About Branch

Construction

Similar Jobs