itjobs.ca Logo

About the role

The Role

We’re looking for a Software Developer Specialist to join our Disaster Recovery Governance Team, reporting to the Director - Software Development. In this role, you’ll help shape how we define, implement, and continuously improve the resilience of our systems and platforms across the organization.

You'll fit right in if you enjoy solving complex technical challenges, working across teams and systems, and building solutions that hold up when it matters most!

WhatYou’llDo

  • Design and implement disaster recovery solutions across services and platforms, ensuring systems meet defined recovery and resiliency standards.

  • Develop and evolve disaster recovery governance frameworks that establish the standards, processes, and capabilities needed to build a scalable enterprise disaster recovery platform.

  • Collaborate with engineering, infrastructure, and operations teams to understand system dependencies and develop effective failover strategies across environments.

  • Build and maintain automation and tooling to support recovery testing, failover execution, and environment readiness.

  • Participate in disaster recovery tests and simulations, contributing to execution, validation, and follow-through on identified improvements.

  • Document recovery procedures, system dependencies, and test outcomes to support operational readiness and audit requirements.

WhatYou’llBring

  • Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or equivalent experience.

  • 4+ years experience designing, building, and supporting distributed systems, with a strong understanding of failure modes, system dependencies, and resiliency patterns (e.g., retries, graceful degradation, recovery-ready design).

  • Solid grounding in disaster recovery principles, including failover strategies, recovery planning, and measurable recovery objectives (RTO/RPO), with experience supporting or participating in regular DR testing.

  • Hands-on experience in AWS cloud environments, with familiarity across infrastructure, application layers, and data processing systems.

  • Experience building or contributing to automation and tooling that supports reliability, testing, or operational workflows (e.g., readiness checks, repeatable recovery tests, failover/rollback execution).

  • Strong collaboration and communication skills, with the ability to translate complex technical concepts into clear documentation, runbooks, and shared standards across teams.

  • Familiarity with monitoring, alerting, and operational readiness practices, including backup and data protection concepts, restore validation, and infrastructure-as-code for repeatable environments.

Nice to Have

  • Experience supporting disaster recovery tests and resilience “game day” exercises (including controlled fault-injection experiments).

  • Familiarity with high-availability architectures across multiple environments or regions, ideally in regulated, reliability-focused, or globally distributed environments.

  • Experience developing in Java or Python is considered an asset.

This position will be located in St. John’s or Toronto and offers the opportunity for a hybrid work environment at least 3 days a week in-office, subject to change, providing flexibility and accessibility for qualified candidates.

Come as You Are

Nasdaq is an equal opportunity employer. We welcome applications from candidates of all backgrounds and identities.

We are committed to fostering an inclusive workplace where diverse perspectives, experiences, and identities are valued and celebrated.

We ensure that individuals with disabilities are provided with reasonable accommodation throughout the hiring process.

About NASDAQ

Financial Services