Observability Engineer
About the role
Must-have8 years hands-on experience in observability, SRE, or DevOps roles with proven expertise across infrastructure and application-level reliability.Deep expertise in observability tooling Dynatrace, ELK, Splunk, and PagerDuty demonstrated understanding of observability principles (instrumentation, correlation IDs, SLISLO frameworks).Advanced proficiency with Azure Kubernetes Service (AKS), Terraform, and Azure managed services (SQL MI, Redis, Functions, Event Grid) proven ability to design and implement infrastructure-as-code solutions.Strong hands-on experience instrumenting applications for comprehensive observability distributed tracing, metrics collection, and log aggregation across Node.js and .NET applications in microservices and event-driven architectures.Proven troubleshooting expertise in distributed systemsdiagnosing root causes across multiple service layers, databases, caches, and APIs in production environments.Excellent incident management skills hands-on experience with PagerDuty and ServiceNow ability to resolve high-severity incidents rapidly and conduct effective root cause analysis.Knowledge of incident, problem, and change management processes, including SRE principles, blameless postmortems, and chaos engineering practices.Exceptional communication and leadership