Détails du poste
- Lieu de travail : Montreal (Hybride)
- Type de poste : Permanent à temps plein
- Horaire de travail : Horaire variable
Description du poste
Job Title: Azure Support and DevOps Specialist –Operations Production Management
Location: Montreal (Day 1 onsite; in-office presence required 3x/week)
Schedule: Occasional planned weekend rotational support; shifts may vary (9am–6pm or 12pm–9pm, communicated in advance).
Aperçu de l’entreprise
A leading global financial services firm with operations in 43 countries, offering investment banking, securities, investment management, and wealth management services. The firm values integrity, excellence, and teamwork, providing a strong foundation for career growth and professional development.
Aperçu du rôle
As an Azure Support and DevOps Specialist, you will ensure the stability and resilience of critical settlement flows across on-premises and cloud platforms. This role combines production support with project work to enhance system performance, observability, scalability, and reliability. You will collaborate with developers and infrastructure engineers to implement automated reliability solutions, observability systems, and telemetry frameworks.
Responsabilités clés
- Maintain stability and resilience of settlement systems across hybrid environments.
- Provide Level 3 production management support for Java-based applications.
- Act as liaison between support and development teams, managing incident resolution.
- Implement observability and telemetry systems (SLIs, SLOs, metrics, logging).
- Use automation to improve uptime and mitigate risk.
- Debug and troubleshoot large-scale distributed applications across software, infrastructure, and databases.
- Collaborate cross-functionally to enhance system reliability and performance.
Qualifications requises
- Bachelor’s degree in Computer Science or related field.
- 5+ years in IT, with at least 2 years in Level 3 Production Management.
- Experience supporting applications deployed on Azure Cloud.
- Strong knowledge of observability and reliability tools in cloud-native environments.
- Expertise in debugging distributed applications.
- Hands-on experience with Python and Java codebases.
- Strong communication and collaboration skills.
Compétences techniques
- Terraform, GitHub.
- Azure Kubernetes, Azure Spring Apps, Azure Web Apps, AzureSQL, Kafka, Azure ServiceBus.
- Instrumentation for logging/metrics/events using Azure Monitor, Datadog, Prometheus, Grafana.
- Observability design principles.