Within the Operations team, the Senior DevOps Platform Reliability Engineer (PRE) assumes a dual role as a software and systems engineer.
This professional's primary responsibility is to ensure the seamless functionality of our services and production systems while facilitating scalability to accommodate our product launches and subsequent growth.
Collaborating closely with our Product and Development teams, the role entails providing essential support for Percept Cloud's technical operations, infrastructure, and applications.
Beyond joining a dynamic team, this role offers a significant chance to actively shape the trajectory of our product development.
Role and Responsibilities
Infrastructure & Design (20%)
- Demonstrate a strong customer-centric approach, prioritizing the success of our platform and user experience as a constant focal point.
- Architect, construct, and manage foundational infrastructure components that enable Percept Cloud's scalability and support its expanding SaaS customer base and remote on-premises deployments.
- Take ownership of key infrastructure elements, including virtualization technologies, storage systems, data processing pipelines, secrets databases, release engineering, and intrusion detection systems.
- Develop tools to automate the deployment, monitoring, and operations of the overall platform.
- Enhance deployment processes to achieve a high level of reliability and simplicity.
- Contribute to operationalizing a diverse array of technologies essential for our platform (Azure Kubernetes, Confluent Kafka, Elastic Stack, MongoDB, etc.).
- Act as a pivotal member of a software development team, engaging in all stages of development: technical design, peer reviews, capacity planning, performance optimization, production deployment, and ongoing maintenance.
- Investigate and resolve production issues across various services and layers of the technology stack.
- Play a role in root cause analysis efforts, implementing solutions and applying lessons learned to prevent recurrent issues.
- Strengthen tools and automation to address gaps in existing systems and create new solutions to tackle increasingly complex challenges.
- Thoroughly document all actions, transforming findings into replicable processes, and ultimately, into automated procedures.
- Lead the definition, planning, execution, and monitoring of projects.
- Ensure smooth day-to-day operations, including monitoring, measuring, and troubleshooting infrastructure and services.
- Manage access controls for critical infrastructure resources.
- Contribute to the development of a CI/CD pipeline that aligns with organizational performance and security standards, emphasizing seamless integration with cloud platforms and self-service workflows.
- Generate and maintain project artifacts such as project charters, contracts, project plans, issue/defect/risk trackers, and lessons-learned reports; coordinate and organize meetings and recurring status updates for key stakeholders.
- Contribute to the Azure DevOps backlog of the team.
- Conduct post-mortems and in-depth root cause analyses to drive ongoing improvement.
- Collaborate closely with internal IT teams and external vendors to ensure uninterrupted 24/7 business operations.
Qualifications
- Hold a Bachelor's degree in computer engineering or computer science.
- Possess a solid technical background with 5 years of experience.
- Have 3 years of practical experience supporting production systems.
- Have 1 year of hands-on experience managing production workloads in Kubernetes.
- Understand and apply Agile methodology effectively.
- Showcase strong technical writing skills.
- Possess a natural investigative instinct and a curiosity to explore and apply new technologies.
- Exhibit adaptability and flexibility in response to changing situations and requirements.
- Demonstrate end-to-end ownership of initiatives, from inception through deployment and ongoing maintenance.
- Communicate excellently, both internally and with clients, and deliver effective presentations.
- Take complete ownership of assigned components.
- Be a collaborative team player and a proactive self-starter.
- Be bilingual, with proficiency in both French and English.
This role not only offers the chance to be part of a dynamic team but also provides a significant opportunity to shape the future direction of our products and services.