Who We Are
Welcome to TELUS Digital — where innovation drives impact at a global scale.
As an award-winning digital product consultancy and the digital division of TELUS, one of Canada's largest telecommunications providers, we design and deliver transformative customer experiences through cutting-edge technology, agile thinking, and a people-first culture.
With a global team across North America, South America, Central America, Europe, and APAC, we offer end-to-end expertise across eight core service areas: Digital Product Consulting, Digital Marketing Services, Data & AI, Strategy Consulting, Business Operations Modernization, Enterprise Applications, Cloud Engineering, and QA & Test Engineering.
About the Role
As a Senior Platform Engineer, you will play a crucial role in designing, implementing, and maintaining our cloud-native infrastructure.
You'll leverage your extensive experience with the different Cloud Providers, Kubernetes, CI/CD, and Infrastructure as Code to drive technical excellence and innovation.
This role offers the opportunity to take on increased responsibilities and mentor team members.
Responsibilities:
- Platform Architecture and Development:
- Design and implement scalable cloud-native architectures
- Lead the development and maintenance of Kubernetes infrastructure
- Architect, optimize, and maintain CI/CD pipelines using GitHub Actions
- Develop and maintain scalable Helm Chart templates
- Implement Infrastructure as Code (IaC) best practices using Terraform
- Drive container strategy and implementation
- Maintain and enhance comprehensive observability solutions, with hands-on experience
- Implement FinOps practices for optimizing cloud costs
- Process and Quality:
- Establish and promote technical standards and best practices
- Conduct thorough code reviews and participate in architecture discussions
- Implement and monitor SLOs/SLIs for platform services
- Spearhead automation and optimization initiatives
- Ensure adherence to security and compliance requirements
- Participate in on-call rotation and provide incident management support for production environments
- Lead post-mortems and drive continuous improvement in incident response processes
- Collaborate with the security team to implement robust security practices
- Collaboration & Mentorship:
- Mentor other team member engineers and contribute to their technical growth.
- Collaborate with cross-functional teams to align technical solutions with business objectives.
- Lead technical discussions and present complex ideas to both technical and non-technical audiences.
- Proactively identify and propose solutions to technical challenges
- Drive agile practices and contribute to continuous improvement efforts
- Write comprehensive technical documentation to empower team members and ensure knowledge transfer.
Requirements
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 5+ years of hands-on experience in DevOps and Platform engineering.
- Expert-level knowledge of Kubernetes administration and architecture, including configuring, managing, and maintaining clusters.
Extensive experience with: - GitHub Actions and CI/CD pipeline design - Helm Chart development and maintenance - Terraform and Infrastructure as Code practices - Container technologies and Docker - Cloud platforms (AWS/GCP), including an understanding of cloud-native application architectures and deployment patterns - Excellent problem-solving and decision-making abilities with outstanding communication skills - Ability to troubleshoot and handle issues in production environments.
- Demonstrated ability to mentor and guide team members - Experience in leading technical initiatives or small teams
Preferred Qualifications:
- Hold a Certified Kubernetes Administrator (CKA) certification.
- Experience with FluxCD or any other GitOps tool.
- Experience in configuring and maintaining Kong API Gateway.
- Experience in configuring and maintaining Istio service-mesh.
- Experience with observability platforms such as Datadog, Prometheus, Grafana, OpenTelemetry, and Logstash.
- Experience working with microservices in an event-driven architecture.
- Experience with Terragrunt - Scripting and automation (Python, Bash, Go) - Have worked or maintained a Hashicorp Vault - Understanding of security best practices and compliance requirements - Experience with multiple cloud providers - Designed, built, and maintained scalable and reliable AI/ML infrastructure on AWS, including MLOps pipelines and data platforms,to support the full lifecycle of machine learning models from experimentation to production.
What's in it for you
- Private medical and life insurance from day one.
- Professional growth budget for certifications and training.
- Flexible work schedule.
- Performance-based bonuses.