43 Operations Engineer jobs in Canada
Senior Cloud Operations Engineer

Posted 22 days ago
Job Viewed
Job Description
Job Category: Engineering & Cloud
Location: Canada - BC - Remote
**Meet Our Team:**
**_Due to the nature of the work, Canadian Citizenship is required. This is a remote role in Canada, Pacific Time Zone only please._**
As a member of Cloud Operations team, you will be a key member responsible for the reliability and availability of Pegasystems cloud service offerings. We operate as a global follow the sun 24x7 team with locations in Bangalore, Sydney, and the East Coast of the United States. We encourage a culture of diversity, openness, intellectual curiosity, problem solving, and consistently strive to create an environment that provides the support and mentorship needed to learn and grow.
**Picture Yourself at Pega:**
You will have the opportunity to work on diverse problems and apply your expertise and experience to improve reliability of Pega Cloud Platform. You will take personal ownership of the systems you manage and possess the tenacity to delve to the root of the problem quickly, understand why it happened, and prevent it from reoccurrence. By collaborating and communicating with customers and internal stake holders, you will deliver best in class support.
**What You'll Do at Pega:**
- Perform provisioning of new environments and upgrade of the infrastructure components & Product application
- Perform decommission of existing environments
- Troubleshoot and resolve customers environment issues along with root cause analysis
- Create and maintain operational runbooks
- Identify and document Standard operating procedures for daily tasks
- Participate in testing of pre-release product enhancement testing with Engineering
- Identify opportunities for automation of repeated tasks and reduce toil
- Write scripts to automate repetitive tasks
- Work with team on scheduling upgrade tasks / hotfixes and patches
- Manage / execute deployment of system updates / patches and hotfixes
- Monitor the teams ticket queue and work with team to distribute tickets in timely manner
- Monitor teams email distribution list for escalation / communication and work with team to respond in timely manner
- Prepare handoff documentation to work with other global teams
- Willing to be on-call to support customers 24 x 7 on rotational basis
- Flexibility to work on/ cover for rotating weekend shift (Saturday and Sunday)
**Who You Are:**
- Proven professional and technical experience in an enterprise cloud environment supporting SAAS applications with a focus on operational delivery excellence and customer service
- You are self-motivated, inquisitive, and creative, with a passion for continuous improvement and excellent people skills
- Works well with cross-functional global and remote teams
- Demonstrated ability to learn new technologies, techniques, and tools quickly to meet our business requirements
- Comfortable working in a fast-paced, enterprise environment
- Possess customer obsession and proven empathy towards customers
- Good communication skills to navigate internal and external customers
**What You've Accomplished:**
You are skilled in Cloud, Linux, Middleware and DevOps Technologies, and have accomplished the below:
- 7+ years of hands-on operational or engineering experience in installing, configuring, troubleshooting, and tuning Java applications and Apache Tomcat application servers
- 7+ years of experience with enterprise scale Linux Administration
- Hands-on operational experience with Amazon Web Services (AWS) and/or Google Cloud Platform (GCP)
- Deep understanding of cloud-based infrastructure, platform, and application operational administration - including product and platform upgrades, installations, backup, and recovery, monitoring and observability, etc.
- Experience with microservices architecture with Kubernetes is a plus
- Administration of web servers running Tomcat, Apache, IIS, Nginx
- Basic network troubleshooting skills including TCP/IP, DNS, VPN is a plus
- Experience in Bash/Shell, Python, or similar scripting languages to automate common tasks, a plus
- Bachelor's degree in Computer Science/Engineering or equivalent
- AWS / GCP Certification, a plus
- Certified Kubernetes Administrator, a plus
- Ability to obtain Security clearance if required
- Canadian Citizenship is required
**Pega Offers You:**
+ Gartner Analyst acclaimed technology leadership across our categories of products
+ Continuous learning and development opportunities
+ An innovative, inclusive, agile, flexible, and fun work environment
+ Competitive global benefits program inclusive of pay + bonus incentive, employee equity in the company#LI-KH2
Job ID: 22325
**AI in Action -** Pega embraces the power of artificial intelligence. We encourage all employees to actively engage with AI technologies and continually explore ways to responsibly integrate AI into our products and processes.
**Culture -** At Pegasystems, we foster an environment where people feel valued and empowered to contribute their best. With global clients across industries and regions, we know our success depends on the unique perspectives, experiences, and talents of our people. Ours is a workplace where everyone can grow, collaborate, and deliver meaningful outcomes.
We encourage candidates from all backgrounds and experiences and focus on the core competencies and mindset needed to thrive in a role.
As an Equal Opportunity employer, Pegasystems will not discriminate in its employment practices due to an applicant's race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, veteran or disability status, or any other category protected by law.
**Export Compliance -** For positions requiring access to technical data subject to export control regulations such as this, Pegasystems may need to obtain export license approval from the U.S. Government and EU Authorities for certain individuals.
**Accommodations -** If you require reasonable accommodations under the Americans with Disabilities Act (US only) or comparable regional regulations in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process,or contact (US only) 1-888-PEGA-NOW and/or 225 Wyman Street Waltham, MA 02451 ATTN: Benefits.
It is Pega's policy to engage, recruit, hire, promote, train, discipline, and compensate in all job classifications, without regard to race, color, sex, religion, national origin, age, disability, sexual orientation, gender identity, veteran status, or any other category protected by law.
Media Operations Engineer (Video Streaming)
Posted today
Job Viewed
Job Description
Job Description
Salary:
About us--
We are technologists at heart, who love what we do.
At Quickplay we believe in transparency, fairness, and collaboration while we passionately work on some of the toughest use cases in OTT video; and are enthusiastic about massive scale and agility. If you get excited about building the future of OTT video, and aspire to be part of a high-performing, learning-oriented, and caring culture--you have landed on the right company.
In an evolving market, our employees are inspired by the innovative environment that allows them to lead, motivate, and create, while reaching their full potential and achieving great results. Spending their days in a challenging atmosphere developing cutting edge products for the biggest names in media and communications, Quickplay employees are able to expand their skills and grow with a passionate and talented group of people.
About the role--
Primarily focused on:
- Located at our Toronto office as part of our Engineering team, this position is responsible for ensuring content deliverables for Quickplay clients including all phases of planning and communication with Content Providers and Service Providers.
- Work with existing customers on new/existing business and system requirements with respect to content initiatives (Discovery, Planning, Deployment and Validation).
- Identify and define Internal business and system requirements for continuous life-cycle improvement of the Gen V platform.
- Define reports for Key Performance Indices, service level reporting, exception reporting, and data analysis to share with key stakeholders.
- Manage internal stakeholder, customer expectations and communication with respect to content.
- Own service setup in the production environment.
- Identify candidates for automation of media processing related tasks.
- Manage delivery, testing, validation, troubleshooting, Service Level Agreements, and customer communication related to content.
- Act as the central point of contact for the customer, as well as the Subject Matter Expert for Sales/Product teams and during ISD/Product Sprint/Grooming Cycle.
- Ensure customer satisfaction with respect to content initiatives.
- Build close relationships to Service Providers and Content Providers and obtain regular forecasting.
- Ensure documentation and hand off to other support groups for post implementation Procedures.
About You
Experience & Technical Requirements:
- General knowledge of video and audio transcoding, codecs, and formats.
- Knowledge of DRM systems like Apple Fairplay, Google Widevine & amp; and Microsoft Playready
- Working knowledge of various platforms for streaming media playback (eg: AppleTV, Roku, iOS, Android).
- Knowledge of HTTP based ABR techniques like Apple HLS, LL-HLS, MPEG-DASH, CMAF, and Smooth Streaming.
- 2 to 5 years of experience working in a technical role (development or quality assurance) in a software development environment.
- Experience working with XML and different metadata formats.
- Experience working with Content Providers and VOD (video on demand) content delivery methods.
- Experience working on Content Management Systems.
- Flexibility to alter shifts/days off and work overtime to accommodate special projects and departmental objectives.
Highly Favorable Skills:
- Excellent interpersonal and customer service skills with the ability to function effectively in a team environment.
- Excellent written and verbal communication skills with an emphasis on communication across levels within an organization.
- Strong analytical skills along with the ability to prioritize tasks and make decisions.
Easy-going and flexible individual who can integrate and function within a pre-existing team. - Self-starter who can operate with minimal direction.
- Excellent oral and written communication skills capable of leading design/architecture & training sessions.
- A creative thinker and experienced problem solver.
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!
Be The First To Know
About the latest Operations engineer Jobs in Canada !
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!
Site Reliability Engineer / Platform Operations Engineer
Posted today
Job Viewed
Job Description
Job Description
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
You Have:
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
Bonus:
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
This role offers great perks and a competitive salary, please apply to the job posting if it matches your career path!