43 Operations Engineer jobs in Canada

Senior Cloud Operations Engineer

British Columbia, British Columbia Pegasystems

Posted 22 days ago

Job Description

Senior Cloud Operations Engineer
Job Category: Engineering & Cloud
Location: Canada - BC - Remote
**Meet Our Team:**
**_Due to the nature of the work, Canadian Citizenship is required. This is a remote role in Canada, Pacific Time Zone only please._**
As a member of the Cloud Operations team, you will be responsible for the reliability and availability of Pegasystems cloud service offerings. We operate as a global follow-the-sun 24x7 team with locations in Bangalore, Sydney, and the East Coast of the United States. We encourage a culture of diversity, openness, intellectual curiosity, and problem solving, and consistently strive to create an environment that provides the support and mentorship needed to learn and grow.
**Picture Yourself at Pega:**
You will have the opportunity to work on diverse problems and apply your expertise and experience to improve the reliability of the Pega Cloud Platform. You will take personal ownership of the systems you manage and possess the tenacity to get to the root of a problem quickly, understand why it happened, and prevent it from recurring. By collaborating and communicating with customers and internal stakeholders, you will deliver best-in-class support.
**What You'll Do at Pega:**
- Provision new environments and upgrade infrastructure components and the product application
- Decommission existing environments
- Troubleshoot and resolve customer environment issues, including root cause analysis
- Create and maintain operational runbooks
- Identify and document standard operating procedures for daily tasks
- Participate in testing pre-release product enhancements with Engineering
- Identify opportunities for automation of repeated tasks and reduce toil
- Write scripts to automate repetitive tasks
- Work with the team to schedule upgrade tasks, hotfixes, and patches
- Manage and execute deployment of system updates, patches, and hotfixes
- Monitor the team's ticket queue and work with the team to distribute tickets in a timely manner
- Monitor the team's email distribution list for escalations and communications, and work with the team to respond in a timely manner
- Prepare handoff documentation for work with other global teams
- Be willing to be on call to support customers 24x7 on a rotational basis
- Be flexible to work or cover a rotating weekend shift (Saturday and Sunday)
**Who You Are:**
- Proven professional and technical experience in an enterprise cloud environment supporting SaaS applications with a focus on operational delivery excellence and customer service
- You are self-motivated, inquisitive, and creative, with a passion for continuous improvement and excellent people skills
- Works well with cross-functional global and remote teams
- Demonstrated ability to learn new technologies, techniques, and tools quickly to meet our business requirements
- Comfortable working in a fast-paced, enterprise environment
- Possess customer obsession and proven empathy towards customers
- Good communication skills to navigate internal and external customers
**What You've Accomplished:**
You are skilled in Cloud, Linux, Middleware and DevOps Technologies, and have accomplished the below:
- 7+ years of hands-on operational or engineering experience in installing, configuring, troubleshooting, and tuning Java applications and Apache Tomcat application servers
- 7+ years of experience with enterprise scale Linux Administration
- Hands-on operational experience with Amazon Web Services (AWS) and/or Google Cloud Platform (GCP)
- Deep understanding of cloud-based infrastructure, platform, and application operational administration, including product and platform upgrades, installations, backup and recovery, and monitoring and observability
- Experience with microservices architecture with Kubernetes is a plus
- Administration of web servers running Tomcat, Apache, IIS, Nginx
- Basic network troubleshooting skills including TCP/IP, DNS, VPN is a plus
- Experience in Bash/Shell, Python, or similar scripting languages to automate common tasks, a plus
- Bachelor's degree in Computer Science/Engineering or equivalent
- AWS / GCP Certification, a plus
- Certified Kubernetes Administrator, a plus
- Ability to obtain Security clearance if required
- Canadian Citizenship is required
**Pega Offers You:**
+ Gartner Analyst acclaimed technology leadership across our categories of products
+ Continuous learning and development opportunities
+ An innovative, inclusive, agile, flexible, and fun work environment
+ Competitive global benefits program inclusive of pay + bonus incentive, employee equity in the company
Job ID: 22325
**AI in Action -** Pega embraces the power of artificial intelligence. We encourage all employees to actively engage with AI technologies and continually explore ways to responsibly integrate AI into our products and processes.
**Culture -** At Pegasystems, we foster an environment where people feel valued and empowered to contribute their best. With global clients across industries and regions, we know our success depends on the unique perspectives, experiences, and talents of our people. Ours is a workplace where everyone can grow, collaborate, and deliver meaningful outcomes.
We encourage candidates from all backgrounds and experiences and focus on the core competencies and mindset needed to thrive in a role.
As an Equal Opportunity employer, Pegasystems will not discriminate in its employment practices due to an applicant's race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, veteran or disability status, or any other category protected by law.
**Export Compliance -** For positions requiring access to technical data subject to export control regulations such as this, Pegasystems may need to obtain export license approval from the U.S. Government and EU Authorities for certain individuals.
**Accommodations -** If you require reasonable accommodations under the Americans with Disabilities Act (US only) or comparable regional regulations in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, contact (US only) 1-888-PEGA-NOW and/or 225 Wyman Street, Waltham, MA 02451, ATTN: Benefits.
It is Pega's policy to engage, recruit, hire, promote, train, discipline, and compensate in all job classifications, without regard to race, color, sex, religion, national origin, age, disability, sexual orientation, gender identity, veteran status, or any other category protected by law.

Media Operations Engineer (Video Streaming)

Toronto, Ontario Quickplay

Posted today

Job Description

About us--

We are technologists at heart, who love what we do.

At Quickplay we believe in transparency, fairness, and collaboration. We work passionately on some of the toughest use cases in OTT video and are enthusiastic about massive scale and agility. If you get excited about building the future of OTT video and aspire to be part of a high-performing, learning-oriented, and caring culture, you have landed at the right company.

In an evolving market, our employees are inspired by the innovative environment that allows them to lead, motivate, and create, while reaching their full potential and achieving great results. Spending their days in a challenging atmosphere developing cutting edge products for the biggest names in media and communications, Quickplay employees are able to expand their skills and grow with a passionate and talented group of people.

About the role--

Primarily focused on:

  • Located at our Toronto office as part of our Engineering team, this position is responsible for ensuring content deliverables for Quickplay clients including all phases of planning and communication with Content Providers and Service Providers.
  • Work with existing customers on new/existing business and system requirements with respect to content initiatives (Discovery, Planning, Deployment and Validation).
  • Identify and define internal business and system requirements for continuous life-cycle improvement of the Gen V platform.
  • Define reports for Key Performance Indicators, service level reporting, exception reporting, and data analysis to share with key stakeholders.
  • Manage internal stakeholder and customer expectations and communication with respect to content.
  • Own service setup in the production environment.
  • Identify candidates for automation of media processing related tasks.
  • Manage delivery, testing, validation, troubleshooting, Service Level Agreements, and customer communication related to content.
  • Act as the central point of contact for the customer, as well as the Subject Matter Expert for Sales/Product teams and during ISD/Product Sprint/Grooming Cycle.
  • Ensure customer satisfaction with respect to content initiatives.
  • Build close relationships with Service Providers and Content Providers and obtain regular forecasting.
  • Ensure documentation and handoff to other support groups for post-implementation procedures.

About You

Experience & Technical Requirements:

  • General knowledge of video and audio transcoding, codecs, and formats.
  • Knowledge of DRM systems like Apple FairPlay, Google Widevine, and Microsoft PlayReady.
  • Working knowledge of various platforms for streaming media playback (e.g., Apple TV, Roku, iOS, Android).
  • Knowledge of HTTP-based ABR techniques like Apple HLS, LL-HLS, MPEG-DASH, CMAF, and Smooth Streaming.
  • 2 to 5 years of experience working in a technical role (development or quality assurance) in a software development environment.
  • Experience working with XML and different metadata formats.
  • Experience working with Content Providers and VOD (video on demand) content delivery methods.
  • Experience working on Content Management Systems.
  • Flexibility to alter shifts/days off and work overtime to accommodate special projects and departmental objectives.

Highly Favorable Skills:

  • Excellent interpersonal and customer service skills with the ability to function effectively in a team environment.
  • Excellent written and verbal communication skills with an emphasis on communication across levels within an organization.
  • Strong analytical skills along with the ability to prioritize tasks and make decisions.
  • Easy-going and flexible individual who can integrate and function within a pre-existing team.
  • Self-starter who can operate with minimal direction.
  • Excellent oral and written communication skills capable of leading design/architecture & training sessions.
  • A creative thinker and experienced problem solver.

Site Reliability Engineer / Platform Operations Engineer

Vancouver, British Columbia Targeted Talent

Posted today

Job Description

We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that starts as remote, with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used.

You Will:

  • Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
  • Design and implement wargames to test our operational response and identify areas of weakness in our platforms.
  • Act as the technical and management escalation point for Service Operations Centre (SOC) engineers and during major incidents.
  • Troubleshoot, reproduce, and mitigate issues in our production environments.
  • Mentor other team members.
  • Operate global AWS platforms at scale.

You Have:

  • Evidence of strong troubleshooting, problem-solving, and investigative skills
  • Experience with AWS or other cloud providers
  • Experience developing in Java
  • Major incident management experience operating production platforms at scale
  • Experience working with distributed web applications
  • Experience automating operational tasks and processes using other languages
  • Understanding of relational and/or NoSQL data structures
  • Experience mentoring/influencing peers
  • Experience identifying improvements, highlighting risks vs benefits, and translating them into technical requirements

Bonus:

  • Worked with Ansible, Terraform, Python
  • Experience working with Serverless / Containers
  • Experience with ELK and/or Graphite / Prometheus / Grafana
  • Used tracing tools in production
  • Experience in Chaos Engineering / Failure Injection Testing
  • Experience working in an Agile environment
  • Experience working in a similar site reliability role

This role offers great perks and a competitive salary. Please apply to the job posting if it matches your career path!

Site Reliability Engineer / Platform Operations Engineer

Kitchener, Ontario Targeted Talent

Posted today

Site Reliability Engineer / Platform Operations Engineer

Winnipeg, Manitoba Targeted Talent

Posted today

Site Reliability Engineer / Platform Operations Engineer

Montréal, Quebec Targeted Talent

Posted today

Site Reliability Engineer / Platform Operations Engineer

Ottawa, Ontario Targeted Talent

Posted today

Site Reliability Engineer / Platform Operations Engineer

Saskatoon, Saskatchewan Targeted Talent

Posted today

Site Reliability Engineer / Platform Operations Engineer

Halifax, Nova Scotia Targeted Talent

Posted today

 
