We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Cloud Engineering Manager

BBSI
United States, Washington, Vancouver
8100 Northeast Parkway Drive (Show on map)
Mar 20, 2026

Cloud Engineering Manager

Position Summary

The Cloud Engineering Manager leads a team responsible for the reliability, scalability, security, and cost efficiency of the organization's cloud platform.

This role oversees Cloud Engineers and Site Reliability Engineers who build and operate cloud infrastructure supporting critical business and product workloads.

The Manager focuses on engineering leadership, platform ownership, operational excellence, and team development, ensuring the consistent delivery of secure, resilient, and cost-optimized cloud services.

While the team executes hands-on engineering work, the Manager brings prior technical experience to guide architectural decisions, mentor team members, and uphold engineering standards.

This role also serves as a senior escalation point for cloud-related incidents and leads response efforts for major production issues.

Team Leadership & Development

Lead, mentor, and develop a high-performing cloud engineering team.

Responsibilities include:

  • Recruiting and developing engineering talent
  • Establishing engineering standards and best practices
  • Coaching team members on architecture, reliability, and operations
  • Conducting performance management and career development planning
  • Fostering a culture of ownership, accountability, and excellence

Platform Ownership

Accountable for the engineering and operational health of the cloud platform, including:

  • Infrastructure architecture
  • Cloud networking
  • Identity and access management
  • Disaster recovery and business continuity
  • Observability and monitoring systems
  • Reliability engineering practices

Reliability & Production Operations

Lead Site Reliability Engineering (SRE) practices for production systems.

Responsibilities include:

  • Defining and managing reliability targets (SLOs/SLAs)
  • Leading root cause analysis for major incidents
  • Driving automation and continuous improvement
  • Ensuring comprehensive monitoring and alerting
  • Managing on-call processes and escalation paths

Cloud Financial Governance (FinOps)

Ensure responsible and efficient use of cloud resources.

Responsibilities include:

  • Monitoring usage and cost trends
  • Driving accountability for cost optimization
  • Establishing governance controls and guardrails
  • Supporting forecasting and budgeting processes

Architecture & Engineering Governance

Ensure the platform is built using scalable, secure, and standardized design principles.

Responsibilities include:

  • Reviewing architecture and design proposals
  • Promoting reusable infrastructure patterns
  • Encouraging automation and infrastructure-as-code (IaC)
  • Embedding security and reliability into system design

Cross-Team Collaboration

Partner with engineering, product, security, and operations teams to support delivery across environments and services.

Ensure effective coordination between teams responsible for development, testing, and production systems.

Manager Accountability Model

The Cloud Engineering Manager is accountable for:

  1. Platform Reliability
  2. Meeting availability targets and driving continuous improvement
  3. Team Effectiveness
  4. Building and leading a high-performing engineering team
  5. Architecture Quality
  6. Maintaining scalable, consistent, and efficient platform design
  7. Cost Management
  8. Ensuring cloud spend is visible, controlled, and optimized
  9. Collaboration
  10. Enabling strong alignment across engineering and business teams

What This Role Does NOT Do

This role leads through systems, standards, and people rather than acting as a primary individual contributor.

This role does not:

  • Serve as the primary hands-on engineer
  • Act as a bottleneck for technical decisions
  • Replace engineers during incidents
  • Own every technical component personally
  • Operate primarily as an individual contributor

First 90 Days

First 30 Days

  • Learn the cloud platform and key systems
  • Build relationships with team members and stakeholders
  • Review monitoring, incident history, and cost practices

First 60 Days

  • Clarify ownership and responsibilities across the team
  • Reinforce engineering and operational standards
  • Identify reliability and cost improvement opportunities

First 90 Days

  • Implement reliability and automation improvements
  • Strengthen incident management processes
  • Establish regular engineering and operational review rhythms

Candidate Profile

Strong Candidates Demonstrate:

  • Prior hands-on cloud engineering experience
  • Experience leading engineering or infrastructure teams
  • Strong focus on reliability, scalability, and operations
  • Experience managing production systems and incidents
  • Understanding of cloud cost management practices
  • Ability to collaborate effectively across teams

Less Effective Candidates May Show:

  • Preference for individual technical work over team leadership
  • Limited leadership experience
  • Reactive or tactical thinking
  • Limited exposure to production environments
  • Minimal awareness of cost governance

If you want, I can also tailor this for:

  • A specific cloud (AWS, GCP, multi-cloud)
  • A startup vs enterprise tone
  • A recruiting-friendly (shortened) version for job postings

Applied = 0

(web-bd9584865-7clgh)