Finden Sie den Job, der Ihnen gefällt!
search
reorder
sell
keyboard_arrow_left Zurück
Passt das zu Ihnen?

SRE Champion / Principal Consultant (m/f/x) - SAP Cloud Reliability (DE)

[15811]

We are currently looking for a freelance "SRE Champion / Principal Consultant (m/f/x) - SAP Cloud Reliability" for our client in the IT-sector. Start: asap End: 30.06.2026 ++ Capacity: fulltime Location: Remote / Walldorf Description: We are building up Site Reliability Engineering (SRE) practices for our mission-critical Customer Portal SAP for Me, a cloud-native, self-service, and transactional platform that is central to our digital business. The portal is delivered by an Agile Release Train (ART) with 15 teams, responsible for the platform and cross-cutting functions. In addition, external business feature teams outside the ART also contribute functionality to the portal through a shared contribution model. To accelerate this journey, one internal team member will take the lead for SRE in a “lift & shift” approach. As this person is new to SRE, we are looking for an experienced SRE Champion (external engagement) who can provide hands-on guidance and structured coaching. This is a transitional role: the Champion will introduce best practices, establish core reliability processes, and enable the internal lead and product teams to independently run and evolve SRE capabilities after the engagement ends. Expected Outcome of Engagement: Internal SRE lead and SAP for Me teams enabled to own and evolve SRE practices independently, with tangible deliverables such as first SLOs in production, post-mortem framework, and handover playbook, and an SRE-aware onboarding framework for new external feature teams. Responsibilities: ·Act as coach and mentor for the internal SRE lead, ensuring structured knowledge transfer. ·Establish and pilot SRE foundations for the Customer Portal: SLO/SLI framework, error budgets, incident/post-mortem processes, and runbooks. ·Guide the setup of observability, monitoring, and alerting aligned with business reliability needs. ·Promote a cultural shift toward “you build it, you run it” across teams delivering to the portal. ·Define a handover roadmap and playbook to secure sustainable ownership post-engagement. ·Collaborate with both ART teams and external business feature teams to align responsibilities and reliability goals. ·Ensure SRE practices are included in the onboarding process for new ART-external feature teams, providing guardrails and playbooks for reliability. ·Identify skills and roles needed for a SRE team Experience Required Skills & Experience: ·5+ years establishing or scaling SRE practices for complex, high-traffic, cloud-native products. ·Experience introducing SRE in organizations without existing SRE structure ·Expertise with observability and monitoring tooling (e.g., Dynatrace, Prometheus, Grafana, ELK/Opensearch, or similar). ·Proven track record implementing SLO/SLI/error budget frameworks. ·Hands-on experience with incident response, root cause analysis, and automation for reliability. ·Solid understanding of DevOps practices, CI/CD, and infrastructure-as-code. ·Strong communication and coaching skills to upskill less experienced colleagues. Nice to Have: ·Familiarity with AIOps and reliability automation. ·Background in compliance and governance in regulated industries

We are currently looking for a freelance "SRE Champion / Principal Consultant (m/f/x) - SAP Cloud Reliability" for our client in the IT-sector.

Start: asap
End: 30.06.2026 ++
Capacity: fulltime
Location: Remote / Walldorf

Description:
We are building up Site Reliability Engineering (SRE) practices for our mission-critical Customer Portal SAP for Me, a cloud-native, self-service, and transactional platform that is central to our digital business. The portal is delivered by an Agile Release Train (ART) with 15 teams, responsible for the platform and cross-cutting functions. In addition, external business feature teams outside the ART also contribute functionality to the portal through a shared contribution model.
To accelerate this journey, one internal team member will take the lead for SRE in a “lift & shift” approach. As this person is new to SRE, we are looking for an experienced SRE Champion (external engagement) who can provide hands-on guidance and structured coaching.
This is a transitional role: the Champion will introduce best practices, establish core reliability processes, and enable the internal lead and product teams to independently run and evolve SRE capabilities after the engagement ends.

Expected Outcome of Engagement:
Internal SRE lead and SAP for Me teams enabled to own and evolve SRE practices independently, with tangible deliverables such as first SLOs in production, post-mortem framework, and handover playbook, and an SRE-aware onboarding framework for new external feature teams.

Responsibilities:
  • Act as coach and mentor for the internal SRE lead, ensuring structured knowledge transfer.
  • Establish and pilot SRE foundations for the Customer Portal: SLO/SLI framework, error budgets, incident/post-mortem processes, and runbooks.
  • Guide the setup of observability, monitoring, and alerting aligned with business reliability needs.
  • Promote a cultural shift toward “you build it, you run it” across teams delivering to the portal.
  • Define a handover roadmap and playbook to secure sustainable ownership post-engagement.
  • Collaborate with both ART teams and external business feature teams to align responsibilities and reliability goals.
  • Ensure SRE practices are included in the onboarding process for new ART-external feature teams, providing guardrails and playbooks for reliability.
  • Identify skills and roles needed for a SRE team Experience

Required Skills & Experience:
  • 5+ years establishing or scaling SRE practices for complex, high-traffic, cloud-native products.
  • Experience introducing SRE in organizations without existing SRE structure
  • Expertise with observability and monitoring tooling (e.g., Dynatrace, Prometheus, Grafana, ELK/Opensearch, or similar).
  • Proven track record implementing SLO/SLI/error budget frameworks.
  • Hands-on experience with incident response, root cause analysis, and automation for reliability.
  • Solid understanding of DevOps practices, CI/CD, and infrastructure-as-code.
  • Strong communication and coaching skills to upskill less experienced colleagues.

Nice to Have:
  • Familiarity with AIOps and reliability automation.
  • Background in compliance and governance in regulated industries





map Remote / Walldorf date_range asap update Freiberuflich
SRE cloud-native Dynatrace Grafana Prometheus ELK/Opensearch DevOps CI/CD
Direkter Kontakt

Francesca Hameister

Senior Recruiterin
mail f.hameister@1st-solution-group.com
phone +49 211 15 98 35 - 53


Kein passender Job? Senden Sie uns eine Nachricht!

Kein passender Job für Sie dabei? Kein Problem! Senden Sie uns einfach Ihren Namen, Ihre E-Mail sowie eine kurze Beschreibung Ihres Jobwunsches. Wir melden uns umgehend mit passenden Vorschlägen!