Lead Engineer – Cloud Application Operations & Maintenance (m/f/d)
SEGULA Technologies
Description
-
Coordinate and drive activities in the area of cloud application operations in close collaboration with senior engineers and relevant stakeholders
-
Ensure the stable and reliable operation of cloud-based applications and services (AWS, Azure, Alibaba Cloud), supporting incident, problem, and change management processes
-
Take ownership of critical incidents (P1/P2), lead root cause analysis (RCA), and drive sustainable remediation actions
-
Plan, coordinate, and execute application and platform updates (e.g., releases, middleware, databases, Kubernetes-based deployments), including initial application rollouts
-
Define, coordinate, and ensure execution of testing activities in the context of releases and changes (integration, regression, and acceptance testing)
-
Collaborate closely with development and IT teams to ensure testable, stable, and production-ready deployments
-
Support the refinement and implementation of NIS2-related requirements, particularly in areas such as operational security, monitoring, incident reporting, and risk management
-
Drive continuous improvement initiatives focused on automation, stability, and performance optimization of applications
-
Monitor and report on KPIs and SLAs, ensuring transparency and service quality
-
Align with internal and external stakeholders (DevOps, development, architecture, security, customers) to ensure efficient delivery and operations
-
Contribute to the continuous improvement of ITIL/ITSM processes and service management workflows
-
5–8+ years of experience in cloud application operations, DevOps, or IT operations
-
Proven experience in coordinating complex technical topics or deliveries (without direct people management responsibility)
-
Strong experience operating cloud-based applications on at least one of the following platforms: AWS, Azure, or Alibaba Cloud
-
Solid knowledge of Linux-based environments and container technologies (Docker, Kubernetes)
-
Hands-on experience with incident, problem, and change management in production environments
-
Experience with application updates, release management, and lifecycle processes
-
Understanding of testing in an operations context (integration testing, system validation, release verification)
-
Familiarity with test automation and CI/CD pipelines
-
Knowledge of security and compliance practices, ideally in the context of NIS2, ISO27001, or similar frameworks
-
Experience with monitoring, observability, and automation (e.g., Python, Bash, CI/CD).
-
Structured and solution-oriented working style, with the ability to connect technical and organizational aspects
-
Strong communication and stakeholder management skills; fluent English required