Cloud Engineer
Abu Dhabi Telemedicine Centre
Description
Overview
M42 delivers comprehensive healthcare services across the full continuum of care, from primary care to advanced specialty treatments. Leveraging cutting-edge health technologies and precision medicine, we ensure the highest standards of effectiveness, efficiency, and patient-centered outcomes. With a global presence spanning more than 480 facilities in 27 countries and a dedicated workforce of over 20,000 professionals, M42 is uniquely positioned to redefine the future of healthcare on a global scale.
We are looking for a Cloud Operations Engineer with strong hands-on experience in Microsoft Azure to ensure stable, secure, and efficient cloud operations. This role focuses on supporting day-to-day cloud workloads, maintaining AKS environments, strengthening security and governance, and driving automation across infrastructure, patching, backup, and monitoring. The ideal candidate is operationally strong, automation-minded, and comfortable working closely with engineering and DevOps teams to ensure platform reliability and scalability.
Responsibilities
- Manage daily operations of Azure environments across multiple subscriptions.
- Provision, configure, and maintain Azure resources including VMs, VNets, Load Balancers, App Services, Storage Accounts, and Key Vaults.
- Ensure compliance with cloud governance standards such as RBAC, tagging, security policies, and cost optimization.
- Perform capacity planning, performance tuning, and resource optimization.
- Operate and maintain Azure Kubernetes Service (AKS) clusters, including node health, scaling, upgrades, and patching.
- Manage Azure Container Registry (ACR), container image lifecycle, and container security.
- Troubleshoot AKS networking, ingress, and workload-related issues.
- Configure and manage Azure Backup, Recovery Services Vaults, and retention policies.
- Perform backup validation, restore testing, and support disaster recovery activities.
- Monitor and remediate vulnerabilities across Azure resources, VMs, and AKS nodes.
- Use Microsoft Defender for Cloud (formerly Azure Security Center) and Log Analytics for security monitoring and threat detection.
- Implement baseline hardening, secure configurations, and secrets management.
- Develop automation using Azure Automation, PowerShell, Bash, or Python.
- Implement and manage automated patching for Windows/Linux VMs and AKS node pools.
- Create and maintain runbooks for operational tasks and routine cloud activities.
- Configure monitoring, alerts, and dashboards using Azure Monitor, Log Analytics, and Prometheus.
- Lead incident response, root-cause analysis, and preventive action planning.
- Support operational tasks for Azure SQL, PostgreSQL, MySQL, and Cosmos DB, including performance monitoring and maintenance.
- Support CI/CD pipelines using GitHub Actions and manage GitHub repositories and access.
- Work closely with development teams to ensure deployment readiness and environment stability.
Qualifications
- 4-8 years of experience in Cloud Operations, Cloud Engineering, or DevOps roles.
- Strong hands-on experience with Microsoft Azure cloud infrastructure and operations.
- Proven experience managing AKS and containerized workloads.
- Solid understanding of backup, disaster recovery, patching, and vulnerability management.
- Experience with Azure Automation, runbooks, and scripting (PowerShell, Bash, or Python).
- Familiarity with monitoring, alerting, and incident management practices.
- Working knowledge of Azure database platforms and operational best practices.
- Experience with GitHub and basic CI/CD workflows.
- Experience with Infrastructure as Code (Terraform and/or Bicep) is a plus.
- Exposure to GitOps tools such as ArgoCD or Flux is desirable.
- Azure certifications (AZ-104, AZ-305, AZ-400, AZ-500) are preferred.
- Experience supporting hybrid or on-premises environments is advantageous.