Skip to main content

Senior DevOps Engineer

Acveti

DubaiOn-siteFull-Time3w ago

Description

Acveti is seeking an experienced Senior DevOps / Site Reliability Engineer (SRE) to support and optimize a high-traffic marketplace platform. This role will focus on enhancing platform reliability, improving performance, strengthening observability, reducing infrastructure costs, and driving operational excellence across cloud-native environments.

Key Responsibilitie

  • sManage, optimize, and scale cloud infrastructure hosted on Microsoft Azur e
  • .Design, deploy, and maintain containerized workloads using Azure Kubernetes Service (AKS )
  • .Administer and optimize Cloudflar e services for security, performance, caching, and traffic management
  • .Perform in-depth analysis and tuning of PostgreSQ L databases to improve performance and scalability
  • .Manage and troubleshoot distributed systems leveraging Kafk a and Redi s
  • .Build and enhance observability frameworks using Azure Monitor, Application Insights, New Relic, Grafana, and Prometheu s
  • .Develop and maintain robust CI/CD pipeline s and release engineering processes
  • .Lead infrastructure cost optimization (FinOps ) initiatives, identifying opportunities to improve efficiency while maintaining performance and reliability
  • .Drive incident response activities, root cause analysis (RCA), and post-incident remediation efforts
  • .Identify and resolve recurring performance bottlenecks, latency issues, and HTTP 500 errors across the platform
  • .Collaborate closely with engineering, product, and operations teams to improve platform stability and scalability

.

Required Skills & Experien

  • ceStrong hands-on experience wit h Microsoft Azu re and cloud-native architecture
  • s.Deep expertise i n Kubernetes (AK S) administration and troubleshootin
  • g.Experience managing and optimizin g Cloudfla re environment
  • s.Proven track record i n PostgreSQL performance tuning and database optimizati o
  • n.Strong knowledge o f Kaf ka , Red is, and distributed system
  • s.Expertise in monitoring, logging, and observability tools includin g Azure Monitor, Application Insights, New Relic, Grafana, and Promethe u
  • s.Experience designing and maintaining moder n CI/CD pipelin e
  • s.Practical experience wit h FinO ps, cloud cost management, and infrastructure optimizatio
  • n.Strong understanding of incident management, production support, and root cause analysis methodologie
  • s.Excellent troubleshooting skills with the ability to diagnose complex performance and reliability issues in production environment

s.

Preferred Qualificati

  • onsExperience supporting high-volume marketplace, e-commerce, or SaaS platfor
  • ms.Strong scripting and automation skills using PowerShell, Bash, Python, or similar technologi
  • es.Experience implementing SRE best practices, SLIs, SLOs, and error budge
  • ts.Excellent communication and stakeholder management skil

ls.

More jobs in Dubai