KubeCraftJobs

DevOps & Cloud Job Board

Site Reliability Engineer

IBM

Mexico City Metropolitan Area

Hybrid
Mid Level
Full Time
Posted January 13, 2026

Tech Stack

windows elastic-go elastic-cloud pagerduty docker kubernetes powershell python microsoft-windows-server window-server windows-server-2016 beyondtrust realm postgresql timescale mongodb rabbitmq mqtt microsoft-azure azure-devops kibana avature

Job Description

**Introduction**

As a Site Reliability Engineer (SRE) in the REIS team, you will ensure the reliability, scalability, and security of edge deployments. You will collaborate with a global team to support a hybrid infrastructure of Linux and Windows VMs, containerized services, and cloud-native tools. This role emphasizes automation, observability, and proactive incident management.

Realtime Edge Infrastructure and Services (REIS) is responsible for deploying, monitoring, and supporting edge computing infrastructure on oil rigs. Our primary software platform, DrillOps, runs on these edge devices and enables real-time automation and optimization of well construction processes.

**Your role and responsibilities**

- Design, implement, and maintain monitoring and observability using Elastic Cloud Stack and Elastic Fleet agents.
- Participate in on-call rotations using PagerDuty for incident response and resolution.
- Support containerized services (Docker) and contribute to the transition to Kubernetes.
- Automate operational tasks using Salt, PowerShell, Bash, and Python.
- Support AI/ML initiatives and integrate intelligent automation where applicable.
- Manage and support edge devices running Zededa EVEOS with Linux and Windows Server 2016 VMs.
- Perform remote support using SimpleHelp (transitioning to BeyondTrust).
- Support and maintain databases including Realm, Postgres, Timescale, and MongoDB.
- Manage messaging systems: RabbitMQ (edge), MQTT and IOTHub (cloud).
- Collaborate with developers using Azure DevOps for version control and issue tracking.
- Contribute to security best practices and compliance across all systems.

**Preferred Education**

Bachelor's Degree

**Required Technical And Professional Expertise**

- Strong experience with Elastic Cloud Stack, Kibana, and Elastic Fleet agents.
- Direct experience with scripting and automating tasks using Salt, PowerShell, Bash and/or Python.
- Familiarity with Docker and Kubernetes container orchestration.
- Experience with Azure DevOps for CI/CD and collaboration.