KubeCraftJobs

DevOps & Cloud Job Board

Performance and Automation Architect - Ceph Performance & CI

IBM

Bengaluru East, Karnataka

Hybrid
Mid Level
Full Time
Posted January 05, 2026

Tech Stack

ceph python golang glusterfs minio jenkins gitlab gitlab-ci openstack kubernetes ansible terraform grafana prometheus elk avature

Please log in or register to view job application links.

Job Description

**Introduction** In this role, you will lead initiatives to design, build, and optimize performance and CI automation frameworks for large-scale distributed storage systems. You will collaborate with global teams across development, QE, and infrastructure to drive continuous performance improvements, build intelligent CI pipelines, and integrate AI-based analytics for quality engineering. This position offers the opportunity to influence architecture decisions, define benchmarking standards, and shape IBM’s enterprise-grade distributed storage validation ecosystem. **Your Role And Responsibilities** - Lead end-to-end performance engineering for distributed storage systems and CI frameworks. - Design and develop scalable automation tools for CI/CD, benchmarking, and performance analytics. - Build and maintain workload generators, monitoring dashboards, and test frameworks for large-scale environments. - Triage, analyze, and resolve performance issues across compute, storage, and network layers. - Implement AI/ML-driven insights into CI processes for predictive validation and anomaly detection. - Collaborate with upstream and internal teams to define KPIs, metrics, and performance objectives. - Mentor engineers on performance optimization, automation, and observability best practices. **Required Technical And Professional Expertise** - 12+ years of experience in Performance Engineering, Quality Engineering, or Automation Architecture. - Strong programming and scripting skills in Python, Bash, or Go. - Hands-on experience with distributed storage systems (Ceph, GlusterFS, MinIO, or similar). - Deep understanding of Linux internals, networking, and storage I/O performance. - Proficiency in CI/CD frameworks (Jenkins, GitLab CI, or similar). - Experience with cloud infrastructure platforms (OpenStack, OpenShift, or Kubernetes). - Familiarity with performance and workload tools (FIO, COSBench, vdbench, or equivalent). - Experience with infrastructure automation (Ansible, Terraform). - Strong problem-solving and analytical skills across distributed environments. **Preferred Technical And Professional Experience** - Experience integrating AI/ML-driven frameworks for CI analytics or performance insights. - Knowledge of monitoring and observability tools (Grafana, Prometheus, ELK stack). - Experience in container performance tuning and Kubernetes-based workloads. - Understanding of cloud networking and distributed storage architectures. - Contributions to open-source or distributed systems communities