Description
Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Software Engineer II What is Mastercard?---------------------------------
Mastercard is a global technology company in the payments industry. We power payments and deliver innovative products and services for consumers, businesses, and governments worldwide. Our people, technology, data, and brand enable experiences that make everyday life safer, simpler, and smarter.
We believe our success is driven by the skills, integrity, and mindset of the talent we hire. By fostering an inclusive and high-performance culture, we offer our people the opportunity to work on platforms that operate at massive global scale and deliver real-world impact.
Role Overview
---------------------------------
We are looking for a Site Reliability Engineer (SRE) to help operate, scale, and continuously improve our on-premise OpenShift platform running on VMware. This role is well suited for an engineer who enjoys platform ownership, has strong hands-on OpenShift administration experience, and brings an automation-first mindset.
The role includes a blend of platform operations and continuous improvement initiatives. Consistency and repeatability are key aspects of operating a large-scale platform, and the ideal candidate naturally looks for opportunities to streamline, automate, and standardize operational activities.
Strong communication skills and the ability to collaborate effectively across global teams are essential.
Key Responsibilities
---------------------------------
• Operate and administer on-premise OpenShift clusters hosted on VMware infrastructure, ensuring reliability, stability, and performance.
• Perform hands-on OpenShift administration, including cluster health checks, upgrades, lifecycle management, and troubleshooting.
• Work closely with VMware virtualization layers (compute, storage, networking) to support and optimize the platform.
• Identify operational patterns and implement automation to improve efficiency, consistency, and reliability.
• Troubleshoot and resolve platform and infrastructure issues across Kubernetes, networking, storage, and virtualization layers.
• Ensure platform services meet defined availability, performance, and service level objectives.
• Participate in incident management, root cause analysis, and follow-up improvement actions.
• Collaborate with application teams, architects, and platform stakeholders to enable reliable workload onboarding and operations.
• Contribute to operational excellence, continuous improvement, and platform standardization.
• Participate in agile ceremonies such as stand-ups, retrospectives, and planning sessions.
• Maintain clear documentation, runbooks, and operational procedures.
What We Are Looking For?
------------------------------------------
Core Skills & Mindset
• 3+ years of experience operating on-premise Kubernetes platforms, preferably Red Hat OpenShift.
• Strong hands-on experience with OpenShift administration and day-2 operations.
• Solid understanding of on-premise virtualization, ideally VMware (compute, storage, and networking).
• A self-motivated and driven engineer who takes ownership of platform reliability and improvements.
• Strong analytical and problem-solving skills, with a structured approach to investigations.
• Clear and effective communication skills, both written and verbal.
• A natural inclination toward automation, standardization, and continuous improvement.
• Comfortable working with structured, repeatable operational processes while continuously enhancing them through automation.
Technical Experience (Preferred)
• Experience with Infrastructure as Code and automation tools, such as: Ansible, Terraform, or similar technologies
• Familiarity with: CI/CD tools (e.g., Jenkins)
• Container tooling (Docker, Kubernetes)
• Secrets management (HashiCorp Vault)
• Artifact repositories (Artifactory)
• Observability and monitoring tools: Prometheus, Grafana, ELK/Loki, Splunk, Dynatrace
• Scripting experience in one or more of: Bash, Python, Go
• Understanding of high-availability, scalability, and resilience patterns.
• Exposure to public cloud Kubernetes platforms (EKS, AKS) is a plus.
• Kubernetes certifications (CKA, CKAD, CKS) are a plus.
Working Style & Culture Fit
• Effective contributor in a global, distributed team.
• Demonstrates a learning & troubleshooting mindset and adaptability.
• Balances operational stability with innovation and automation.
• Willing to mentor peers and share knowledge.
• Values clarity, reliability, and engineering discipline.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
Abide by Mastercard's security policies and practices;
Ensure the confidentiality and integrity of the information being accessed;
Report any suspected information security violation or breach, and
Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
Apply on company website