Visa·about 12 hours ago
Site Reliability Engineering (SRE) is essential to Visa’s Cloud platform strategy. In this role, you’ll ensure our development platform and tools let engineers focus on innovation instead of infrastructure. You’ll promote observability best practices and automate resolution of recurring issues, working closely with software engineering teams to support security, availability, and performance. Responsibilities include triaging issues, collaborating on infrastructure management, and setting up monitoring for full coverage. Hands-on expertise is required, especially with major DevTools like GitHub, Jenkins, Jira, and Artifactory.
We seek a Software Engineer + SRE hybrid engineer. The ideal candidate deeply understands at least one major DevTool, quickly resolves tool-related issues in collaboration with developers, and applies systems thinking to maintain reliable applications and infrastructure while improving developer productivity.
Key Responsibilities:
This is a hybrid position. Expectation of days in the office will be confirmed by your Hiring Manager.
Basic Qualifications:
Bachelor's degree, OR 3+ years of relevant work experience
Preferred Qualifications:
Bachelor's degree, OR 3+ years of relevant work experience
Bachelor's degree in IT, CS or related field and-or 3+ Years Working Experience IT Operations and Delivery.
Experience: 3 years in SRE and-or DevTools support roles.
Beginner level programming and-or scripting in 2 or more of the following: Python, Java, Go, PowerShell, JavaScript, Terraform, Ansible, Helm, Chef, Cloud Formation.
Basic understanding of YAML, JSON, HTML, XML.
Hands on experience in Linux and -or Windows systems and good understanding of distributed computing environments.
2 years experience with CI-CD tooling such as Jenkins, Github, Bitbucket, ArgoCD, Artifactory, Azure DevOps in a large-scale environment
2 years experience with observability tooling such as Grafana, Prometheus, Splunk, Datadog, New Relic, DynaTrace, Sentry, etc. in a large-scale environment
2 years experience supporting relational and non-relational databases (MySQL, MongoDB, PostgreSQL, etc.), including creating and running queries, managing performance and scaling
2 or more years working in a Platform, SRE or Production Engineering group for high availability-critical platforms-applications
Experience managing a distributed container platform including but not limited to deployment-release management, provisioning, capacity management, workload management
Experience managing container infrastructure and supporting development transformation to a container first model.
This role requires oncall support as the team provides 24-7 operational support.
Technical Expertise: Proficiency in at least one DevTool (GitHub, Jenkins, ArgoCD, Jira, Artifactory, ).
Strong understanding of CI-CD principles and pipelines.
Solid knowledge of Linux systems, networking, and containerization (Docker-Kubernetes).
Hands-on experience with cloud platforms.
Programming-Scripting: Proficiency in Python, Ansible, or similar languages.
Mindset: Strong problem-solving skills, systems thinking, self-starter, and a passion for reliability.
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.