Senior Lead - Site Reliability Engineer Java / APIs
Kyndryl
Posted 6 दिन पहले • Via www.themuse.com
Description
Who We Are
At Kyndryl, we run and reimagine the mission-critical technology systems that drive advantage for the world's leading businesses. We are at the heart of progress; with proven expertise and a continuous flow of AI-powered insight, enabling smarter decisions, faster innovation, and a lasting competitive edge. For our people-Kyndryls-that means doing purposeful work that powers human progress. Join us and experience a flexible, supportive environment where your well-being is prioritized and your potential can thrive.
The Role
To ensure the stability, availability, resilience, and scalability of critical systems, guaranteeing that Java applications, microservices, APIs, and batch chains run according to schedule and reliability objectives, with evidence, traceability, proactive monitoring, and failure recovery, while leading operations under SRE, DevOps, and continuous improvement principles.
1. Reliability, Incident, and AMS Operations Management
- Lead the management of critical incidents and major problems, ensuring root cause analysis (RCA) and remediation plans.
- Oversee the operation and reliability of batch jobs and process chains (Java and SAP) using Control‑M, including dependencies, calendars, alerts, and SLAs.
- Ensure continuous monitoring of critical transactional systems (Salesforce, Tandem, OmniPayments, or equivalent).
- Define and validate escalation, communication, and resolution criteria according to the AMS operating model.
- Ensure proper ticket management in ServiceNow / Jira, including evidence‑based closure and SLA compliance.
2. Reliability Engineering and Advanced Technical Analysis
- Design, implement, and evolve Site Reliability Engineering practices, including:
- SLIs, SLOs, and SLAs
- Operational automation
- Toil reduction
- Analyze complex failures across Java applications, microservices, integrations, and cloud platforms.
- Validate end‑to‑end flows, system dependencies, and single points of failure.
- Lead post‑incident reviews and propose structural reliability improvements.
3. Technical Leadership and Cross‑Functional Coordination
- Act as the technical lead for the AMS service, guiding support, operations, and development engineers.
- Coordinate with Java development teams, architects, DevOps, database, network, and security teams.
- Supervise vendor and technology partner activities.
- Ensure alignment between business needs, software architecture, and production operations.
- Participate in release planning, change windows, and post‑deployment stabilization.
4. Platforms, Development, and Architecture
- Provide expert support and technical leadership in:
- Java 8/11/17, Spring Boot
- Microservices and REST/SOAP APIs
- Messaging platforms (Kafka)
- Integrate cloud applications (Azure / GCP) with legacy systems.
- Apply design patterns, development best practices, and corporate standards.
- Use quality and security tools such as:
- SonarQube, BlackDuck, Fortify, AquaSec
- Version control and collaboration using:
- GIT, Bitbucket
- Implement and mature DevOps and CI/CD practices:
- Docker, Jenkins, Shell scripting
5. Monitoring, Backups, and Service Continuity
- Oversee monitoring, alerting, and observability strategies.
- Ensure correct execution of backup and restore processes:
- CommVault Simpana
- Veeam Backup & Replication
- Validate recovery testing and service continuity plans.
- Ensure compliance with security and operational policies.
6. Documentation, Continuous Improvement, and Governance
- Maintain up‑to‑date technical and operational documentation, including:
- Architecture diagrams
- Procedures
- Runbooks
- Postmortems
- Identify improvement opportunities in reliability, performance, security, and cost optimization.
- Drive automation and service standardization initiatives.
- Ensure compliance with methodologies, best practices, and IT governance guidelines.
Your Future at Kyndryl
Every position at Kyndryl offers a way forward to grow your career. We have opportunities that you won't find anywhere else, including hands-on experience, learning opportunities, and the chance to certify in all four major platforms. Whether you want to broaden your knowledge base or narrow your scope and specialize in a specific sector, you can find your opportunity here.
Who You Are
You're good at what you do and possess the required experience to prove it. However, equally as important - you have a growth mindset; keen to drive your own personal and professional development. You are customer-focused - someone who prioritizes customer success in their work. And finally, you're open and borderless - naturally inclusive in how you work with others.
Required Knowledge and Skills
- Bachelor's or Engineering degree in Systems Engineering, Computer Science, Information Technology, Software Engineering, or related fields
- 8+ years of experience in IT, with strong focus on:
- Production support
- AMS operations
- Java development and architecture
- Reliability and availability of critical systems
- Proven experience leading or guiding technical teams
- Availability for 24/7 operations, including on‑call rotations, release windows, and change windows
- Remote work model
- Languages:
- Spanish: Fluent
- English: Basic to intermediate (technical)
Preferred Knowledge and Experience
- Experience in the retail industry, preferably with El Palacio de Hierro
- Participation in implementations, stabilizations, and migration projects
- Advanced experience leading high‑availability and high‑reliability platforms
- Certifications:
- Java certifications
- ITIL / ITSM
- Cloud certifications (Azure / GCP)
- DevOps / SRE certifications
Being You
The "Kyn" in Kyndryl means kinship, which represents the strong bonds we have with each other, our customers and our communities. We focus on ensuring all Kyndryls feel included and we welcome people of all cultures, backgrounds, and experiences. Even if you don't meet every requirement, we encourage you to apply. We believe in growth, and we're excited to see what you can bring. At Kyndryl, employee feedback has told us that our number one driver of employee engagement is belonging. That sense of belonging - being a valued, respected, trusted member of the team - is fundamental to our culture and fueling great experiences for our customers. This dedication to welcoming everyone into our company means that Kyndryl gives you the ability to thrive and contribute to our culture of empathy and shared success. That's The Kyndryl Way.
What You Can Expect
Your career with us isn't just a job-it's an adventure with purpose. We offer a dynamic, hybrid-friendly culture that supports your well-being and empowers you to grow. Our Be Well programs are thoughtfully designed to support your financial, mental, physical, and social health-because we know that when you feel your best, you do your best.
From your very first day, you'll dive into impactful work that powers the systems our customers rely on every day. You won't just contribute-you'll make a difference, tackling meaningful projects that sharpen your skills and fuel your growth.
We're here to champion your journey. With powerful tools to chart your career path, personalized development goals aligned with your ambitions, and continuous feedback to keep you inspired and on track, you'll have everything you need to thrive and evolve. You'll develop in-demand skills to grow your career and achieve your ambitions with access to cutting-edge learning opportunities-from certifications with Microsoft, Google, and Amazon to coaching and hands-on experiences. And through it all, you'll be part of a culture that values empathy, restless learning, and a devotion to shared success.
We want you to thrive here-and we're committed to helping you do just that. Ready to make an impact? Join us and help shape what's next.
Get Referred!
If you know someone that works at Kyndryl, when asked 'How Did You Hear About Us' during the application process, select 'Employee Referral' and enter your contact's Kyndryl email address.
Similar Roles
Senior Fullstack Engineer, Auction
Sotheby's
Senior Fullstack Engineer, Pricing
Sotheby's
Enrollment Support Specialist
Unum Group