Current Statistics
1,581,556 Total Jobs 240,909 Jobs Today 17,821 Cities 222,734 Job Seekers 146,855 Resumes |
|
|
|
|
|
|
Lead Software Engineer, DevOps - Site Reliability Engineering - Newark New Jersey
Company: Capital One Location: Newark, New Jersey
Posted On: 01/30/2025
114 5th Ave (22114), United States of America, New York, New YorkLead Software Engineer, DevOps - Site Reliability EngineeringDo you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. In this role, you will act as a Lead DevOps Engineer on a Site Reliability team in Bank Tech. You'll have the opportunity to be on the forefront of driving a major transformation within Capital One.What You'll Do: - Lead a portfolio of diverse technology projects and a team of developers with deep experience in machine learning, distributed microservices, and full stack systems to create solutions that help meet regulatory needs for the company
- Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
- Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
- Utilize programming languages like Java, Python, SQL, Ruby and Go, Container Orchestration services including Docker and Kubernetes, CM tools including Ansible and Terraform, and a variety of AWS tools and services -
- Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
- Communicate Service Level Objective concepts to product partners and drive agreement on objectives
- Influence the strategic direction of the team, identifying and prioritizing opportunities to improve reliability
- Drive resolution of issues on incident calls, providing systematic and logical approaches and prioritize the work of multiple teams to drive resolution
- Drive improvements to the incident resolution process with the introduction of automation
- Conduct blameless incident reviews and post incident analysis and communicate incident review learnings
- Drive implementation of processes or solutions that improve reliability across multiple platforms
- Identify gaps in automation and develop strategic plans to drive solutions that reduce toil for the platform teams
- Work with other experts to arrive at optimal design and deployment configurations
- Establish standards that improve deployment and system reliability for integration pipelines and recommend approaches for chaos testing a particular system
- Identify and create proactive, automated approaches for system reliability and alerting and identify key performance indicators for a system, including adding, tuning and maintaining alert configurations
- Understand business requirements for system reliability and translate them into implementations such as scaling, failover, timeouts and health checks and work with development teams to test and improve system performance and reliability -Basic Qualifications:
- Bachelor's degree
- At least 6 years of experience in DevOps Engineering (Internship experience does not apply)
- At least 3 years of experience in Cloud Native technologies (Amazon Web Services, Microsoft Azure, Google Cloud Platform)
- At least 4 years of Unix or Linux system administration experiencePreferred Qualifications:
|
|
|
|
|
|
|