Lead Site Reliability Engineer - Reston Virginia
Company: Comcast Corporation
Location: Reston, Virginia
Posted On: 11/10/2024
FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can insert advertisements around the world.

Job Summary
Responsible for planning and designing new software and web applications. Analyzes, tests and assists with the integration of new applications.
As the Site Reliability Engineer (SRE), you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for the FreeWheel platforms. You will engage in designing, analyzing and troubleshooting large-scale distributed systems, debugging and optimizing code, and automating routine tasks. You will be part of a team with a healthy mix of software and technology infrastructure backgrounds. You will provide subject matter expertise, resolve complex break/fix scenarios, engage broader teams as necessary, and partner with engineering, vendors and client services to deliver successful technical solutions. You will work with limited supervision and direction while executing associated functions and responsibilities, follow operational practices, and independently determine and develop approaches for non-routine solutions.

Job Description
Core Responsibilities
- Be responsible for reliability and technical operations of FreeWheel TV Platform Ad-Serving component(s).
- Lead technical solutions in measuring and improving reliability, quality and efficiency of FreeWheel platforms.
- Lead a variety of complex analytical duties in the planning, deployment, testing and evaluation of FreeWheel products.
- Possess in-depth working knowledge of FreeWheel platforms, infrastructure, internal processes, and teams/partners.
- Support FreeWheel-powered live events such as the Super Bowl, Olympic Games, March Madness, and FIFA World Cup.
- Plug into the software release cycle and work closely with developers and tech leads to ensure software releases are well designed, planned, implemented, released, and monitored.
- Lead the design and implementation of infrastructure as code, with attention to best practices, tool use, and quality assurance.
- Lead technical solutions for infrastructure and application management, monitoring, and operations with standardization and automation focus.
- Leverage engineering methodologies and technical knowledge in specific areas of focus.
- Lead code level debugging on issues escalated to the team.
- Lead on-call shifts, incident prevention, incident response, and retrospectives.
- Advocate for engineering and technical operations procedures, policies, processes and SRE best practices.
- Partner with developers and vendors to identify and drive improvements in areas including production quality, operational efficiency, and engineering productivity.
- Provide support and influence for the Cybersecurity program needs such as patching, vulnerability cleanup, secure server configuration, testing and validation, technical controls implementation and cybersecurity incident remediation efforts.
- Provide training and coaching to peers and more junior SRE team members.
- Consistent exercise of independent judgment and discretion in matters of significance.
- Regular, consistent and punctual attendance. Must be able to work nights and weekends, variable schedule(s) and overtime as necessary.
- Other duties and responsibilities as assigned.

Minimum Requirements
- Bachelor's degree in computer science, a related engineering field, or equivalent practical experience.
- 7 years of prior experience in software engineering with one of the following programming languages: Python, Golang, JavaScript.
- 5 years of prior technical operations experience with business-critical application(s) on public cloud services (AWS is a big plus): VPC, subnets, network access control lists, security groups, EC2 instances, S3 buckets, IAM, Route 53, Lambda.
- 5 years of prior experience with SDLC tools: Containers, Kubernetes, Docker, Salt / Ansible / Chef / Puppet, Jenkins, Git.
- Prior experience with Linux administration, network security, and system infrastructure.
- Excellent communication and collaboration skills, within and across teams and continents.
- Work / Shift Timings: The selected candidate will be expected to work Eastern Standard Time hours and be able to work weekends during the on-call rotation, which usually runs 12 hours a day, including weekends.

Preferred Requirements
- Prior experience in supporting business-critical services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
- Technical leadership and influence demonstrated in focused product/tech areas and practices.
- Prior experience in providing technical solutions at an internet company.

Employees at all levels are expected to:
- Understand our Operating Principles; make them the guidelines for how you do your job.
- Own the customer experience - think and act in ways that put our customers first, give them seamless digital options at every touchpoint, and make them promoters of our products and services.
- Know your stuff - be enthusiastic learners, users and advocates of our game-changing technology, products, and services, especially our digital tools and experiences.
- Win as a team - make big things happen by working together and being open to new ideas.
- Be an active part of the Net Promoter System - a way of working that brings more employee and customer feedback into the company - by joining huddles, making call backs and helping us elevate opportunities to do better for our customers.
- Drive results and growth.
- Respect and promote inclusion & diversity.
- Do what's right for each other, our customers, investors and our communities.

Disclaimer: