|
Lead NoSQL Database Reliability Engineer - Boulder Colorado
Company: The Trade Desk Location: Boulder, Colorado
Posted On: 01/26/2025
The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. Handling over 1 trillion queries per day, our platform operates at an unprecedented scale. We have also built something even stronger and more valuable: an award-winning culture based on trust, ownership, empathy, and collaboration. We value the unique experiences and perspectives that each person brings to The Trade Desk, and we are committed to fostering inclusive spaces where everyone can bring their authentic selves to work every day.Do you have a passion for solving hard problems at scale? Are you eager to join a dynamic, globally-connected team where your contributions will make a meaningful difference in building a better media ecosystem? Come and see why Fortune magazine consistently ranks The Trade Desk among the best small- to medium-sized workplaces globally.We are looking to hire a Lead NoSQL Database Engineer to join our engineering team to continue building our data-driven platform and support database related activities. The Trade Desk leverages Aerospike to perform many real-time activities, translating to over 75 million queries per second with a p99 latency under 1 millisecond on the back end! Additionally, our adoption of MongoDB and Kafka is enabling developers to move quicker, making them our fastest-growing technologies. Do you enjoy tuning, performance testing, troubleshooting, writing automation, and influencing use cases? Does operating at scale, tuning, and testing next-gen hardware sound fun to you?WHAT YOU WILL DO: - Operations for Aerospike, Kafka, and Mongo. You will be a point of contact to review new use cases, answer questions, and respond to production issues while participating in an on-call rotation.
- Learn to be a NoSQL SME. You do not need experience to apply - we will train you.
- Lead a team to influence, manage, and plan work streams, systems, and data structures at scale within a global ecosystem, spanning multiple infrastructure providers (cloud and traditional datacenters).
- Encourage, improve, and build upon infrastructure automation in a way that works with stateful systems at scale.
- Benchmark and analyze next-generation hardware offerings.
- Create and maintain alarm definitions that prevent issues, adjusting alarm definitions to prevent alert fatigue.WHO WE ARE LOOKING FOR:Skills and Experience
- Leadership experience and ability to mentor.
- Domain knowledge in one or more of the following:
- Physical (on-prem) server internals, their management and operation
- Linux operating system
- Performing testing and tuning
- Nice-To-Have experience:
- Databases (relational or otherwise)
- Ansible
- Prometheus
- Kubernetes
- Python/Ruby/Rust/Bash/Golang/C#An Empathetic, Objective, Critical Thinker:
|
|