Senior Director of Software Platform and Continuous Engineering - Austin, Texas
Company: Electric Power Engineers
Location: Austin, Texas
Posted On: 01/17/2025

Overview
We are designing the grid of the future!
Be a part of an innovative team shaping the grid of the future through advanced energy intelligence. For more than half a century, Electric Power Engineers (EPE) has partnered with power and energy clients across the globe, providing consulting expertise and energy intelligence software solutions for complex engineering and grid modeling challenges. As leaders in the renewables space, we are focused on building a modern, secure, and resilient grid. Join us in making an impact on the communities we serve and the environment in which we live. Together we can transform the future of energy.

Responsibilities
Join us in leading the change!
The Sr. Director of Software Platform and Continuous Engineering will design, develop, and manage our backend platform, API layers, and data infrastructure. The ideal candidate will also oversee our AI and ML initiatives, implement DevOps, DevSecOps, and SRE practices, and manage our cloud environments. Additionally, this leader will develop a centralized reporting platform to enhance analytics, efficiency, and data-driven decision-making across the organization. This hands-on, strategic role requires a visionary leader who can build and lead a world-class engineering team, establish robust processes, and drive technology adoption and innovation.

How you can make an impact:
- Backend Platform Development & API Management
- Lead the design, architecture, and development of the backend platform and API layer.
- Ensure scalability, reliability, and security of all backend systems.
- Collaborate with cross-functional teams to ensure seamless integration of APIs and backend services.
- Data Engineering & Databases
- Design and implement a robust data engineering strategy that ensures efficient and secure data pipelines, storage, processing, and retrieval.
- Oversee the management and optimization of databases, data lakes, and other data systems.
- Build intelligent and secure data pipelines and infrastructure that support analytics and AI/ML workflows for software solutions and EPE's other enterprise systems.
- AI and Machine Learning (AI/ML), Analytics, and Reporting
- Develop and implement AI and ML strategies that align with the company's vision and goals.
- Lead the development of AI/ML models, ensuring they are efficient, scalable, and integrated into the company's products and services.
- Stay abreast of industry trends to ensure the company leverages cutting-edge technologies.
- Lead the design and implementation of a centralized reporting platform that integrates enterprise systems (e.g., FinOps, ERP, HRMS, Salesforce) to streamline data structures, enhance predictive analytics, and optimize resource management, financial forecasting, and HR planning. The platform should reduce reliance on ad hoc reporting and free software engineers to focus on product development.
- Collaborate with senior leadership across Finance, HR, IT, and lines of business (LOBs) to align reporting solutions with organizational goals, ensuring secure, reliable automated tools that drive data-driven decision-making, operational efficiency, and enterprise-wide productivity.
- DevOps, DevSecOps, and CloudOps
- Establish and maintain DevOps and DevSecOps practices to ensure continuous integration, delivery, and deployment.
- Automate software release processes and implement monitoring solutions for platform health and security.
- Drive collaboration between development, operations, and security teams to create a streamlined, secure, and efficient development environment.
- Lead the cloud architecture and infrastructure strategy, including deployment, monitoring, and maintenance of cloud resources.
- Optimize cloud resources to ensure high availability, security, and cost efficiency.
- Site Reliability Engineering (SRE)
- Implement and lead SRE practices to ensure the reliability, availability, and performance of our systems and services.
- Develop and manage monitoring, alerting, and incident response protocols to maintain system health and uptime.
- Automate infrastructure management, capacity planning, and system scaling to minimize downtime and enhance system resilience.
- Foster a culture of observability and proactive monitoring, ensuring system metrics are continuously tracked and analyzed for improvements.
- Collaborate with development teams to build software and systems that are reliable, scalable, and optimized for performance.

Qualifications
Bring your passion. Here's what's needed:
- Experience
- 10+ years of experience in software engineering, data engineering, AI, and SRE, with at least 8 years in a senior software development leadership role.
- Proven experience in backend platform development, API management, cloud infrastructure (AWS, Azure, GCP), and SRE practices.
- Demonstrated hands-on experience with databases, data engineering frameworks, and familiarity with AI/ML technologies, as well as cloud architecture.
- Solid background in DevOps/DevSecOps practices, site reliability engineering, and cloud operations.
- Technical Skills:
- Previous knowledge of programming languages such as Python, Java, Go, or similar.
- Experience with cloud services (AWS, Azure, GCP), containerization (Docker, Kubernetes), and CI/CD tools (Jenkins, GitLab, etc.).
- Strong understanding of data architecture, ETL pipelines, data lakes, and database technologies (SQL, NoSQL).
- Familiarity with AI/ML frameworks and platforms like TensorFlow, PyTorch, Vertex AI, and cloud-native AI services.
- Familiarity with monitoring and observability tools (New Relic, Dynatrace, Prometheus, Grafana, Datadog) and SRE principles such as error budgets and service-level objectives (SLOs).
- Leadership Skills:
- Demonstrated ability to build, scale, and lead high-performance backend, DevOps, SRE, and data engineering teams.
- Strong strategic thinking and problem-solving skills with a focus on innovation and continuous improvement.
- Excellent communication and collaboration skills to work effectively with cross-functional teams and stakeholders.

How we support you: