SRE Architect 6
Comcast Reliability Engineering is looking for SRE Architect. Our architect will be a key technology leader dedicated to implementing the tenets of Site Reliability Engineering across engineering and operations teams within Comcast's Technology, Product and Experience (TPX) organization. Our architect will support multiple projects simultaneously, and must be able adapt to varying technologies, methodologies and
-Architect will evangelize Reliability Engineering SRE tenets for agile change, incident remediation,
and service level measurement with "automation at every step."
-Architect will develop an extensible "push on green" RE change model factoring in risk impacts, self-heal capabilities, monitoring sophistication and past performance that can be applied and adapted by various product, application and network teams within TPX.
-Architect responsible for collaboratively designing integration workflow between development
teams' continuous integration and deployment pipelines and Reliability Engineering's IOP Core (Change, Incident Management, and Event Correlation).
-Architect will lead an Automation Working Group for TPX with the intent of developing a strategy for determining ownership and consistent implementation of diagnostic and remediation processes in such away they are transparent to the IOP Core and other interested parties within the TPX ecosystem.
-Architect responsible for designing monitoring standards and integrations between TPX monitoring
and alarm platforms and IOP Core's event correlation engine.
-Architect will develop TPX strategy around SRE Service Level measurement and algorithmic
determination of change budget based on application performance and ecosystem impact.
-Provide technical leadership and mentoring across the RETINA development teams that support the
-Code prototypes as needed to illustrate SRE tenets and RE implementation models in action.
-Evangelize with "seeing is believing" delight.
Knowledge and Skill Requirements:
-Bachelor's degree in Computer Science or Information Systems related discipline
-5 years active development experience with the capabilities of true generalist able to pick-up and
drop various programming languages, standards, and protocols.
-10 years technical leadership experience at Principal Engineer-grade or higher.
-5 years experience with DevOps, continuous integration, tools, test-driven development and error driven development.
-Working knowledge of cloud implementation (OpenStack, AWS)
-Working knowledge of monitoring platforms (Nagios, Sensu)
Comcast is an EOE/Veterans/Disabled/LGBT employer