Comcast Careers

Operations Engineer

New York, NY
Technology (Technology - IT)


Job Description

Business Unit:

FreeWheel is hiring a Site Reliability Engineer based in our New York office who will report to the Operations Manager. The ideal candidate works closely with the NOC, DBA's, Technical support and QA teams to support our Advanced Advertising products. The candidate understands systems automation, software deployment, applications monitoring and infrastructure scaling. The Site Reliability is deeply familiar with a variety of tech stacks including Linux based Operating Systems, Containers, Virtualization, DNS, configuration management, Private/Public Cloud infrastructure, Load Balancers, RDBMS, NoSQL, Log collections; and is able to develop, set up, configure and use various tools.

The Site Reliability Engineer will monitor, manage and deploy software components. He/She will plan and execute infrastructure capacity (to account for growth), optimize and monitor servers in pre-production & production regions, of which he/she will ensure availability, backups, updates and security. On-call time is part of the job.

The Site Reliability Engineer will participate in the technological watch, creation and evolution of processes to automate and industrialize deployments in pre-production & production regions. He/She will also partake in the application of patches, server installations, etc. and participate in continuous updates to documentation (Git, Wiki). He/She will consistently practice risk management in all aspects of work, taking Infosec very seriously and will consistently demonstrate skill and experience in managing production to ensure application health. He/She will take quality very seriously and work to quantify and evaluate new ways of measuring quality to ensure systems and application uptime and a positive user experience.


RESPONSIBILITIES:

Maintain and develop tools for software release, monitoring, data analysis, data/file syncing, source code stats and application management (Big Data) running on AWS.
Manage/Deploy/Monitor applications wrapped in containers and Kubernetes.
Measure, evaluate and tune system/application performance via solid data analysis.
Refine and work on automation tools with Jenkins, Salt, puppet, fabric, etc.
Work on internal incubated projects and prove ideas by quickly showing the usable code.
Work on architectural refactoring projects for Advanced Advertising applications.
Automate time-consuming & error-prone manual processes.
Required to work on some weekends and be part of "on call" schedule to support a 24x7 Video Adserving Network.

ABOUT YOU:

Have 3-5 years of hands-on experience working with AWS.
Have 3-5 years of hands-on experience working with configuration management tool such as Chef, Puppet, Ansible, SALT.
Have 3-5 years of hands-on experience with programming language such as Python, Go or Node.
Have 3-5 years of hands-on experience with Hadoop (HDFS, HBase, YARN, Spark, MapReduce, Kafka, Hive).
Have basic knowledge of CI/CD experience with Jenkins.
Experience with virtualization and familiar with private cloud setup.
Familiar with Containers such as Docker, Kubernetes or ECS.
Have basic knowledge in Linux, shell scripting, system programming and system task automation processes.
Strong understanding of best practices for software engineering, system design and scalable fault tolerant web architecture.
Good written and verbal communication skills.
Bachelor's or Master's degree in Computer Science or related field.





Comcast is an EOE/Veterans/Disabled/LGBT employer