Job Information
Frontdoor Sr. Site Reliability Engineer in Pune, India
Site Reliability Engineers (SREs) at Frontdoor are responsible for maintaining the availability and uptime of infrastructure. SREs apply software engineering principles to operational challenges in order to build reliable infrastructure. We try to reduce toil in our everyday work by automating as much as possible.
Responsibilities:
Analyze solutions and implement best practices for our datastores (MongoDB, Elasticsearch, Oracle, Postgres), caches (Redis) and message queuing (Kafka) systems
Work on datastore reliability and performance aspects of infrastructure
Build and maintain automation tooling for infrastructure, CI/CD and observability (monitoring, alerting, logging, tracing) pipelines
Design, build and maintain cloud and container orchestration infrastructure for datastores
Set up intelligent monitoring and alerting so that we’re aware of incidents before outages happen
Plan for growth and capacity of the infrastructure
Implement SRE principles and practices across the organization to improve performance and efficiency
Research and implement solutions to build always-up, always-available, resilient services
Integrate and automate existing manual solutions and processes, build and maintain self-service tools
Participate in an on-call rotation for availability incidents
Skills Requirements
Hands-on experience with administering and managing Elasticsearch and MongoDB Clusters
Hands-on experience with cloud service providers (at least one of GCP, AWS or Azure)
Hands-on experience with at least one configuration management software (Ansible/Chef/Puppet)
Experience with setting up logging (e.g. ELK) and monitoring (e.g. Prometheus) solutions
Working knowledge of containers and at least one container orchestration platform (Kubernetes/Nomad/Mesos/Swarm)
Understanding and experience in at least one CI/CD pipeline (Jenkins/Travis/CircleCI/Gitlab etc.)
Good understanding of Unix/Linux operating systems and their internals
Well versed in the Linux CLI
Apart from shell scripting (sh/bash), proficiency in at least one other programming language (Python/Ruby/Go/Perl)
Working knowledge of at least one distributed version control system (git/bzr/hg)
Ability to write good technical user documentation
Exposure to managing Infrastructure as Code with
Experience Requirement
At least 6 years of hands-on DevOps/SRE experience
At least 4 years of experience managing production infrastructure on any cloud
At least 3 years of experience developing code, either maintaining scripts or applications
At least 2 years of experience in managing production clusters of Elasticsearch, MongoDB, Redis
Nice to Have
Expertise in any of Elasticsearch, MongoDB, PostgreSQL or Kafka
Understanding of the intricacies and internal workings of datastores, which helps in optimizing and architecting solutions around them
Frontdoor is a company that’s obsessed with taking the hassle out of owning a home. With services powered by people and enabled by technology, it is the parent company of four home service plan brands: American Home Shield, HSA, Landmark and OneGuard, as well as AHS Proconnect, an on-demand membership service for home repairs and maintenance, and Streem, a technology company that enables businesses to serve customers through an enhanced augmented reality, computer vision and machine learning platform. Frontdoor serves more than two million customers across the U.S. through a network of more than 16,000 pre-qualified contractor firms that employ over 45,000 technicians. The company’s customizable home service plans help customers protect and maintain their homes from costly and unexpected breakdowns of essential home systems and appliances. With nearly 50 years of experience, the company responds to over four million service requests annually (or one request every eight seconds). For more details, visit frontdoorhome.com.
Job Category: Information Technology
ID: R0015084