Reliability Engineer - REMOTE

company logo


Apply for this job

Job Description

We are based in San Francisco, however this role is remote! This Jobot Job is hosted by Brendan Thomas Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. A bit about us Our mission is to improve the world's health through compassionate care and innovation. We believe that health is personal, and means so much more than treating illness. We're proud of the care we've provided over the years and the relationships we've developed with our patients, as evidenced by the 5-star reviews we continually receive. People use our service to gain access to some of the best physicians and licensed therapists in the country, all whenever and wherever is most convenient. It's as simple as opening our app on a smartphone or computer. Through live video visits, our hand-picked, US-trained doctors take patient history, perform an exam, and recommend a treatment plan. Prescriptions, if needed, go directly to the pharmacy of choice. While insurance isn't required, tens of millions of Americans enjoy covered medical and mental health visits through employer and health plan partnerships. Why join us? Be a core leading member of a small, elite productengineering team Fluid work hours, fun, fast-paced environment Full benefits + salary + stock options Unlimited PTO 401(k) program with matching Meals provided several days per week Continuing education stipend Finish your day knowing you worked on a product that helps people and has saved lives Job Details We are looking for a reliability engineer to join a talented team in the Platform Services engineering group. The Platform Services group is responsible for ensuring the reliability of our applications in addition to providing tools engineers can use to efficiently develop, test and deliver high-quality code to production. A successful candidate is a self-sufficient engineer, who can define and improve development and incident response processes, observability, and drive incident follow through with data and analysis. Preferred candidates will be ready to effect the implementation of software and infrastructure improvements at all levels of the system, from architecture to code. Work as part of our platform engineering team to improve the quality and stability of our platform Be part of an on-call rotation that monitorsmaintains availability of our application Define and support development and delivery processes Define and support on call and incident response processes Improve Observability and help define SLAs Resolve and analyze issues, using metrics and root causes to provide recommendations, and help to implement the resulting infrastructure, architecture, process, and application improvements. Maintain a working knowledge across our entire platform including backend, frontend, mobile, and infrastructure read and write code to improve the reliability of our applications Evangelize for reliability and performance concerns in code and architecture reviews Interested in hearing more? Easy Apply now by clicking the "Apply Now" button.