Site Reliability Engineer
Job Summary
In this role as a Site Reliability Engineer, you will be an integral member of a dynamic SRE/DevOps team continuously improving our AWS cloud deployment platform, “automation first." This role will report to our Director, Systems & Architecture in our Engineering Department.
Responsibilities
- Drive team initiatives to continuously refine AWS deployment practices for improved reliability, repeatability and security.
- Work closely with the development teams to automate deployment and configuration of infrastructure.
- Design monitoring and alerting for issues like application errors and high memory usage, with centralized log aggregation for quick troubleshooting and trend analysis. Collaborate with stakeholders to deliver proactive notifications and actionable metrics.
- Write code and scripts to automate provisioning of AWS services and to configure services, using tools and languages including AWS CLI / API, Terraform, Ansible, Python, Bash
- Configure build pipelines to support automated testing and deployments using tools including Jenkins, CircleCI, GitHub Actions
- Help refine DevSecOps security practices (including regular security patching, minimum-permissions accounts and policies, encrypt-everything) in compliance with Health IT, government and other standards regulations, implement, and verify them, using tools like the AWS security stack (GuardDuty, Systems Manager, Config), VeraCode, SonarQube, etc. to analyze and verify compliance.
- Document and diagram deployment-specific aspects of architectures and environments, working closely with Software Engineers, Software Engineers in Test, and others in DevOps.
- Troubleshoot issues in production and other environments, applying debugging and problem-solving techniques (e.g., log analysis, non-invasive tests), working closely with development and product teams.
Qualifications
- 3+ years Cloud administration experience (AWS, Azure, GCP)
- Strong experience developing and / or deploying Docker Containers on Kubernetes (Helm, Kustomize, etc)
- Working knowledge of IaC / configuration management tools such as Terraform, Ansible or Puppet
- Recent experience with setup, configuration and monitoring of RDBMS and NoSQL datastores
- A strong understanding of Linux administration including Bash scripting
- Experience in automation using Go or Python
- Experience with log aggregation tools such as Datadog, ELK, Splunk
- Software engineering experience in a modern, high-level language (Ruby, Java, Python, etc.)
- BA/BS degree in Computer Science, Software Engineering, or a related field, or equivalent experience
- Experience in HIPAA/SOC 2 environments
Why work here?
We support thousands of healthcare professionals and cancer patients every day. We are a mission-driven company that believes in transformational innovation. Everyday we develop new, better ways to improve the patient experience and help lower patient costs. Help us innovate through technology to reach the next level in patient care. We believe in each other, believe in our work, and have fun doing it!
This position is full-time & 100% remote, with the option to come into our Seattle and/or Nashville corporate offices on a voluntary basis for local employees. We not only offer competitive compensation, but have a ton of other perks - here are just a few:
- Flexible Work From Anywhere policy (must be US-based)
- Employer subsidized medical, dental and vision, short/long term disability & life insurance
- 401(k) plan and employer matching contributions
- Flexible Vacation
- Parental Leave Benefits
- Annual Hackathons
- Educational stipend
- Employee Referral program
Base Salary Range: $125,000/yr - $140,000/yr + 10% Corporate Bonus
Navigating Cancer is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Our Values
- Work with Purpose: We exhibit an unwavering commitment to enrich the lives of cancer patients and delight our customers.
- Act with Integrity: We communicate openly and honestly, and act with transparency and gratitude.
- Commit to Equity & Inclusion: We’re dedicated to a culture where all people are respected and we believe in creating fair access, opportunity, and advancement for all of our employees.
- Be Agile: We welcome change and actively listen and adapt to deliver creative solutions and improvements.
- Seek Simplicity: We reduce complexity for ourselves, our customers and their patients and seek simplicity in all we do.
- Strive for Growth: We foster personal and professional growth and manage our performance to exceed the company’s goals.