• Lead Site Reliability Engineer

    Job ID
    14389
    Type
    Regular Full-Time
    Company
    Seattle Cancer Care Alliance
    Location
    US-WA-Seattle
    Category
    Information Technology
  • Overview

    The Seattle Cancer Care Alliance (SCCA), located in Seattle, Washington, is part of a dynamic collaboration among three organizations known nationally and internationally for their patient care and research: Fred Hutchinson Cancer Research Center, University of Washington, and Seattle Children's. Over the past 25 years, these institutions have worked together to support their mission of adult and pediatric oncology patient care services, research and education.


    As a Site Reliability Engineer, you will be responsible for developing and managing cloud server infrastructure and networks, monitoring and improving system performance, building and maintaining CI/CD systems that facilitate build automation and automated testing, and help formulate and implement security policies. You will work with operations, support and engineering teams to ensure the automation platform is capable of serving current and future needs. You will document and drive best-practices across the team.


    The ideal candidate is proficient in coding, thinks in terms of architecture and test automation, and has substantive experience writing code running within a changing environment where servers and databases are interchangeable and impermanent. Candidates should have the ability to take existing application and refactor, redesign with modern solutions (such as containizing existing monolithic applications). Have a strong desire to pursue the principles of infrastructure as code, holistic infrastructure design, immutable infrastructure, and orchestration and automation whenever possible.

    Responsibilities

    • Develop automations to manage the infrastructure.
    • Research/analyze data processing functions, methods and procedures.
    • Monitor production systems for expected performance.
    • Perform root cause analysis on production issues and determine action items for prevention and resolution.
    • Participate in architectural decisions about the next iterations of our cloud environments.

    Qualifications

    • Experience working with AWS (ECS/Fargate is required)
    • Strong knowledge of infrastructure tools (CloudFormation or Terraform)
    • Strong knowledge with one or more configuration management tools (Ansible is preferred)
    • Experience with Docker
    • Strong knowledge with one or more build automation tool (Gitlab is preferred)
    • Experience with application and systems monitoring services and logging solutions (Splunk and Datadog are preferred)
    • Linux systems support in a 24x7 production environment.
    • Ability to manage multiple activities and changing priorities
    • 3-5 years of experience in enterprise level IT projects Cloud based technologies (PaaS, IaaS, Cloud Infrastructure Services)

    Our Commitment to Diversity

    We are committed to cultivating a workplace in which diverse perspectives and experiences are welcomed and respected. We are proud to be an Equal Opportunity and VEVRAA Employer. We do not discriminate on the basis of race, color, religion, creed, ancestry, national origin, sex, age, disability, marital or veteran status, sexual orientation, gender identity, political ideology, or membership in any other legally protected class. We are an Affirmative Action employer. We encourage individuals with diverse backgrounds to apply and desire priority referrals of protected veterans. If due to a disability you need assistance/and or a reasonable accommodation during the application or recruiting process, please send a request to our Employee Services Center at escmail@fredhutch.org or by calling 206-667-4700.

    Options

    Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
    Share on your newsfeed