Staff Site Reliability Engineer

US Remote

At Okta, our motto is "Always On," and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it” and who can rapidly self-educate on new concepts and tools.

You will work on:

  • Design, build and monitor Okta's global production infrastructure
  • Respond to production incidents and determine preventive solutions
  • Build and support logging and metric collection using a stack that includes Splunk, VMWare Wavefront, Zabbix, ThousandEyes, Pingdom, AWS, and more
  • Act as first point of contact for one of our major tools (Zabbix or ThousandEyes an extra plus)
  • Automate manual processes, evolve our monitoring tools, and develop technical documentation
  • Support a highly available and large scale online environment as part of an on-call rotation once per quarter

Qualifications and Requirements:

  • US Person Status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or
    Asylee)*
  • Experience with Federal and DoD compliance requirements - FedRAMP, IL
  • Background using and supporting Splunk, Zabbix, Wavefront, Elasticsearch, Logstash, Kibana, ThousandEyes, or related tools
  • Passionate about automation
  • Experience in chatops tooling, Slack automation, PagerDuty integration
  • Background with Linux systems administration and strong scripting skills in Bash, Ruby, Python, Go, etc.
  • Experience supporting Docker containers and web applications running on Java / Apache / Tomcat in a live production environment
  • Strong expertise with production services in AWS such as EC2, ECS, KMS, Kinesis, CloudWatch
  • Previous experience with automating systems and infrastructure via Ansible, Chef or Terraform
  • Solid understanding of networking concepts and IP protocols
  • Experience with multi-cloud infrastructure is desired

*This position requires the ability to access Impact Level 4 (IL4) data, as defined by the Department of Defense (DoD) Cloud Computing Security Requirements Guide. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.

((Colorado, New York and Washington only*) Minimum OTE of $154,000/year + equity + benefits))

Okta is an Equal Opportunity Employer

Okta is rethinking the traditional work environment, providing our employees with the flexibility to be their most creative and successful versions of themselves, no matter where they are located.  We enable a flexible approach to work, meaning for roles where it makes sense, you can work from the office, or from home, regardless of where you live.  Okta invests in the best technologies and provides flexible benefits and collaborative work environments/experiences, empowering employees to work productively in a setting that best and uniquely suits their needs.  Find your place at Okta https://www.okta.com/company/careers/. 

By submitting an application, you agree to the retention of your personal data for consideration for a future position at Okta.  More details about Okta’s privacy practices can be found at: https://www.okta.com/privacy-policy.

 

#LI-Remote

#LI-ML1

Apply

Resume
Upload Resume/CV (PDF must be less than 8 MB )
Cover Letter
Upload Cover Letter (PDF must be less than 8 MB )
U.S. Equal Opportunity Employment Information (Click here for instructions)

We request this data to promote diversity, inclusion, and belonging and to ensure we maintain fair and equitable hiring practices. Responding to the survey is voluntary.