Site Reliability Engineer
Employment Type: W2 Contract
REMOTE CURRENTLY, sometime in 2021 ideally would want someone onsite in Beaverton, OR
****No C2C *****
Site Reliability Engineer
As a site reliability engineer, you will be focused on maximum availability, observability, reliability, security, and performance for our Digital Experiences.
SREs perform deep problem analysis, detect infrastructure or code defects, define, report, and create observability processes for Key Performance Indicators (KPIs), and work with product delivery teams to provide long term solutions to production issues.
Define roadmap and architecture based on technology and business needs.
Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, past performance of software, network, and system to ensure that we can continue to scale without increasing operational burden or toil.
Build infrastructure and drive projects that break things with the aim to improve the robustness of production systems
Use the core Site Reliability Engineering principles of Monitoring, emergency response, capacity planning, and production readiness reviews to run the platform.
Step back to observe patterns and develop innovative tools and automation to minimize toil. Use those learnings to drive the best operational practices.
Unblock, support, and effectively communicate across teams to achieve results
Diagnose and develop fixes to implement quickly and efficiently for production incidents.
Proficient in Java with 5+ years experience.
Experience with Java
Script on frontend (React, Angular, etc.) and backend (Node.js) components.
3 years experience in building cloud-based enterprise systems, ideally on AWS.
Basic understanding of DNS, Networking, Virtualization, Linux.
Experience with Docker/Containers and Serverless patterns.
Expertise in designing, debugging and running fault-tolerant large scalable Distributed systems.
Expertise in NoSQL datastore systems to build highly scalable solutions. Experience with messaging (pub-sub) patterns
Good understanding of async/non-blocking Restful APIs approaches and frameworks
Basic understanding of the following tools: ServiceNow, Jira, Jenkins, Splunk, SignalFx, NewRelic.
Good communication skills Good to have skills
Experience with python or Scala
Experience with test driven development
Background with ITIL or Lean a plus
Experience with code instrumentation for adding Metrics & Traces.
Demonstrated negotiation and influencing skills.
Requires a Bachelors Degree in Computer Science, Engineering, IT or a related field; MBA a plus. Minimum of 3 to 5 years of relevant work experience.