Site Reliability Engineer at AP Professionals in Beaverton, OR

AP Professionals
September 16, 2020
Beaverton, OR
Job Type


Apply You will be redirected to AP Professionals's preferred application process.

Site Reliability Engineer

Employment Type: W2 Contract

REMOTE CURRENTLY, sometime in 2021 ideally would want someone onsite in Beaverton, OR

****No C2C *****

Site Reliability Engineer


As a site reliability engineer, you will be focused on maximum availability, observability, reliability, security, and performance for our Digital Experiences.

SREs perform deep problem analysis, detect infrastructure or code defects, define, report, and create observability processes for Key Performance Indicators (KPIs), and work with product delivery teams to provide long term solutions to production issues.


Define roadmap and architecture based on technology and business needs.

Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, past performance of software, network, and system to ensure that we can continue to scale without increasing operational burden or toil.

Build infrastructure and drive projects that break things with the aim to improve the robustness of production systems

Use the core Site Reliability Engineering principles of Monitoring, emergency response, capacity planning, and production readiness reviews to run the platform.

Step back to observe patterns and develop innovative tools and automation to minimize toil. Use those learnings to drive the best operational practices.

Unblock, support, and effectively communicate across teams to achieve results

Diagnose and develop fixes to implement quickly and efficiently for production incidents.


Proficient in Java with 5+ years experience.

Experience with Java
Script on frontend (React, Angular, etc.) and backend (Node.js) components.

3 years experience in building cloud-based enterprise systems, ideally on AWS.

Basic understanding of DNS, Networking, Virtualization, Linux.

Experience with Docker/Containers and Serverless patterns.

Expertise in designing, debugging and running fault-tolerant large scalable Distributed systems.

Expertise in NoSQL datastore systems to build highly scalable solutions. Experience with messaging (pub-sub) patterns

Good understanding of async/non-blocking Restful APIs approaches and frameworks

Basic understanding of the following tools: ServiceNow, Jira, Jenkins, Splunk, SignalFx, NewRelic.

Good communication skills Good to have skills

Experience with python or Scala

Experience with test driven development

Background with ITIL or Lean a plus

Experience with code instrumentation for adding Metrics & Traces.

Demonstrated negotiation and influencing skills.


Requires a Bachelors Degree in Computer Science, Engineering, IT or a related field; MBA a plus. Minimum of 3 to 5 years of relevant work experience.

Job Expires: 2020-12-15
Apply You will be redirected to AP Professionals's preferred application process.

Rate This Job

Related Jobs

Uh oh! Something went wrong. Please try again.
We were unable to find any more job. Have you tried changing your search keywords?