Site Reliability Engineer

  • Overview: The company is one of the largest retail e-commerce companies in North America. The Nordstrom Operations Center (NOC) ensures that all services run smoothly and that dev teams get the support they need to operate.

  • Responsibilities

    • Monitor critical business systems from 9am to 5pm (shifts cover 7 days a week, but no more than 8h/day and 40h/week)
    • Facilitate work on incidents
    • Respond to support requests from internal users
    • When everything works as expected, learn new technologies or work on self-driven initiatives =)
  • We Require

    • Good English for communicating directly with customers, and strong communication skills;
    • Experience with Linux/Unix systems administration (monitoring, troubleshooting, performance tuning, preventative maintenance, and capacity planning);
    • Experience with configuration management, source control, and containerization tools;
    • Experience with cloud-based infrastructure and automation;
    • Experience with at least one scripting language (e.g. Bash, Python, Ruby, Go);
    • Experience with site/infrastructure monitoring systems (such as AWS CloudWatch or Datadog);
    • Understanding of networking (TCP/IP, routing, network topologies and hardware, SDN, etc.);
    • Broad understanding of large-scale system architecture, automation, integration, and processes.
