SRE - Site Reliability Engineer - EU

Multiple Locations: Prague, Praha, Czechia • United Kingdom • Czechia • Netherlands

Apply

Requisition Number: 1363

Position Title: Software Engineer_G01

External Description:

Site Reliability Engineer (SRE)

Own the product!

The Site Reliability Engineer (SRE) will work with other members of the SRE team supporting software engineers to build highly reliable and performing infrastructure.  Typical projects will include developing automated solutions for operational aspects like capacity planning, performance and improving site reliability. 

Responsibilities 

  • Hands-on design, analysis and troubleshooting of highly-distributed large-scale production systems;
  • Ownership of reliability, uptime, capacity, and performance analysis thereof
  • Ensuring the repeatability, traceability, and transparency of our infrastructure automation
  • Identifying highest-impact opportunities to optimize existing systems
  • System design consulting for teams seeking to leverage or improve their production infrastructure
  • Anticipate, build and plan capacity for upcoming product/feature launches
  • Practice sustainable incident response and blameless postmortems

 

Job Requirements

  • 1-3 Years of experience required
  • Bachelor's degree in Computer Science, a related technical field involving systems engineering (e.g., Physics or Mathematics) or equivalent practical experience.
  • Experience in one or more of the following: C, C++, Java, Python, Go, Perl, Ruby or shell scripting, Yaml, Json.
  • Experience with Unix/Linux operating systems internals and administration (e.g., filesystems, inodes, system calls, etc) or networking (e.g., TCP/IP, routing, network topologies and hardware, SDN, etc.).  AWS Services (i.e. CloudFormation, CloudWatch, EKS, Landing Zone, Administration, etc.), GCP Services (i.e. Data Flow, SubPub, BigQuery, BigTable, etc.)

 

Preferred Requirements  

  • Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
  • Experience in Terraform
  • Ability to debug and optimize code and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective communication skills and a sense of ownership and drive
  • Strong problem solving, root cause analysis and systems engineering skills
  • Good presentation and communication skills

City:

State:

Country:

Community / Marketing Title: SRE - Site Reliability Engineer - EU

Company Profile:

Monster (Randstad Group) is the worldwide leader in successfully connecting people to job opportunities. From the web, to mobile, to social, we help companies find people with customized solutions and we use the world's most advanced technology to match the right people to the right job.

We've made it our mission to help companies find better candidates. And nobody brings more cutting-edge tools to help them do just that than Monster. Whatever their needs are, we have the products and technologies to build a bespoke solution for our clients, to help them find #TheRightFit.

Innovation is the heart of our success... and our future. We're changing the way people think about work, and we're helping them improve their lives and their work performance with new technology, tools and training.

What makes Monster great…

Monster is synonymous with innovation; we are passionate about bringing great people and great companies together. In fact, we are obsessive about it – it’s what we do every day. We believe that the work that we do has a noble purpose... Making people’s lives better.

At Monster, we let people breath, giving everyone the opportunity to shape their destiny and provide the development support that allows them to do so.

Find out more about Working at Monster here: https://www.monster.com/about/working-here/

Location_formattedLocationLong: Prague, Praha CZ

CountryEEOText_Description: