Site Reliability Engineer

                                                

We are CARIAD Estonia, international hub of CARIAD, the automotive software company of the Volkswagen Group. CARIAD is building and supporting software solutions for all of Volkswagen Group’s brands. Our mission is to make mobility more safe, sustainable, comfortable, digital, and more fun.

                   

CARIAD Estonia has a growing Digital Services team that forms an integral part of the Digital Business and Mobility Services team of entire CARIAD Group. Engineers in Tallinn focus on supporting the performance and stability of customer-facing apps used by millions of customers around the world. CARIAD Estonia also contributes to the entire mission of CARIAD as the shareholding entity of non- German CARIAD subsidiaries.

                   

We’re looking for talented, digital minds like you to reshape the automotive experience for everyone, everywhere.

                   

In Estonia you are welcomed by an agile team ready to take on new challenges. Across the globe you will join nearly 5,000 software developers and engineers.

                   

Role Summary:                 

The Site Reliability Engineer develops software systems and automated solutions for operational aspects within the CARIAD organization. The position holder is responsible for monitoring applications, services, and infrastructures. As well as building and enhancing the overall observability and define KPIs together with Operation Engineers to ensure monitoring stability and quick response to issues.                                                          

Your tasks:

–  Identify, analyze, and use automation opportunities to improve efficiency and scalability of the manual tasks and services

–  Collaboration with different product / SRE/ OPS teams to perform Root Cause Analysis (RCA), practice blameless Post Mortems, and to improve reliability and velocity of services

– Maintain services once they are live by measuring and monitoring availability, latency, and overall system health

 –  Define Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to represent and measure service quality                

– Be on-call, responding to and managing incidents                        

– Design and implement projects that improve the reliability, efficiency, and performance
                        

–  Align with development teams on feature launches to ensure our customers are delivered reliable and scalable functionality
                      

– Improve team practices through guiding, coaching, and supporting the Ops team

                           

Competencies / Qualification:

– Successfully completed study in the field of computer science, or a comparable course of studies

 – Expertise in designing, analyzing, problem-solving and troubleshooting large- scale distributed systems

–  Ability to debug, optimize code, and automate routine tasks

–   Experience with Cloud Platforms (preferably Azure) as well as monitoring, logging, and application performance solutions (preferably New Relic)

 –  Knowledge with infrastructure as code tools like Terraform, Ansible, Bicep

– Expertise around Linux, networking and security

–  Fluent English skills, in both speech and writing

– Strong collaboration & networking abilities

– Positive attitude and openness to different cultures

                                                                                                    

    

Related Jobs

Graphic Designer

Graphic designer Tallinn, Estonia/Hybrid/Remote We are looking for a talented Graphic Designer to join our...

Chief Sales Officer

Our client, a leading provider of cutting-edge surveillance and security solutions, is seeking a dynamic and...

Head of Product

Head of Product Tallinn, Estonia (Hybrid) We are looking for a talented and experienced Head of Product to...