Site Reliability Engineer 2

Job Overview

Location: London, England
Job Type: Full Time
Job ID: 30722
Date Posted: 1 year ago
Recruiter: John Apl

Job Description

Power BI is quite possibly the most exciting Microsoft service today. We have one of the fastest-growing user bases and sit at the top of the BI offerings in the industry, with a large and passionate community around us.

The Power BI team is looking for an SRE to join the Release Management team, which is on a mission to ship quality Power BI products and features to customers around the world, empowering them to make data-driven decisions so they can achieve their goals, constantly improve their teams, and impact the world. Focused on driving a best-in-class delivery experience, the Power BI Release Management team leverages operational excellence, data-driven decision making, and the creation, implementation, and delivery of features to achieve that end. A growing team with incredible room for impact makes this an ideal role for ambitious, motivated candidates with customer passion.

If you are looking for an opportunity to apply your strong development, operational, and engineering skills in a KPI-driven role, come join this fast-paced, growing team.

Responsibilities

Technical Knowledge and Domain-Specific Expertise

Develops a foundational understanding of distributed systems design, interactions between cloud technology layers and components, basic dependencies at scale, and the code that defines infrastructure. Can contribute to the code base that defines components or features of systems or cloud technologies to improve the reliability and operability of supported products, with direction from other engineers.

Develops an understanding of the code, features, and operations of specific products at scale as required to contribute to incremental improvements in product availability, reliability, efficiency, observability, and/or performance; participates in onboarding, code/design reviews, and regular meetings with the engineering teams that develop and/or manage those products.

Contributions to Development and Design

Develops and tests basic changes to optimize code and improve the observability, reliability, and operability of a defined range of platform, system, or product components or features, with direction from other engineers.

Supports ongoing engagements with product engineering teams by participating in code/design reviews, regular meetings, on-call rotations, and incident response throughout product development and operations cycles; draws insights from these engagements and from basic analyses of telemetry data to propose potential improvements to code and designs for a defined set of product components or features, with guidance from other engineers.

Driving Operational Excellence

Implements simple configuration and data changes across a predefined range of product components or features, with guidance from other engineers, to develop an understanding of how configurations, binaries, and data can be managed using code, tooling, and automation.

Develops an understanding of how to safely and reliably manage changes in production by using existing tools and automation to enable product engineering teams to implement changes across a defined range of components or features, with direction from other engineers.

Uses existing tools to troubleshoot problems or flaws affecting the availability, reliability, performance, and/or efficiency of components or features with guidance from other engineers. Suggests potential solutions to resolve and prevent recurring issues and brings them to the attention of other engineers or team leads.

Responds to incidents during regular on-call rotations by identifying the level of impact, troubleshooting basic issues, and deploying appropriate fixes to resolve root cause(s); alerts product teams or owners to major customer-impacting issues and escalates the resolution of complex issues and/or those affecting multiple components or features to other engineers as needed. Shares details related to incidents and their resolution through post-mortem reports and during regular review meetings.

Develops an understanding of key learnings, insights, and best practices that can be applied to improve system, platform, and/or product development and operations by participating in code/design reviews, incident drills and debriefs, and regular meetings, as well as interactions with more experienced Site Reliability Engineers (SREs) and members of product engineering teams.

Qualifications

Required Qualifications

3+ years of technical experience in software engineering and systems administration

Preferred Qualifications

Bachelor's Degree in Computer Science, Information Technology, or related field. 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
