Big Cyber Monday Sale Limited Time Flat 70% Discount offer - Ends in 0d 00h 00m 00s - Coupon code: 70spcl

PeopleCert DevOps-SRE Site Reliability Engineering (SRE) Foundation v1.2 Exam Practice Test

Page: 1 / 8
Total 80 questions

Site Reliability Engineering (SRE) Foundation v1.2 Questions and Answers

Question 1

Engineering operational work to scale with a growing application is BEST achieved by addressing which of the following issues?

Options:

A.

Staffing levels

B.

Interruptions

C.

Toil

D.

On-call rotations

Question 2

Why is observability potentially better than traditional monitoring?

Options:

A.

Observability is less expensive than traditional monitoring

B.

Traditional monitoring does not adapt well to the cloud since it focuses on discrete components and applications

C.

Traditional monitoring can struggle to scale when service growth is rapid

D.

Traditional monitoring cannot support containers

Question 3

Following a major outage, an analysis of the outage is conducted. This BEST describes an example of which of the following?

Options:

A.

A follow-up culture

B.

A major incident culture

C.

A postmortem culture

D.

A problem culture

Question 4

What is the primary difference between SRE and DevOps?

Options:

A.

SRE is an implementation of DevOps but focuses mostly on post-production responsibilities

B.

DevOps is mostly for software engineers and SRE is mostly for infrastructure engineers

C.

DevOps encourages closer collaboration between development and operations whereas SRE is about building a silo around production operations

D.

DevOps and SRE are the same thing

Question 5

“Problem-solving with a group of people with different skillsets.”

Which of the following concepts is BEST inferred by the above statement?

Options:

A.

Coordination

B.

Collaboration

C.

Communication

D.

Cooperation

Question 6

What metrics will embracing failure help to improve?

Options:

A.

Mean time to detect and mean time between system incidents

B.

Change lead time and change failure rate

C.

Empirical test data and mean time to recover service

D.

Mean time to detect and mean time to recover

Question 7

Which of the following BEST describes capacity planning?

Options:

A.

Monitoring the percentage of capacity of resources being used over a time period

B.

Activities performed to manage provider resources and provide multiple services

C.

Activities used to create a plan that manages resources to meet service demand

D.

Determining the maximum amount that any resource can accommodate or deliver

Question 8

A bank has been using traditional monitoring tools for ensuring that their systems are available and operating as planned. Their strategic initiatives now include a renewed focus on customer experience as well as identifying ways to scale service.

Why would migrating to an observability approach be important now?

Options:

A.

It’s better for managing container workloads and dynamic architectures

B.

Monitoring at the component level may no longer provide the right data

C.

It is impossible to anticipate all potential problems

D.

All of the above

Question 9

Where should an organization store versioned and signed artifacts that are used to deploy system components?

Options:

A.

In the Configuration Management System (CMS)

B.

In a Subversion source code repository

C.

In a Definitive Media Library (DML)

D.

In a secure artifact repository

Question 10

Reliability is a key pillar of digital experience monitoring and incident management.

Which of the following describes the BEST type of reliability monitoring strategy in SRE?

Options:

A.

A strategy that uses traditional and familiar monitoring tools rather than advanced artificial intelligence

B.

A strategy that instruments observability and provides monitoring insights across all components and layers

C.

A strategy that focuses on monitoring and discovering useful patterns in the performance of all active networks

D.

A strategy that harnesses advanced technologies to measure, analyze, and maintain the fitness of applications

Question 11

Which of the following describes work that would be considered "toil"?

Options:

A.

Work that is devoid of enduring value

B.

Work that has some enduring value but requires manual tasks

C.

Engineering work to add service features

D.

Engineering work that does not add enduring value

Question 12

Which of the following is BEST described as the role responsible to maintain the live incident state document?

Options:

A.

The logistics specialist

B.

The communications lead

C.

The planning specialist

D.

The incident commander

Question 13

What types of outages must fit into an Error Budget?

Options:

A.

Unplanned incidents

B.

Defect fixes

C.

Any planned or unplanned outage

D.

Any change approved by the CAB or decision authority

Question 14

Known workarounds represent what type of toil?

Options:

A.

Linear scaling

B.

Tactical

C.

Automatable

D.

No enduring value

Question 15

Which of the following BEST describes the most important rationale for NOT seeking an SLO of 100% availability?

Options:

A.

It is not realistic for the complexity and scale of services.

B.

The likely result is failure where such targets are defined.

C.

There is no room for improvements if targets are so high.

D.

The user satisfaction score is affected by a low percent.

Question 16

Which of the following is a principle of SRE-Led Service Automation?

Options:

A.

No automated tests in production

B.

Environments provisioned using IaC

C.

Using unsigned artifacts in production

D.

Adding as much hardware as possible

Question 17

Which of the following is the MOST accurate description of Kubernetes?

Options:

A.

A proprietary system developed to automate the integration, building, testing, and deployment of application containers

B.

An independent platform that enables organizations to implement continuous integration and delivery practices

C.

A platform used to manage containers in a cloud environment and also includes automated scaling and failover

D.

An open-source operating system on which containerized applications can be run, monitored, and managed efficiently

Question 18

Which of the following features of Puppet Labs is described as the ability to locate, identify, and group cloud nodes?

Options:

A.

Provisioning

B.

Delivery

C.

Discovery

D.

Insight

Question 19

What is the goal of SRE?

Options:

A.

To spend 50% of a SRE's time on operational tasks and 50% of the time on development tasks to reduce toil

B.

To ensure that Service Level Objectives are consistently met through monitoring and observability

C.

To create highly reliable post-deployment operational systems that align with DevOps and Agile

D.

To create ultra-scalable and highly reliable distributed software systems

Question 20

Identify the defense-in-depth (DiD) layer where data flows in from, and out to, other networks, including the Internet.

Options:

A.

Host layer

B.

Physical layer

C.

Perimeter layer

D.

Data layer

Question 21

Identify the missing word(s) in the following sentence:

Site reliability engineering is a _________ approach to IT operations.

Options:

A.

structural engineering

B.

security engineering

C.

software engineering

D.

simulation engineering

Question 22

In a safety culture, engineers are allowed to do more with the production environment without fear of repercussions.

What else do engineers need to do?

Options:

A.

Share production incidents on social media

B.

Be accountable for their actions

C.

Skip all blameless post-mortems

D.

Avoid being on-call

Question 23

Service Level Objectives (SLOs) are tightly related to

Options:

A.

User experience

B.

Management approval

C.

Change success rate

D.

Toil reduction

Question 24

Which of these approaches can alleviate linear scaling toil?

Options:

A.

Manual scaling of services

B.

Using auto-scaling capabilities

C.

Outsourcing development

D.

Switching cloud providers

Page: 1 / 8
Total 80 questions