Sample Root Cause Analysis Report
Key Takeaways
- A well-structured RCA report captures what failed, why it failed, and what will prevent it from failing again.
- Every report should include a problem statement, timeline, root cause, contributing factors, corrective actions, and a verification plan.
- Common RCA methods include the 5 Whys, fishbone diagram, fault tree analysis, and failure mode and effects analysis (FMEA).
- The report is only complete when corrective actions have been assigned, dated, and reviewed against outcomes.
- Using a consistent template across your facility improves investigation quality and enables trend analysis over time.
What Is a Root Cause Analysis Report?
A root cause analysis report is the formal output of an RCA investigation. It documents the entire investigation process in a structured format so that findings can be reviewed, shared, acted upon, and referenced in the future. Unlike a simple incident log, an RCA report explains not just what happened but why it happened at a fundamental level.
The report serves two audiences. Maintenance and reliability teams use it to drive corrective action. Managers and plant leadership use it to understand systemic risks, justify capital investment, and demonstrate due diligence for safety and compliance purposes.
Without a formal report, investigation findings rarely translate into lasting change. The report is the mechanism that converts analysis into accountability.
Key Sections of an RCA Report
A complete RCA report follows a logical structure that walks the reader from the event through to the solution. Each section builds on the last, creating a traceable record of how the team reached its conclusions.
Executive Summary
The executive summary is a brief overview of the incident, the root cause, and the corrective actions assigned. It is written for readers who need the key findings without working through the full document. Keep it to one paragraph or a short bullet list.
Problem Statement
The problem statement defines the failure precisely. It should answer: what failed, when it failed, where it failed, and what the measurable impact was. Avoid vague language. "Pump P-201 seized at 14:32 on March 15, causing a two-hour production shutdown and an estimated $18,000 in lost output" is a strong problem statement. "The pump stopped working" is not.
Timeline of Events
The timeline maps out the sequence of events leading up to, during, and immediately after the failure. It is sourced from maintenance logs, sensor data, operator observations, and work order history. A clear timeline often reveals decision points where the failure could have been caught earlier.
Root Cause
This section states the fundamental cause: the single condition or action that, if corrected, would prevent the failure from recurring. Root causes are typically physical (a broken component), human (a procedural error), or latent (a systemic gap in process or design). Avoid stopping at symptoms. If the answer to "why" still has a deeper answer, keep digging.
Contributing Factors
Contributing factors are conditions that increased the likelihood or severity of the failure without being the root cause. Common examples include delayed maintenance, inadequate training, missing inspection steps, or poor environmental conditions. Document all contributing factors even if corrective actions focus on the root cause first.
Corrective and Preventive Actions
This is the most critical section for preventing recurrence. Each action should be specific, assigned to a named owner, and given a due date. Actions fall into two categories: corrective actions address the current failure, and preventive actions reduce the risk of similar failures elsewhere in the facility.
Verification Plan
The verification plan defines how the team will confirm that corrective actions were effective. This may include a follow-up inspection date, a performance benchmark the equipment must meet, or a review of maintenance records after a defined period. Without verification, the RCA cycle is incomplete.
RCA Report Template: Sample Structure
The table below shows each section of a standard RCA report, its purpose, and an example of what the content looks like in practice. Use this as a starting point and adapt it to your facility's reporting standards.
| Section | Purpose | Example Content |
|---|---|---|
| Executive Summary | Brief overview for leadership and stakeholders | "Bearing failure on Compressor C-104 caused a 6-hour shutdown on March 10. Root cause: lubrication interval not updated after duty cycle change. Actions assigned to maintenance planner and reliability engineer." |
| Problem Statement | Precise definition of what failed and the impact | "Compressor C-104 shut down at 08:14 on March 10 due to bearing overtemperature alarm. Resulted in 6 hours of unplanned downtime and $24,000 in lost production." |
| Timeline of Events | Chronological sequence of events from normal operation to failure | Feb 28: Duty cycle increased. Mar 5: Elevated vibration noted in sensor log. Mar 9: Temperature alarm triggered (first occurrence). Mar 10 08:14: Shutdown on overtemperature. |
| Root Cause | The fundamental cause of the failure | "Lubrication interval was not updated when the duty cycle increased on Feb 28. The bearing operated for 10 days without adequate lubrication at the new load level, causing accelerated wear and overheating." |
| Contributing Factors | Conditions that increased risk or severity | No formal process for updating PM intervals after operational changes. Temperature alarm threshold set too high relative to bearing operating range. Mar 9 alarm not actioned within the shift. |
| Corrective Actions | Specific actions to fix the root cause and prevent recurrence | 1. Update lubrication PM interval for C-104 (Owner: J. Silva, Due: Mar 17). 2. Create change management checklist for duty cycle modifications (Owner: Reliability Eng., Due: Mar 31). 3. Review alarm thresholds for all rotating equipment (Owner: Maintenance Mgr., Due: Apr 15). |
| Verification Plan | How the team confirms the fix worked | "Review bearing temperature trend data at 30-day mark. Confirm lubrication PM completed on revised schedule. Sign-off by reliability engineer required before closing." |
Common RCA Methods and When to Use Them
The method you choose shapes how you structure the investigation and what questions appear in the report. Different methods suit different failure types and levels of complexity.
| Method | How It Works | Best For |
|---|---|---|
| 5 Whys | Ask "why" repeatedly until the root cause is exposed. Each answer becomes the next question. | Straightforward, single-path failures with a clear causal chain |
| Fishbone Diagram (Ishikawa) | Map potential causes across categories (machine, method, material, man, environment, measurement) branching toward the failure event. | Failures with multiple potential causes that need systematic exploration before a root cause is confirmed |
| Fault Tree Analysis | Build a top-down logic diagram from the undesired event through all possible causes, using AND/OR gates to show relationships. | Complex, multi-system failures where multiple conditions must combine to cause the event |
| Failure Mode and Effects Analysis (FMEA) | Systematically identify all possible failure modes, their effects, and their likelihood and severity to prioritize risk. | Proactive risk analysis during design or process review, or when recurring failures suggest a systemic gap |
Many investigations combine methods. A team might use a fishbone to explore all possible causes, then apply the 5 Whys to the most likely branch to reach the root cause. The method section of your RCA report should state which approach was used and why.
How to Write an Effective RCA Report
A technically sound investigation loses its value if the report is unclear or incomplete. These practices improve both the quality of the report and the likelihood that corrective actions get implemented.
Write for a reader who was not at the scene
Assume the reader has no prior knowledge of the equipment or the event. Define technical terms. Spell out acronyms. Provide enough context in the problem statement and timeline that someone from another department can follow the logic without asking follow-up questions.
Separate facts from conclusions
The timeline section should contain only verified facts: sensor readings, work order dates, operator logs, and physical observations. Conclusions and interpretations belong in the root cause and contributing factors sections. Mixing them creates confusion and invites challenges to the findings.
Use data to support the root cause
A root cause statement is stronger when backed by evidence. Reference the sensor trend that showed bearing temperature rising over three days, the work order history that confirms the last lubrication date, or the vibration data from failure analysis. Claims without supporting data are harder to act on and easier to dispute.
Make corrective actions specific and owned
Vague actions do not get completed. "Improve lubrication practices" is not an action. "Update PM-114 lubrication task frequency from 30 days to 14 days in the work order system by March 24, assigned to J. Pereira" is an action. Every item should have a verb, a measurable outcome, a named owner, and a due date.
Link the report to your maintenance documentation system
Store completed RCA reports in your maintenance documentation system and cross-reference them to the relevant work orders and asset records. This creates a searchable history of past failures that supports future investigations and helps identify recurring patterns across similar equipment.
Close the loop with verification
The report is not complete when actions are assigned. It is complete when actions are verified. Build a review date into the report and assign someone to confirm the outcomes. If corrective actions did not produce the expected result, the investigation should be reopened rather than closed.
RCA Reports and Recurring Equipment Failures
Equipment failure is rarely a one-time event. When maintenance teams track RCA reports over time, patterns emerge: the same failure mode appearing on similar assets, the same contributing factors listed in report after report, the same corrective actions that keep closing without verification. This pattern analysis is one of the most valuable outputs of a mature RCA program.
Facilities that review RCA history before scheduling corrective maintenance jobs often discover that the proposed fix has already been tried. Knowing this changes the scope of the repair and the investigation that follows it.
A connected maintenance platform that links work orders, sensor data, and RCA reports in one place makes this kind of analysis faster and more reliable than manual review of paper records or disconnected spreadsheets.
The Bottom Line
A root cause analysis report is the document that turns an investigation into lasting improvement. Without it, findings stay in someone's memory, corrective actions get deprioritized, and the same failures repeat. With it, every incident becomes a data point that builds a more reliable facility over time.
The most effective RCA reports are specific, evidence-based, and actionable. They name owners, set deadlines, and define how success will be measured. Teams that treat the report as a living document, rather than a filing requirement, see the most meaningful reduction in repeat failures.
If your maintenance data is fragmented across spreadsheets, paper records, and disconnected systems, producing consistent, high-quality RCA reports becomes harder than it needs to be. A platform that connects sensor data, work order history, and asset records gives your team the evidence it needs to write reports that hold up and drive real change.
See How Tractian Supports Faster, More Accurate Failure Investigation
Tractian connects real-time sensor data, asset history, and maintenance records in one platform, giving your team the evidence needed to identify root causes quickly and prevent repeat failures.
See How Tractian WorksFrequently Asked Questions
What should a root cause analysis report include?
A complete root cause analysis report should include an executive summary, a clear problem statement, a timeline of events leading up to the failure, identification of the root cause and contributing factors, corrective and preventive actions, and a verification plan to confirm the fix worked.
How long should a root cause analysis report be?
Length depends on the severity and complexity of the incident. A minor equipment failure may require only one to two pages. A critical failure affecting safety, production, or multiple systems may require a detailed report of five to fifteen pages with supporting data, timelines, and attachments.
What is the difference between a root cause and a contributing factor?
A root cause is the fundamental reason a failure occurred. Eliminating the root cause prevents recurrence. A contributing factor is a condition that increased the likelihood or severity of the failure but did not cause it on its own. Both should be documented in the report, but corrective actions must address the root cause first.
Who should receive the completed RCA report?
Distribution depends on the scope of the failure. Typically, the maintenance manager, operations manager, reliability engineer, and relevant technicians receive a copy. For safety incidents or systemic failures, plant leadership, HSE teams, and sometimes external regulators may also need to be notified.
Related terms
Routine Maintenance
Routine maintenance is a set of recurring scheduled tasks performed at fixed intervals to prevent equipment deterioration, detect early faults, and sustain reliable operation.
Return on Investment
Return on investment (ROI) measures the net gain from a maintenance program relative to its cost. Learn the formula, worked examples, and how to calculate ROI for predictive maintenance.
Rotable
A rotable is a repairable component removed from service, repaired, and returned to inventory for reuse. Learn rotable vs consumable differences, pool management, and industry applications.""SAE JA1011
Robot Maintenance
Robot maintenance covers scheduled and condition-based activities to keep industrial robots operating reliably. Learn key components, failure modes, maintenance schedules, and how condition monitoring applies.
RTU (Remote Terminal Unit)
An RTU (Remote Terminal Unit) is a ruggedized field device that acquires I/O signals, runs local logic, and communicates with a SCADA master over DNP3, IEC 60870, or Modbus.