What are the main steps to conduct a PFMEA?

A PFMEA typically follows 10 steps: (1) define scope, (2) assemble the cross-functional team, (3) map the process, (4) identify potential failure modes, (5) determine failure effects, (6) identify root causes, (7) list existing controls, (8) calculate the Risk Priority Number (RPN) using Severity, Occurrence, and Detection ratings, (9) develop and assign corrective actions, and (10) reassess and update the RPN after actions are implemented.

When should a PFMEA be used?

A PFMEA should be used before launching a new production process, when an existing process is modified, when quality escapes or recurring defects occur, or when new equipment is introduced. It is also used during annual process reviews to ensure controls remain effective as conditions change.

PFMEA

Name: Condition Monitoring System
Brand: Tractian
Rating: 4.7 (200 reviews)

Definition: Process Failure Mode and Effects Analysis (PFMEA) is a structured method used by cross-functional teams to identify potential failure modes within a manufacturing or service process, evaluate their effects on product quality and safety, and prioritize corrective actions before failures occur in production.

What Is PFMEA?

PFMEA is a risk analysis tool applied specifically to manufacturing and service processes. While related methods focus on what could go wrong in a product's design, PFMEA asks a different question: at each step of the production process, how could the process itself fail to deliver the intended output?

The analysis is performed by a cross-functional team that walks through every process step, documents potential failure modes, links each mode to its root cause, and assesses how likely the failure is to occur, how severe its effect would be, and how detectable it is with current controls. The result is a ranked list of risks that tells teams exactly where to focus improvement efforts.

PFMEA originated in the aerospace and automotive industries, where it remains a mandatory element of quality management systems such as IATF 16949. Today it is used across oil and gas, pharmaceuticals, food processing, and any sector where process reliability directly affects product quality, safety, or regulatory compliance.

PFMEA vs. DFMEA

Both PFMEA and DFMEA belong to the broader FMEA family, but they target different phases of the product lifecycle and different sources of risk.

Dimension	DFMEA	PFMEA
Focus	Product design	Manufacturing or service process
When used	Before design is finalized	Before production launch or process change
Failure source	Design intent, material choices, tolerances	Process steps, equipment, operator variation
Key question	Could the design fail to meet requirements?	Could the process fail to produce the design?
Output	Design changes, tolerance reviews	Control plans, error-proofing, maintenance tasks
Team lead	Design engineering	Manufacturing / process engineering

A related extension of FMEA is FMECA (Failure Mode, Effects, and Criticality Analysis), which adds a formal criticality dimension. PFMEA achieves a similar prioritization through the RPN score rather than a separate criticality matrix.

The 10 Steps of PFMEA

PFMEA follows a disciplined sequence. Skipping or compressing steps is the most common reason PFMEA outputs fail to generate real process improvements.

Define the scope. Identify the process, the process boundaries (start and end points), and the customer requirements the process must satisfy. A scope that is too broad makes the analysis unmanageable; one that is too narrow misses interactions between steps.
Assemble the cross-functional team. Include process engineers, quality engineers, maintenance technicians, operators, and any other function with direct knowledge of how the process works and fails. PFMEA quality depends on the diversity of knowledge in the room.
Create or review the process flow diagram. Map every step in sequence. This becomes the backbone of the PFMEA worksheet and ensures no step is overlooked.
Identify potential failure modes. For each process step, ask: in what ways could this step fail to deliver its intended output? A single step can have multiple failure modes. Common examples include incorrect torque, contaminated material, wrong sequence, or tool wear beyond specification.
Determine the effects of each failure mode. Describe what the customer (internal or external) would experience if the failure occurred. Effects can range from minor cosmetic defects to safety incidents or regulatory violations.
Identify root causes. For each failure mode, determine the underlying cause: machine variability, operator error, tooling condition, incoming material quality, environmental factors, or a combination. This step links PFMEA directly to root cause analysis methods such as 5-Why and fishbone diagrams.
List existing process controls. Document current prevention controls (actions that reduce the chance of a failure mode occurring) and detection controls (actions that identify a failure before it reaches the customer). Examples include SPC charts, visual inspections, error-proofing devices, and automated in-process checks.
Calculate the Risk Priority Number (RPN). Score each failure mode on Severity (S), Occurrence (O), and Detection (D) using a 1-10 scale, then multiply: RPN = S x O x D. The highest RPNs identify where action is most urgently needed.
Develop and assign corrective actions. For high-RPN items, define specific actions that will reduce Severity, Occurrence, or Detection ratings. Assign an owner and a target completion date for each action.
Reassess and update the RPN. After actions are implemented, re-score each affected failure mode to confirm the RPN has decreased to an acceptable level. Document the new scores and keep the PFMEA as a living document.

Key PFMEA Components: S, O, D, and RPN

The scoring system is the analytical core of PFMEA. Understanding what each rating measures prevents the most common scoring errors.

Severity (S)

Severity rates the impact of the failure effect on the customer or on downstream processes. A score of 1 means no noticeable effect; a score of 10 means a safety or regulatory impact with no warning. Severity is a property of the effect, not the cause: it does not change based on how likely the failure is or how easy it is to detect. Only a design change that eliminates the hazardous effect can lower the Severity rating.

Occurrence (O)

Occurrence rates how frequently the failure cause is expected to produce the failure mode. A score of 1 means the cause is extremely unlikely (fewer than 1 in 1,000,000 opportunities); a score of 10 means failure is almost certain. Occurrence ratings should be based on historical data, field returns, or process capability studies where possible. Reducing Occurrence requires changes to the process, tooling, or incoming materials that address the root cause directly.

Detection (D)

Detection rates the ability of current controls to identify the failure mode or its cause before the product reaches the customer. A score of 1 means the control almost certainly detects the failure; a score of 10 means the failure cannot be detected. Detection ratings improve when error-proofing devices, automated sensors, or in-process inspection steps are added. A high Detection score is a strong signal that the control plan needs upgrading.

Risk Priority Number (RPN)

RPN = Severity x Occurrence x Detection. The result ranges from 1 to 1,000. Teams use RPN to rank failure modes and focus resources on the highest-risk items first. However, RPN should not be used in isolation: a failure mode with a Severity of 10 should receive attention even if the overall RPN is moderate, because the potential consequence of that failure is severe. The AIAG-VDA FMEA handbook (2019) introduced Action Priority (AP) ratings as a complement to RPN, placing greater weight on Severity.

PFMEA and Maintenance Planning

PFMEA outputs directly inform maintenance strategy. Failure modes with high Occurrence ratings caused by equipment degradation (tool wear, bearing fatigue, seal deterioration) become the foundation for preventive maintenance task lists and inspection intervals.

Where Occurrence is driven by unpredictable degradation, teams often escalate from time-based prevention to condition-based maintenance, using sensor data to detect drift before a failure mode is triggered. This connection between PFMEA and maintenance engineering is why reliability engineers treat the PFMEA as an input to the reliability-centered maintenance process, not a separate quality document.

High Detection scores (meaning current controls rarely catch the failure) are also a trigger for adding predictive maintenance capabilities: continuous monitoring that closes the detection gap without relying on periodic manual checks.

PFMEA and Criticality Analysis

PFMEA produces an implicit criticality analysis through its RPN and Severity scores. Assets and process steps with the highest combined scores represent the criticality tier that should receive first priority for investment in controls, spare parts, and maintenance resources.

For teams that need a formal criticality ranking separate from the PFMEA worksheet, the RPN data serves as a ready-made input to an asset criticality matrix, reducing duplicate analytical effort.

When to Use PFMEA

PFMEA delivers the most value in these situations:

New process launch. Before a new production line or process is released to full volume, PFMEA identifies risks while changes are still low-cost to implement.
Process modification. Any change to equipment, tooling, raw materials, or process parameters should trigger a review of the relevant PFMEA sections to assess new risks introduced by the change.
Recurring quality escapes. When a failure mode or defect type recurs despite existing controls, PFMEA re-evaluation identifies whether the root cause was missed or whether controls have degraded.
New equipment introduction. New machinery brings new failure mechanisms. A PFMEA review at commissioning captures these before they affect production quality.
Annual process review. Processes change gradually through operator turnover, tooling wear, and supplier shifts. An annual PFMEA review catches control gaps that have opened over time.
Regulatory or customer audit preparation. IATF 16949, ISO 9001 advanced quality planning, and APQP processes require current PFMEA documentation as evidence of proactive risk management.

Common PFMEA Mistakes to Avoid

Teams that conduct PFMEA for the first time often fall into patterns that reduce its effectiveness.

Treating it as a documentation exercise. A PFMEA that is written to satisfy an audit rather than to drive action produces no improvement. Actions must be assigned, tracked, and verified.
Scoring without data. Occurrence ratings based on gut feel rather than historical defect data produce inflated or deflated RPNs that misdirect resources.
Ignoring high-Severity items with low RPN. A Severity-10 failure mode with moderate Occurrence and Detection can yield an acceptable-looking RPN while carrying catastrophic risk. Severity-10 items should always receive a review regardless of RPN.
Never updating the document. A PFMEA that was accurate at launch becomes misleading as the process evolves. Living documentation is a requirement, not an ideal.
Excluding operators. Operators know how a process actually runs versus how it was designed to run. Their input on real failure modes and current detection gaps is irreplaceable.

PFMEA in the Context of Broader Failure Analysis

PFMEA sits within a family of analytical tools that organizations use to understand and prevent failures. A full failure analysis program typically combines PFMEA (proactive, process-level) with reactive investigation tools such as root cause analysis (applied after a failure has occurred) and maintenance engineering reviews (focused on asset degradation patterns over time).

Used together, these tools form a closed loop: PFMEA anticipates failures, root cause analysis explains failures that do occur, and maintenance engineering translates both into sustainable asset management strategies.

The Bottom Line

PFMEA is one of the most practical risk management tools available to manufacturing and reliability teams. It forces a structured conversation about where a process can fail, how bad the consequences would be, and whether current controls are good enough. When conducted rigorously by a team that includes operators, engineers, and maintenance personnel, PFMEA produces a prioritized action list that prevents defects, reduces unplanned downtime, and builds a defensible control plan.

The real value of PFMEA is not the document itself but the disciplined thinking it requires. Teams that update their PFMEA as processes evolve, use real failure data to set Occurrence ratings, and act on high-Severity findings regardless of RPN consistently outperform teams that treat PFMEA as a one-time compliance task.

Reduce Process Risk Before It Becomes Downtime

Tractian connects PFMEA-identified failure modes to real-time asset monitoring, so your highest-risk process steps are the first ones covered by continuous sensor data and automated alerts.

See How Tractian Works

Frequently Asked Questions

What does PFMEA stand for?

PFMEA stands for Process Failure Mode and Effects Analysis. It is a structured, team-based method for identifying potential failure modes within a manufacturing or service process, evaluating their effects, and prioritizing corrective actions before those failures reach the customer or disrupt production.

How is PFMEA different from DFMEA?

DFMEA (Design Failure Mode and Effects Analysis) examines potential failures in a product's design before manufacturing begins. PFMEA examines potential failures in the production process itself: how a process step could fail, what effect that failure has on the output, and whether existing controls can detect it. The two analyses are complementary and are typically both required in automotive and aerospace quality programs.

What is a good RPN score in PFMEA?

There is no universal threshold, and relying on a single RPN cutoff can be misleading. A common guideline is to prioritize action on any RPN above 100-125, but any failure mode with a Severity rating of 9 or 10 warrants immediate review regardless of the total RPN. The AIAG-VDA FMEA handbook recommends using Action Priority (High, Medium, Low) alongside RPN to ensure that catastrophic-severity items are never deprioritized by moderate Occurrence or Detection scores.

When should a PFMEA be updated?

A PFMEA should be updated whenever a process change is introduced, a new failure mode is discovered in production, corrective actions have been implemented and verified, new equipment or tooling is added, or the annual process review cycle is complete. PFMEA is a living document: a version that reflects last year's process is a liability, not an asset, in both quality management and regulatory contexts.