How to Evaluate and Champion Condition Monitoring as a Maintenance Manager
The maintenance manager's role in a technology selection is not just technical evaluation. You are doing two jobs simultaneously: identifying the right solution for your specific plant, and building the internal case for your recommendation so it gets approved without being sent back for another review cycle.
Most maintenance managers are confident on the first job. The second is where recommendations stall. A technically sound selection that is not packaged for a Plant Manager's decision criteria gets deferred. A technically sound selection presented as a risk reduction investment with a measurable return gets approved.
This guide covers both: the technical must-haves for condition monitoring in discrete manufacturing, and the internal case structure that moves from your recommendation to a signed approval.
- What Most Maintenance Managers Get Wrong About Technology Selection
- The Four Technical Must-Haves
- What to Test During Evaluation
- The Shortlist Decision Framework
- What Questions Your Plant Manager Will Ask
- How to Structure Your Recommendation
- The Pilot Proposal That Reduces the Risk Objection
- How Tractian Is Built for Discrete Manufacturing
What Most Maintenance Managers Get Wrong About Technology Selection
Evaluating features instead of outcomes. Platform comparison by feature count misses the question that matters: will this tool catch the failure modes that are costing us production value, on the specific assets in our plant, fast enough to schedule a repair in the next changeover window? That is an outcomes question, not a feature question.
Leading the internal case with technical specifications. Your Plant Manager does not need to understand vibration spectrum analysis to approve the investment. They need to understand what the investment protects and what it costs. A recommendation framed as "this platform uses continuous spectral analysis with 4G connectivity and IP69K-rated sensors" goes to the engineering team for review. A recommendation framed as "this protects $[X] in annual production risk on our five highest-cost assets at a cost of $[Y] per year" gets approved or declined in the meeting.
Not anticipating the objections. There are four questions your Plant Manager will ask. If you arrive with specific, well-sourced answers to all four, you get a decision. If you answer them reactively and partially, you get a request for more information and another meeting.
Selecting for the average case. A platform designed for a process manufacturing plant (continuous production, lower-frequency monitoring, analyst-reviewed data) is a different product from one designed for a discrete manufacturing floor (high-cycle assets, fast degradation rates, need for non-analyst-ready alerts). Verify that the platform has references from your sub-sector: auto parts, appliances, electronics, consumer goods. A reference from a chemical plant is not relevant evidence.
The Four Technical Must-Haves
These four requirements are the filter. A platform that does not meet all four is not suitable for a discrete manufacturing environment. They are non-negotiable because each addresses a failure mode that affects whether the tool actually works in your context.
1. Continuous Vibration Spectrum, Not Periodic Spot-Checks
High-cycle assets in discrete manufacturing, stamping press motors, assembly conveyor drives, CNC spindle motors, progress from early-stage bearing degradation to failure-critical in two to four weeks. A monitoring tool that takes weekly or monthly spot-check measurements creates a detection window long enough for a condition-based failure to develop and reach the production floor undetected.
Continuous monitoring detects the degradation pattern as it develops. The alert arrives while there is still time to stage the repair in the next changeover window rather than responding to a production emergency. This is the core mechanism by which predictive maintenance prevents unplanned events. Anything less than continuous is periodic inspection with better data, not predictive maintenance.
2. 4G/LTE Connectivity, Not Wi-Fi Dependency
Manufacturing floor environments introduce connectivity challenges that eliminate Wi-Fi-dependent solutions in practice. Metal enclosures, RF interference from variable-frequency drives and welding equipment, and the physical layout of large press rooms all degrade Wi-Fi reliability. A monitoring platform that depends on a stable Wi-Fi connection will have data gaps precisely in the environments where the critical assets are located.
4G/LTE cellular connectivity is independent of your plant network infrastructure. It works in the press room, in the paint shop, in enclosed gearbox enclosures. It does not require IT coordination to provision, does not go down when the plant Wi-Fi is reconfigured, and does not create a maintenance burden of its own.
3. IP69K Environmental Protection
Manufacturing floors are not clean environments. Lubricant mist, coolant spray, metal particulate, high-pressure wash-down cycles in food and beverage adjacent lines, and temperature cycling from process equipment all affect sensor reliability. A sensor rated for office or light industrial environments will fail or deliver unreliable data in a manufacturing floor context.
IP69K is the standard for high-pressure, high-temperature wash-down environments. A sensor rated to IP69K will operate reliably in any discrete manufacturing environment. This is not an edge-case requirement; it is baseline durability for the environment where the sensor needs to work.
4. Interpreted Alerts That Require No Reliability Analyst
The most common failure mode for condition monitoring implementation is not the technology. It is the workflow: sensors generate data, data requires interpretation, no one on the maintenance team has a vibration analysis background, raw data sits unreviewed, and a failure event occurs that the data would have predicted. The program is technically running and operationally invisible.
Interpreted alerts change this entirely. An alert that says "Bearing fault developing on Assembly Conveyor 3 east drive motor, severity: moderate, recommend service within 14 days" is actionable by a maintenance planner. An alert that delivers a raw FFT spectrum and a fault frequency table is not.
Verify during evaluation that the platform's alerts specify the asset, the component (not just the asset), the failure mode, the severity, and the recommended action window. Ask for 10 representative alerts from a comparable manufacturing site. If those alerts require explanation to understand, they will not be acted on by a maintenance team without vibration analysts.
What to Test During Evaluation
Before finalizing your shortlist, run a structured evaluation on each candidate platform. Focus on three areas:
Alert specificity and actionability. Request a sample of 10 recent alerts from comparable manufacturing sites. For each: could a maintenance planner without vibration analysis training read it and create an accurate work order? If yes, the alert quality is sufficient. If no, the platform requires analyst support to deliver value.
Time-to-first-alert. Ask how long from sensor installation to first valid alert on a healthy asset. The platform needs to establish a vibration baseline before it can identify departures. A platform that takes three months to baseline is not providing value during that window. Ask for the baseline timeline and what visibility you have during the baselining period.
Reference from comparable plants. Request two to three customer references from discrete manufacturing plants in your sub-sector: auto parts, appliances, electronics, consumer goods. Ask each reference: which specific failure modes has the platform detected, how much lead time did you have before the failure would have occurred, and what would that event have cost. A vendor who cannot provide specific failure mode references from comparable plants is not giving you evidence; they are giving you potential.
The Shortlist Decision Framework
Use this table to evaluate candidates. Fill in your ratings based on the evaluation process above. Any platform scoring below 3 on a must-have criterion is eliminated.
| Criterion | Weight | Platform A | Platform B | Platform C |
|---|---|---|---|---|
| Continuous vibration spectrum | Must-have | Pass/Fail | Pass/Fail | Pass/Fail |
| 4G/LTE connectivity | Must-have | Pass/Fail | Pass/Fail | Pass/Fail |
| IP69K environmental rating | Must-have | Pass/Fail | Pass/Fail | Pass/Fail |
| Interpreted alerts (no analyst required) | Must-have | Pass/Fail | Pass/Fail | Pass/Fail |
| Alert specificity (sample of 10) | High | 1 to 5 | 1 to 5 | 1 to 5 |
| References from comparable plants | High | 1 to 5 | 1 to 5 | 1 to 5 |
| Baseline timeline | Medium | 1 to 5 | 1 to 5 | 1 to 5 |
| CMMS integration | Medium | 1 to 5 | 1 to 5 | 1 to 5 |
| Implementation support | Medium | 1 to 5 | 1 to 5 | 1 to 5 |
Bring this completed table to your Plant Manager along with your recommendation. Showing your evaluation methodology makes the recommendation credible and makes it harder to send back for more work.
What Questions Your Plant Manager Will Ask
Prepare specific answers to these four questions before the meeting. Arriving with them already answered is the difference between a decision and a deferral.
"What does it cost and what does it return?"
Your answer: "The program costs $[X] annually to cover our [N] highest-risk Tier 1 assets. Based on our last 12 months of unplanned downtime data, those assets generated $[Y] in combined production loss, emergency repair premium, and OEM penalty exposure. At 20% failure prevention in year one, that is $[0.2 x Y] in avoided cost against $[X] in program cost. Payback is [N] months."
This answer requires you to have the annual downtime cost calculation built before the meeting. Build it from your work order history.
"Will the team actually use it?"
Your answer: "The platform delivers interpreted alerts that specify the asset, the failure mode, and the recommended action. A maintenance planner creates the work order directly from the alert. No vibration analysis background required. I have reviewed sample alerts from comparable plants and the format is actionable without specialist knowledge."
"What happens if it misses a failure or gives a false alarm?"
Your answer: "The platform does not replace our PM program; it adds continuous monitoring on the gaps between PM service dates where condition-based failures develop. A missed failure means we are no worse than we are today. False alarms are a training cost: the team investigates, finds no fault, and the model refines. Ask the vendor for their false alarm rate by failure mode from comparable sites."
"Can we start small and expand?"
Your answer: Yes, and I am proposing exactly that. See below.
How to Structure Your Recommendation
A recommendation that gets approved has three elements: the financial baseline, the shortlist with rationale, and the phased proposal. Anything more risks being read as a technical argument that needs to go to engineering. Anything less risks being read as an unsupported preference.
The financial baseline (one paragraph): "Over the last 12 months, [N] unplanned events on Tier 1 assets generated $[X] in combined production loss, emergency repair premium, and [OEM penalty exposure / displaced planned work / secondary damage]. The five assets contributing most to that total are [list]. A condition monitoring program targeting those five assets specifically addresses the dominant failure modes: [list from work order history]."
The shortlist with rationale (one page or less): Present the completed evaluation framework. State your recommendation and the two primary reasons: which must-have criteria the recommended platform exceeds versus the alternatives, and the reference evidence from comparable plants.
The phased proposal: Cover the two or three highest-risk assets first. Bounded investment, bounded risk, measurable outcome within 90 days.
The Pilot Proposal That Reduces the Risk Objection
The most common objection to a condition monitoring recommendation from a Plant Manager is not the annual cost. It is the risk of championing something that does not work and having that failure reflected personally.
A pilot proposal addresses this directly. It reduces the initial investment, bounds the risk, and creates a clear success milestone that justifies expansion.
Pilot structure: Cover the three highest-cost Tier 1 assets based on your downtime cost analysis. Run for 90 days. Success criteria: at least one developing fault detected and resolved in a planned window rather than a production event, or three months of baseline data showing no degradation on the covered assets (which is itself valuable information).
Why this framing matters for your career: A pilot you proposed, defined the success criteria for, and managed to a documented outcome is a win regardless of whether the program expands. You identified the risk, proposed the structured test, managed the execution, and documented the result. That is program leadership in the Plant Manager's language.
If the pilot detects and prevents an event, you have documented avoided cost. If the 90 days is clean with no events, you have baseline data and evidence of a healthy asset pool. Either outcome is a defensible result.
MTBF improvement and the run-to-failure snowball: Evaluate whether the platform detects faults early enough in the failure progression to prevent secondary damage. A bearing fault detected at stage 2 severity, weeks before it reaches catastrophic failure, is a planned window repair. The same fault detected at late stage, or at failure, is a run-to-failure snowball: the $50 bearing destroys the $5,000 shaft, which burns out the $50,000 motor. The platform's detection sensitivity at early severity stages is the direct lever on MTBF improvement and on preventing the unbudgeted CapEx events that cascade from late-stage detection.
Auto Diagnosis™, skills gap neutralized: Evaluate whether the platform delivers specific failure mode identification without requiring a trained vibration analyst to interpret the data. Tractian's Auto Diagnosis™ specifies the fault type, component, severity, and recommended action in plain language. The generalist mechanic on the team receives the same diagnostic quality as a 30-year vibration analyst. The skills gap does not affect the reliability program's effectiveness.
The cultural shift from reactive to proactive: Evaluate whether the platform's detection lead time is long enough to consistently schedule repairs during planned maintenance windows rather than emergency callouts. If faults are detected weeks before failure, the team responds to alerts during working hours rather than to breakdowns at 2am on a Saturday. The unplanned overtime budget drops. Team morale improves. Safety risk decreases. The shift from firefighting culture to proactive culture is only possible when the platform provides enough lead time to plan.
ROI documentation for leadership: Evaluate whether the platform produces the record-keeping that supports the budget justification conversation. Every prevented failure should generate a documented record: asset, alert date, fault type, severity at detection, corrective action, estimated consequence avoided. That record is what allows the Maintenance Manager to walk into a quarterly review and say "our team prevented $X in production losses and $Y in emergency repair costs this quarter", and prove it with specific data rather than estimates.
How Tractian Is Built for Discrete Manufacturing
Tractian meets all four technical must-haves: continuous vibration spectrum monitoring, 4G/LTE cellular connectivity independent of plant Wi-Fi, IP69K-rated hardware designed for manufacturing environments, and interpreted alerts specifying the asset, component, failure mode, and recommended action window. No reliability analyst required to act on an alert.
The alert format is designed for maintenance planners, not vibration analysts. When the assembly conveyor east drive motor develops an outer race bearing fault, the alert says: outer race bearing fault, Assembly Conveyor 3 east drive motor, severity moderate, recommend service within 14 days. That creates a work order. It does not require interpretation.
For the internal case: Tractian provides documented outcomes from discrete manufacturing references with named customers, specific assets, confirmed failure modes, and verified cost avoidance figures. That documentation answers the "what does it return?" question with evidence, not projections.
See how Tractian supports maintenance managers in manufacturing
Tractian continuously monitors equipment health in real time, detecting faults early and preventing unplanned downtime.
Explore the PlatformWhat technical requirements matter most for condition monitoring in discrete manufacturing?
Continuous vibration spectrum (not periodic spot-checks), 4G/LTE connectivity (not Wi-Fi dependent), IP69K environmental protection, and interpreted alerts specifying failure mode and recommended action without requiring a reliability analyst. These four requirements separate platforms that work on a manufacturing floor from those that do not.
Why does continuous monitoring matter more than periodic spot-checks for stamping presses and conveyor drives?
High-cycle assets in discrete manufacturing can progress from early bearing degradation to failure-critical in two to four weeks. Periodic spot-checks create detection windows long enough for that progression to reach the production floor undetected. Continuous monitoring closes that window.
What questions will my Plant Manager ask about a condition monitoring recommendation?
Cost and return, team adoption, missed detection or false alarm risk, and phased investment options. Arrive with specific answers to all four. The maintenance manager who answers these proactively gets a decision. The one who answers them reactively gets a request for more information.
How do I make my recommendation credible?
Build it from your plant's actual downtime data. Show your evaluation methodology with the completed comparison framework. Propose a pilot on your highest-risk assets with defined success criteria. All three together make the recommendation hard to send back for more work.
What is the right pilot structure for a first deployment?
Cover the two or three highest-cost Tier 1 assets by downtime history. 90-day timeline. Success criteria defined in advance: one developing fault detected and resolved in a planned window, or 90 days of clean baseline data on covered assets. Both outcomes are defensible and documentable.
How do I handle the "we already have PM" objection?
Your PM program services assets at fixed intervals regardless of actual condition. Condition-based failures develop between service dates and are invisible to interval-based PM. Show one failure from the last 12 months that occurred on an asset recently serviced. They exist in every plant.