Thermal Monitoring
Key Takeaways
- Thermal monitoring detects heat anomalies caused by friction, resistance, or restricted flow long before they escalate into equipment failure.
- The two primary technologies are infrared thermography (periodic, wide-area surveys) and fixed temperature sensors (continuous, point-specific monitoring).
- Key applications include electrical panels, motor bearings, transformers, steam traps, and refractory linings.
- Alert thresholds are asset-specific; for electrical connections a rise of 10 degrees Celsius above similar components in the same panel typically triggers a warning.
- ISO 18436-7 and ASTM E1934 define qualification requirements for thermographers and survey procedures.
- Thermal data integrates with predictive maintenance platforms to automate trend tracking and work order generation.
What Is Thermal Monitoring?
Thermal monitoring is a branch of condition monitoring that uses temperature data to assess the health of industrial equipment. Most mechanical and electrical faults generate excess heat before they cause visible damage or functional failure: a worn bearing increases friction and raises temperature at the housing, a loose electrical connection increases resistance and dissipates energy as heat, and a blocked steam trap traps condensate that cools the downstream line.
By measuring those temperature signatures regularly or continuously, maintenance teams can catch faults in their early stages, plan repairs during scheduled windows, and avoid the production losses and safety risks that accompany sudden failures. Thermal monitoring is applicable across a wide range of industries, from manufacturing and process plants to utilities, data centers, and food production facilities.
How Thermal Monitoring Works
All thermal monitoring methods share a common principle: heat emitted or conducted by equipment surfaces is measured and compared against a known baseline or against similar components in the same installation. Deviations from the baseline indicate a change in the thermal behavior of the asset, which is then investigated as a potential fault.
The physics underpinning thermal monitoring is straightforward. Every object above absolute zero emits infrared radiation. The intensity and wavelength distribution of that radiation depend on the surface temperature and emissivity of the material. Infrared cameras and sensors convert that radiation into temperature readings or visual heat maps that technicians and software systems can interpret.
Infrared Thermography
Infrared thermography uses a thermal imaging camera to capture a two-dimensional map of surface temperatures across an asset or installation. A trained thermographer scans electrical panels, motor housings, pipe runs, or refractory walls and looks for hot spots that deviate from the surrounding thermal pattern. The output is a thermogram: a color-coded image where warmer areas appear in red and orange tones and cooler areas appear in blue and green.
Thermography is particularly effective for wide-area surveys. A single scan of a switchgear room can cover dozens of connection points in minutes, identifying the specific circuit breaker or busbar joint that is running hot. This makes it a highly efficient screening tool when combined with periodic inspection routes.
Fixed Temperature Sensors
Where continuous monitoring is required, fixed sensors provide real-time temperature data without the need for a technician to be present. Common sensor types include thermocouples, resistance temperature detectors (RTDs), and infrared spot sensors mounted at fixed positions on bearing housings, winding insulation, transformer cores, or heat exchangers.
These sensors connect to data acquisition systems or industrial IoT platforms that log readings at set intervals, compare values against thresholds, and trigger alerts when temperatures rise above defined limits. Fixed sensors are well suited to critical rotating equipment such as motors, gearboxes, and compressors where a bearing or winding failure could cause significant production loss.
Data Acquisition and Integration
Modern thermal monitoring systems integrate temperature data from multiple sources into a central platform. Readings from fixed sensors stream continuously, thermogram images are uploaded after survey routes, and the platform applies trend analysis to distinguish gradual degradation from sudden spikes. When a reading crosses a warning or alarm threshold, the system can automatically generate a work order in the connected maintenance management system, assigning the task with the relevant thermal data attached.
This integration with predictive maintenance workflows eliminates the manual step of transferring inspection findings into maintenance planning and reduces the time between fault detection and corrective action.
Key Applications of Thermal Monitoring
Thermal monitoring is applied wherever heat anomalies are a reliable indicator of developing faults. The table below covers the most common applications in industrial settings.
| Asset / System | Fault Detected | Thermal Signature |
|---|---|---|
| Electrical panels and switchgear | Loose connections, overloaded circuits, failing components | Hot spot at connection point; asymmetric heating across phases |
| Motors and generators | Bearing wear, winding insulation breakdown, cooling blockage | Elevated bearing housing temperature; hot winding section |
| Transformers | Core overheating, winding hot spots, cooling system failure | Asymmetric surface temperature; hot top relative to expected gradient |
| Steam traps | Failed open (blowing live steam), failed closed (condensate flooding) | Failed-open: high downstream temperature; failed-closed: cool downstream pipe |
| Bearings and gearboxes | Lubrication failure, overload, fatigue damage | Rising housing temperature above normal operating band |
| Refractory linings (kilns, furnaces) | Brick erosion, hot spots indicating liner breach | Localized external hot spot on shell surface |
| Heat exchangers | Fouling, tube blockage, flow imbalance | Uneven temperature distribution across header or shell |
Temperature Thresholds and Alert Levels
Effective thermal monitoring programs define clear alert levels for each asset class. Thresholds are set relative to a baseline established under normal operating conditions rather than as fixed absolute values, because the same absolute temperature can be normal for one asset type and critical for another.
Electrical Equipment
For electrical connections and switchgear, IEC and NFPA guidance typically structures alert levels around the temperature rise above a reference point (a similar connection in the same panel operating under the same load):
- Temperature rise of 1 to 10 degrees Celsius above reference: monitor, recheck at next survey
- Temperature rise of 11 to 20 degrees Celsius: schedule inspection and tightening within 30 days
- Temperature rise of 21 to 40 degrees Celsius: repair within the next planned maintenance window
- Temperature rise above 40 degrees Celsius: critical, take out of service and repair immediately
Rotating Equipment Bearings
For motor and pump bearings, a common framework uses absolute surface temperature measured at the bearing housing outer race area:
- Below 70 degrees Celsius: normal for most industrial motors
- 70 to 80 degrees Celsius: elevated; investigate lubrication, loading, and alignment
- 80 to 90 degrees Celsius: warning; plan replacement at next opportunity
- Above 90 degrees Celsius: shutdown threshold; risk of lubricant breakdown and catastrophic failure
These values are starting points. Asset-specific baselines established during commissioning should always take precedence over generic thresholds.
Steam Traps
Steam trap assessment compares the upstream and downstream pipe temperatures. A correctly functioning trap shows a significant temperature drop from the upstream side (near steam saturation temperature) to the downstream side (near condensate temperature, typically 60 to 90 degrees Celsius lower). A failed-open trap shows comparable temperatures on both sides. A failed-closed trap shows a cold downstream pipe at or near ambient temperature.
Thermal Monitoring vs Infrared Thermography vs Temperature Sensors
These three terms are sometimes used interchangeably, but they describe different scopes. The table below clarifies the distinctions.
| Dimension | Thermal Monitoring | Infrared Thermography | Temperature Sensors |
|---|---|---|---|
| Scope | Umbrella discipline covering all temperature-based condition monitoring | One technique within thermal monitoring | Hardware component used in thermal monitoring |
| Data format | Time-series trends, heat maps, alert logs | 2D thermogram images with temperature scale | Single-point numeric readings at defined intervals |
| Monitoring mode | Continuous or periodic | Periodic (survey routes) | Continuous (fixed installation) |
| Coverage | Wide: all assets and systems | Wide-area surface scanning; ideal for panels and large structures | Point-specific; one sensor per measurement location |
| Personnel required | Varies by method | Trained thermographer; ISO 18436-7 certification recommended | Installation technician; automated operation after setup |
| Best use case | Program-level strategy | Electrical surveys, refractory inspection, large surface areas | Critical bearings, motor windings, transformer cores |
Integrating Thermal Monitoring with Predictive Maintenance
Thermal monitoring delivers its greatest value when it is embedded in a structured predictive maintenance program rather than used as an ad-hoc inspection tool. Integration works across three levels.
Data Integration
Temperature readings from fixed sensors and thermogram results from survey routes should flow into a single asset health monitoring platform alongside vibration, oil analysis, and ultrasound data. This gives reliability engineers a unified view of each asset's condition rather than siloed readings from separate tools. An asset that shows rising bearing temperature alongside elevated vibration at bearing defect frequencies provides a much stronger case for intervention than either reading alone.
Automated Alerting and Work Order Generation
Modern monitoring platforms apply rule-based or machine-learning alert logic to temperature time series. When a bearing temperature exceeds its warning threshold for three consecutive readings, the system can automatically generate a work order, classify the priority, and attach the temperature trend data. This removes the manual step of translating inspection findings into maintenance tasks and reduces the time between detection and repair.
Failure Mode Targeting
Different failure modes produce distinct thermal signatures, and knowing which signature corresponds to which fault guides the maintenance response. A motor with a steadily rising bearing temperature over two weeks points to progressive lubricant degradation and calls for relubrication or bearing replacement during the next planned window. A motor with a sudden spike in winding temperature points to an electrical fault or overload that may require immediate action. Pairing thermal data with fault tree analysis helps teams move from "the temperature is high" to "this is the most likely cause and the correct corrective action."
Standards and Certifications
Several international standards govern how thermal monitoring programs should be designed, executed, and staffed.
ISO 18436-7
This standard defines the competency requirements for thermographers performing condition monitoring of machines using infrared thermography. It describes four levels of certification, from Level I (basic survey execution) to Level IV (program design and management). Many industrial facilities require Level II certification as a minimum for technicians conducting electrical and mechanical thermal surveys.
ASTM E1934
ASTM E1934 provides a guide for examining electrical and mechanical equipment with infrared thermography. It covers camera setup, environmental conditions that affect readings (ambient temperature, wind, solar loading), emissivity corrections, and image interpretation criteria. Following E1934 procedures ensures that thermograms are comparable across surveys and that hot spots are not missed due to poor technique.
NFPA 70B
NFPA 70B (Recommended Practice for Electrical Equipment Maintenance) includes guidance on infrared inspection intervals for electrical distribution equipment. For most industrial facilities it recommends annual thermographic surveys of all energized electrical connections, with more frequent surveys for high-criticality equipment or systems with a history of problems.
IEC 60068
IEC 60068 covers environmental testing methods, including thermal cycling and temperature endurance tests used to validate equipment ratings. While primarily a product qualification standard, it informs the operating temperature limits that maintenance teams use when setting alarm thresholds for individual assets.
Benefits of Thermal Monitoring
When implemented correctly, thermal monitoring delivers measurable improvements across safety, reliability, and cost dimensions.
Early Fault Detection
The primary benefit is the ability to detect faults in their earliest thermal stage, before they cause damage that is expensive or impossible to reverse. A loose electrical connection identified by a 15-degree-Celsius hot spot can be tightened in minutes. Left undetected, the same connection can arc, damage adjacent conductors, trip a circuit breaker feeding a production line, and potentially cause a fire. The cost ratio between early intervention and failure response is typically 10:1 or greater for electrical faults.
Reduced Unplanned Downtime
Thermal monitoring converts potential unplanned failures into planned maintenance events. A bearing identified as running warm can be scheduled for replacement during a routine shutdown rather than failing mid-shift and forcing an emergency stop. This shift from reactive to planned work reduces the total time equipment is out of service, because planned repairs are shorter and require less expedited parts procurement. Reducing unplanned maintenance is one of the most consistent outcomes reported by facilities that implement thermal monitoring programs.
Safety Improvement
Thermal faults in electrical equipment are a leading cause of industrial fires and arc flash events. Infrared surveys conducted on energized equipment from a safe distance allow technicians to identify serious electrical faults without exposing themselves to contact hazards. Refractory monitoring in high-temperature furnaces and kilns identifies shell hot spots that could indicate liner breach, enabling controlled shutdowns before catastrophic failures occur.
Energy Efficiency
Thermal monitoring also identifies energy waste. A steam trap failing in the open position bleeds live steam continuously, wasting both energy and water treatment chemicals. A study of typical industrial steam systems estimates that 15 to 25 percent of steam traps fail in any given year, and a failed-open trap on a medium-pressure steam line can waste hundreds of thousands of dollars in energy annually if left undetected. Thermal surveys of steam trap populations, conducted with an infrared camera, can identify failed traps in minutes and provide the data needed to prioritize replacements.
Extended Asset Life
Heat is one of the most destructive forces in electrical insulation and mechanical components. Motor winding insulation life follows a well-established rule: operating temperature 10 degrees Celsius above rated maximum halves the expected insulation life. Thermal monitoring that keeps windings within design limits preserves insulation integrity and extends motor service life. Similarly, bearings operating within their temperature design range last significantly longer than those running hot from lubrication or loading problems. This connects directly to improvements in remaining useful life predictions for critical rotating assets.
Building a Thermal Monitoring Program
A thermal monitoring program should be structured around asset criticality rather than applied uniformly across a facility. The steps below reflect common industrial practice.
1. Asset Criticality Ranking
Begin by ranking assets by the consequences of failure: production impact, safety risk, and repair cost. High-criticality assets such as main power transformers, large motors on single-train production lines, and critical steam distribution systems justify continuous temperature sensing. Medium-criticality assets are better served by periodic thermographic surveys on a quarterly or annual cycle.
2. Baseline Establishment
For every monitored asset, record a baseline thermal profile under defined operating conditions: load level, ambient temperature, and time since last startup. This baseline is the reference against which future readings are compared. A new motor installation, a transformer at 80 percent load on a 20-degree-Celsius day, and a steam trap after system warmup each have a characteristic thermal signature that is unique to those conditions.
3. Survey Route Design
For periodic thermography, define standardized survey routes that cover all relevant assets in a logical sequence. Document the camera settings, standoff distances, and angles required for each asset to ensure consistent image acquisition across surveys. Standardized routes also make it easier to train replacement thermographers and to compare images from different survey dates.
4. Threshold and Alert Configuration
Set temperature thresholds for each asset class based on manufacturer specifications, industry standards, and baseline data. Configure the monitoring platform to generate tiered alerts: informational at warning level, requiring acknowledgment at serious level, and triggering automatic work order creation at critical level. Review and update thresholds annually or after any significant change in operating conditions.
5. Corrective Action Integration
Define the corrective action for each alert type before the program goes live. A critical electrical hot spot requires a different response procedure than a warning-level bearing temperature rise. Predefining the response removes ambiguity and speeds up the decision-to-repair cycle. Integration with root cause analysis processes helps ensure that identified faults are fully corrected rather than temporarily addressed, preventing recurrence.
The Bottom Line
Thermal monitoring gives maintenance and reliability teams a reliable early warning system for the faults that cause the most damaging and expensive industrial failures. By measuring temperature anomalies in electrical connections, rotating machinery, steam systems, and high-temperature processes, it converts hidden degradation into visible, actionable data. The combination of periodic infrared thermography for wide-area screening and continuous fixed sensors for critical assets provides coverage across the full asset population. When thermal data is integrated with other condition monitoring streams and connected to maintenance workflows, the result is a measurable reduction in unplanned downtime, lower repair costs, and a safer operating environment. For facilities that rely on electrical distribution, rotating equipment, or steam infrastructure, thermal monitoring is one of the most cost-effective tools available in a predictive maintenance program.
Monitor Asset Temperature Continuously with Tractian
Tractian's condition monitoring platform combines continuous temperature sensing with vibration and ultrasound data, giving your team a complete picture of asset health in one system.
See How Tractian WorksFrequently Asked Questions
What is thermal monitoring in industrial maintenance?
Thermal monitoring is the continuous or periodic measurement of surface and component temperatures on industrial equipment to detect abnormal heat patterns that indicate developing faults. It uses infrared thermography, thermal cameras, and contact or non-contact temperature sensors to identify problems such as overloaded electrical connections, failing bearings, blocked steam traps, and overheating motors before they cause unplanned failures.
What is the difference between thermal monitoring and infrared thermography?
Infrared thermography is one technique within the broader thermal monitoring discipline. It uses a thermal imaging camera to capture two-dimensional heat maps of equipment surfaces, making it particularly effective for surveying electrical panels, switchgear, and refractory linings. Thermal monitoring also includes fixed-point temperature sensors, RTDs, thermocouples, and fiber-optic sensing that provide continuous data streams rather than periodic snapshots. A complete thermal monitoring program typically combines both approaches.
What temperature thresholds trigger action in thermal monitoring?
Alert levels depend on the asset type and baseline. For electrical connections, a temperature rise of 10 degrees Celsius above similar components in the same panel is generally flagged as a warning, 20 to 30 degrees above baseline as serious, and above 40 degrees as critical requiring immediate action. For motor bearings, most programs set a warning at 70 to 80 degrees Celsius surface temperature and a shutdown threshold at 90 to 95 degrees Celsius. For steam traps, a failed-open trap shows a continuous downstream temperature comparable to the steam header, while a failed-closed trap shows ambient temperature on the downstream side.
Which standards govern thermal monitoring programs?
The primary standards are IEC 60068 (environmental testing including thermal), ISO 18436-7 (qualification and assessment of condition monitoring personnel for thermography), and ASTM E1934 (guide for examining electrical and mechanical equipment with infrared thermography). NFPA 70B recommends annual infrared surveys of electrical distribution equipment. Some industries also reference ISO 13373 for vibration condition monitoring, which is often run alongside thermal monitoring in integrated predictive maintenance programs.
Related terms
SCADA (Supervisory Control and Data Acquisition)
SCADA is an industrial control system architecture that monitors and controls distributed physical processes in real time using sensors, RTUs, PLCs, and supervisory software.
Shelf Life
Shelf life is the maximum period a stored part, material, or consumable remains fit for use. Learn how to manage shelf life in MRO storerooms with FIFO, date coding, and CMMS alerts.
Scoping
Scoping defines the boundaries, tasks, materials, labour, access requirements, and acceptance criteria for a maintenance job before execution. Learn why it matters and how to do it right.
Shewhart Cycle
The Shewhart Cycle is a four-stage iterative improvement framework: Plan, Do, Check, Act. Learn its origins, how it differs from PDCA and DMAIC, and how maintenance teams apply it.
Single-Minute Exchange of Dies (SMED)
SMED is a lean methodology for reducing equipment changeover time to under ten minutes by separating, converting, and streamlining internal and external setup activities.