UNCOVERING AN OFTEN UNSEEN ASPECT OF MTBF
Last Mile Consultants

UNCOVERING AN OFTEN UNSEEN ASPECT OF MTBF

Mean time between failures (MTBF) in the world of maintenance is quite often misunderstood and taken for granted as just another maintenance key performance indicator (KPI). Many articles have been published and books written on the subject of MTBF by scholars which apply mostly to products which generally are individual components e.g. electronic gadgets, bearings etc. and do not contain much detail on the MTBF of machinery/equipment and systems comprised of multiple components operating as one for the production of a/any product. Successful operation of such is very dependent on maintenance and MTBF is used as one of the KPI for maintenance. However, the high impact that MTBF has on production compared to other KPIs is often overlooked or unseen, because the relationship between them is quite” subtle.” Thus, the purpose of this article is to logically and statistically uncover this relationship and to clearly illustrate how it affects production either adversely or favorably.

Quite normally production/operation teams view MTBF as maintenance KPI while maintenance teams view it as just another lesser KPI besides Availability. And that is one of the main reasons why both teams often fail to see how it affects production. Most times in places I’ve worked over the years, maintenance teams are so concerned about maintaining high equipment availabilities and not so much on their MTBFs. The common notion towards MTBF is that, it’s just another statistic that reliability engineers/analysts use to draw pretty graphs to make their monthly reports look good. I believe, this mentality may have changed over the years in some places but not everywhere.

And I can understand why, simply because usually the management “dubs” it as their main KPI and measures their performance by it. Thus, they do everything to achieve high availabilities and most times end up doing “band aid/patch up” repairs; postpone schedule PM’s; major component change outs; fudge figures etc. then before you know it, the cycle of recurring failures/breakdowns begins. Resulting in maintenance being very reactive, high maintenance costs, high production losses, poor OHS, low MTBF’s and ultimately low availabilities. And as a result the maintenance superintendents or even managers get hammered in management meetings. And they wonder what went wrong & how they ended up like this. So, every time the search for a scapegoat occurs, giving rise to the age old enmity between maintenance & operations teams. One starts blaming the other and the other vice versa for any major equipment downtime, thus collaboration across departments is thrown out the window resulting in low productivity. I like to call that whole scenario the “Illusion of Maintenance.”

It is about time management focuses or places emphasis on achieving high MTBF’s and not just high availabilities. Why? Because having high availabilities doesn’t necessarily mean that you will have a high or increased production but having a high MTBF will. This is due to the fact that it directly affects effective utilization (which will be shown statically in the latter section of this article) and effective utilization is production. In addition, having high MTBFs will also definitely give rise to high availabilities. On the contrary, while we are too busy maintaining high availabilities, we have low MTBF’s and ultimately low production/high production losses (hours) or disruptions. And here’s how, it is a bit elusive but it is factual and does happen all the time.

Having a low MTBF means the number/frequency of breakdowns (regardless of the amount of repair time) for equipment has increased during effective utilization or production. That seems quite straight forward but here’s the “subtle” part: Each breakdown is accompanied with a delay factor/duration, besides the time taken to repair (MTTR). These are the types of delays;

1. Human Nature Delay Factor (Work Ethic) 

  • Operators decides to use the lavatory whilst/even after repair is done 
  • Or operator decides to have early lunch/snack whilst/even after repair is done
  • Operator psychologically losses focus of the job (takes time bounce back into the mode) 
  • And if failure occurs the 2nd & 3rd time then he becomes psychologically switched off and grows tired

2. Work Environment Delay Factor

  • If the production area is situated in the open i.e. exposed to the weather.
  • Or underground or even in confined spaces i.e. hot, humid, wet etc.

The environmental factors also set in, adversely affecting both operator & equipment.

These are a few examples of the 2 delay factors that I have seen happen, there may be more. So we can say that the repair time/duration can be controlled and brought to a minimum as humanly possible. However, as for the delays caused by human nature and environmental factors, they cannot always be controlled and are unpredictable. These factors could have been insignificant and would not have set in to add up to the total delay time i.e. production lost time, if the equipment did not breakdown and/or do so recurringly during production.

Thus, returning to the subject, I disagree when MTBF is viewed has just another statistic or KPI. It really depends on how one defines it, especially in the world of maintenance. If one defines it as Effective Utilization ÷ No. of Unplanned Maintenance done in a period. Then one would realize that it is directly proportional to production. Thus, I like to call it the “Corner Stone of Production.” I understand that it is not a RCM method that can be directly applied to improve equipment & component performance, etc. However, if we (maintenance team) concentrate our efforts via RCM methods to increase MTBF rather than just concentrating on increasing availabilities and reducing MTTR, then we will notice a significant increase in production.

Several years back when working for a gold mining company as the reliability engineer for the underground mine mobile fleet. We defined mobile equipment MTBF as per the equation below;

 MTBF = (Effective Equip Utilization Hours) ÷ (No. of Unplanned Maintenance)

●    Effective Utilization = Availability - (OPs Delay + OPs Standby) = Production

●    Unplanned Maintenance = Mechanical Breakdowns + Accident Damages (i.e. Inadequate design protection in accident prone working areas. Acts of God & Operator Errors are excluded)

Thus, basing on the above definition of MTBF, I used statistical analysis to illustrate the “subtle relationship” and stress the impact it had on production. The calculations below show the proportional increase in production as a result of a slight increase in equipment MTBF. This is a flip side to a slight decrease in equipment MTBF where the 2 delay factors described previously come into play/take effect. The adverse effect caused can even double because an operator’s work ethic varies from person to person.

This is data from Jumbo Drill Fleet graphs (actual data captured via dispatch modular software) where the performance of the fleet in the month of Jan 2012 was compared with Jan 2013;

-      In 2012 Jan MTBF = 8.64 hrs (9 times blown hydraulic hose per 22 hrs on avg.) while in 2013 Jan MTBF = 10.36 (6 times blown hydraulic hose per 22 hrs on avg.). There was a 1.72 hrs i.e. 20% increase in MTBF for each Jumbo in the fleet of 5 Jumbos and,

-      In 2012 Jan Eff. Utilization = 1675.8 hrs while in 2013 Jan Eff. Utilization = 2143.5 hrs equating to a 467.7 hrs i.e. 28% increase in the total utilization hours for the whole fleet

Therefore, as can be seen from the comparison and calculation above, a small scale increase in each equipment’s MTBF in a fleet will have a consequential large scale proportional increase in the fleet’s Utilization and also Availability. It is very difficult to mathematically show this relationship i.e. the consequential increase; it can only be explained through logical reasoning i.e. the *2 delay factors. Finally, this will only be true on these conditions;

-      If the fleet number remains the same (i.e. 5 Jumbos in this case),

-      If Operation demand/need for utilization remains the same and

-      If MTTR remains at a minimum/constant. (experienced/skilled labor, tooling & parts availability)

*Since the failure rate has been reduced from 9 to 6 times in every 22 hours of operation, the frequency of opportunity/window for the 2 delay factors to come into play have also been reduced and “unnecessary” lost production hours besides repair time/hours was saved.

Furthermore, an estimation of tons/ounces as a result of the 20% increases in MTBF on average for each Jumbo. From previous statistics on TTM & Total Ore Movement, the amount of ore produced by the Jumbo fleet/hour on average was 26 tons. So using that, the increase was calculated as below;

­  - 467.7 hrs x 26 tons(ore)/hr x 2.5 grams/t x 86.2 % recovery x 0.035Oz/g x US$1683.05/Oz = US$ 154,366,499.12

*Note: The gold price used was the current price in Jan 2013

Thus, due that comparatively small MTBF increase, an additional amount of ore tonnage was produced resulting in additional ounces of gold and finally its monetary value in USD;

­      - 12,160.2 tons (ore)

­      - 917.18 Oz

­      - US$ 154,366,499.12

And finally these calculations for increased can only materialize on the condition that all the tons (ore) get processed through Crushing, Grinding and Refinery Mills.

To conclude, since we have uncovered the subtle relationship between MTBF and Production. We can now clearly see that the value of MTBF as a maintenance KPI for operating/producing systems, equipment and components is far greater than Availability. And as a matter of fact, MTBF is not just a KPI; it is Reliability which is a positive innate ability of a system, equipment or component. Thus, it has the greatest impact on operation when it matters i.e. during production. It is the link between maintenance and operation departments, Availability and Utilization, a common ground that effectively fosters collaboration and cross departmental projects.

Thus, I would like to reinforce what I stated earlier and that is; more emphasis/priority should be placed on achieving high MTBFs and not just high Availabilities, by the management teams of any organizations that depend on maintenance to produce. Moreover, both maintenance & operation teams within those organizations should work in collaboration towards achieving that. Because at the end of the day, all teams within those organizations contribute towards one ultimate goal and i.e. continuous high production.

Sanam Bp

Sales Manager at FluidTech Co.

6y

Great article.

Benson Palijah

Chairman/CEO @ Pandtec International Limited | Oil & Gas Industry

6y

Elaborate explanation. For those who give more management focus on MTBF should benefit in the long run. Most often you find complacency by operators and maintenance personnel that invigorate this factors unknowingly. Education and training towards more focus on MTBF would be good business practice in this area.

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics