Skip to main content
Log in

Fault Diagnosis of Planetary Gear Carrier Packs: A Class Imbalance and Multiclass Classification Problem

  • Regular Paper
  • Published:
International Journal of Precision Engineering and Manufacturing Aims and scope Submit manuscript

Abstract

Fault diagnosis plays a key role in monitoring manufactured products for the purpose of quality control. Among the several fault diagnosis approaches, knowledge-based fault diagnosis, which uses signals from sensors and machine learning algorithms instead of a priori information, is widely employed to diagnose the status of products. In this paper, we propose a knowledge-based procedure to establish a fault diagnosis model. The model is aimed to diagnose planetary gear carrier packs, which have many fault types and an unbalanced number of samples in the sample classes, using transmission error. In the procedure, the best feature subset that contains the most important features is selected using two different feature selection processes. Several ensemble algorithms are used during the model training process. The imbalance ratio between classes of samples is addressed. The number of weak learners is automatically determined by a genetic algorithm. Finally, the performance of the proposed procedure is validated by comparison with other models trained without applying the proposed procedure. We observed that it is important to incorporate the class imbalance technique in the training process as it reduces the misclassification of faulty products as normal ones. This reduction is important in production quality control.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Abbreviations

α t :

Weight of the tth weak hypothesis

ADASYN:

Adaptive synthetic

Bagging:

Bootstrap aggregating

C :

Target class

CART:

Classification and regression tree

CI:

Class imbalance

D t(i):

Weight of the ith sample at the tth iteration

ε t :

Error of a hypothesis at the tth iteration

g i :

ith feature

GA:

Genetic algorithm

G_mean:

Geometric mean

h t :

Weak hypothesis at the tth iteration

H :

The final hypothesis

IR:

Imbalance ratio

I(a;b):

Mutual information of two random variables, a and b

k :

Number of classes

m :

Number of samples

μ :

Mean

MIQ:

Mutual information quotient

mRMR:

Min-redundancy and max-relevance

n :

Number of the fault types

N i,j :

Number of ith class samples that are classified as jth class

OVA:

One-versus-all

p(a):

Probability density function of variable, a

RSA:

Revolution synchronous averaging

σ :

Deviation

S :

Objective subset

SBS:

Sequential backward selection

SFS:

Sequential forward selection

SMOTE:

Synthetic minority over-sampling technique

TE:

Transmission error

V I :

Relevance

W I :

Redundancy

x :

Input

y :

Output

References

  1. Leonhardt, S., & Ayoubi, M. (1997). Methods of fault diagnosis. Control Engineering Practice, 5(5), 683–692.

    Article  Google Scholar 

  2. Gao, Z., Cecati, C., & Ding, S. X. (2015). A survey of fault diagnosis and fault-tolerant techniques—part I: Fault diagnosis with model-based and signal-based approaches. IEEE Transactions on Industrial Electronics, 62(6), 3757–3767.

    Article  Google Scholar 

  3. Gao, Z., Cecati, C., & Ding, S. X. (2015). A survey of fault diagnosis and fault-tolerant techniques—part II: Fault diagnosis with knowledge-based and hybrid/active approaches. IEEE Transactions on Industrial Electronics, 62(6), 3768–3774.

    Google Scholar 

  4. Janjarasjitt, S., Ocak, H., & Loparo, K. A. (2008). Bearing condition diagnosis and prognosis using applied nonlinear dynamical analysis of machine vibration signal. Journal of Sound and Vibration, 317(1), 112–126.

    Article  Google Scholar 

  5. Santos, P., Villa, L., Reñones, A., Bustillo, A., & Maudes, J. (2015). An SVM-based solution for fault detection in wind turbines. Sensors, 15(3), 5627–5648.

    Article  Google Scholar 

  6. Zarei, J., Tajeddini, M. A., & Karimi, H. R. (2014). Vibration analysis for bearing fault detection and classification using an intelligent filter. Mechatronics, 24(2), 151–157.

    Article  Google Scholar 

  7. Asr, M. Y., Ettefagh, M. M., Hassannejad, R., & Razavi, S. N. (2017). Diagnosis of combined faults in rotary machinery by non-naive bayesian approach. Mechanical Systems and Signal Processing, 85(15), 56–70.

    Article  Google Scholar 

  8. Li, Y., Li, G., Yang, Y., Liang, X., & Xu, M. (2018). A fault diagnosis scheme for planetary gearboxes using adaptive multi-scale morphology filter and modified hierarchical permutation entropy. Mechanical Systems and Signal Processing, 105(15), 319–337.

    Article  Google Scholar 

  9. Al-Ghamd, A. M., & Mba, D. (2006). A comparative experimental study on the use of acoustic emission and vibration analysis for bearing defect identification and estimation of defect size. Mechanical Systems and Signal Processing, 20(7), 1537–1571.

    Article  Google Scholar 

  10. Elforjani, M., & Mba, D. (2010). Accelerated natural fault diagnosis in slow speed bearings with acoustic emission. Engineering Fracture Mechanics, 77(1), 112–127.

    Article  Google Scholar 

  11. Mba, D., & Rao, R. B. K. N. B. (2006). Development of acoustic emission technology for condition monitoring and diagnosis of rotating machines: Bearings, pumps, gearboxes, engines, and rotating structures. Shock and Vibration Digest, 38(1), 3–16.

    Article  Google Scholar 

  12. Åkerblom, M. (2001). Gear noise and vibration: A literature survey, Volvo Construction Equipment Components AB, Stockholm.

  13. Park, S., Kim, S., & Choi, J. H. (2018). Gear fault diagnosis using transmission error and ensemble empirical mode decomposition. Mechanical Systems and Signal Processing, 108, 262–275.

    Article  Google Scholar 

  14. Venkatasubramanian, V., Rengaswamy, R., Kavuri, S. N., & Yin, K. (2003). A review of process fault detection and diagnosis: Part III: Process history based methods. Computers & Chemical Engineering, 27(3), 327–346.

    Article  Google Scholar 

  15. Henriquez, P., Alonso, J. B., Ferrer, M. A., & Travieso, C. M. (2014). Review of automatic fault diagnosis systems using audio and vibration signals. IEEE Transactions on Systems, Man, and Cybernetics Systems, 44(5), 642–652.

    Article  Google Scholar 

  16. Tang, J., Alelyani, S., & Liu, H. (2014). Feature selection for classification: A review. In Data classification: algorithms and applications, CRC Press, pp. 37–64.

  17. Galar, M., Fernandez, A., Barrenechea, E., Bustince, H., & Herrera, F. (2012). A review on ensembles for the class imbalance problem: Bagging-, boosting-, and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics Part C (Applications and Reviews), 42(4), 463–484.

    Article  Google Scholar 

  18. He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263–1284.

    Article  Google Scholar 

  19. Do, J. G., Lee, H. K., Lee, T. H., PSYLOGIC, INC., Hyundai Motor Co. (2014). Planetary gear carrier pack transmission error inspection device. US Patent, No. US9664592B2.

  20. Aly, M. (2005). Survey on multiclass classification methods. In Neural networks, pp. 1–9.

  21. Rifkin, R., & Klautau, A. (2004). Parallel networks that learn to pronounce english text. Journal of Machine Learning Research, 5, 101–141.

    Google Scholar 

  22. Peng, H. C., Long, F. H., & Ding, C. (2005). Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(8), 1226–1238.

    Article  Google Scholar 

  23. Ding, C., & Peng, H. (2003). Minimum redundancy feature selection from microarray gene expression data. In Proceedings of the 2nd IEEE computational systems bioinformatics conference, pp. 523–528.

  24. Chan, P. K., Fan, W., Prodromidis, A. L., & Stolfo, S. J. (1999). Distributed data mining in credit card fraud detection. IEEE Intelligent Systems and their Applications, 14(6), 67–74.

    Article  Google Scholar 

  25. Dumais, S., Platt, J., Heckerman, D., & Sahami, M. (1998). Inductive learning algorithms and representations for text categorization. In Proceedings of the 7th international conference on information and knowledge management, pp. 148–155.

  26. Kubat, M., Holte, R. C., & Matwin, S. (1998). Machine learning for the detection of oil spills in satellite radar images. Machine Learning, 30(2), 195–215.

    Article  Google Scholar 

  27. Mazurowski, M. A., Habas, P. A., Zurada, J. M., Lo, J. Y., Baker, J. A., & Tourassi, G. D. (2008). Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance. Neural Networks., 21(2), 427–436.

    Article  Google Scholar 

  28. Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16(1), 321–357.

    Article  MATH  Google Scholar 

  29. Haibo, H., Yang, B., Garcia, E. A., & Shutao, L. (2008). ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In Proceedings of the 2008 IEEE international joint conference on neural networks, pp. 1322–1328.

  30. Acharya, U. R., Sudarshan, V. K., Rong, S. Q., Tan, Z., Lim, C. M., Koh, J. E. W., et al. (2017). Automated detection of premature delivery using empirical mode and wavelet packet decomposition techniques with uterine electromyogram signals. Computers in Biology and Medicine, 85, 33–42.

    Article  Google Scholar 

  31. López, V., Fernández, A., García, S., Palade, V., & Herrera, F. (2013). An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics. Information Sciences, 250, 113–141.

    Article  Google Scholar 

  32. Opitz, D., & Maclin, R. (1999). Popular ensemble methods: An empirical study. Journal of Artificial Intelligence Research, 11, 169–198.

    Article  MATH  Google Scholar 

  33. Dietterich, T. G. (2000). Ensemble methods in machine learning. In Proceedings of the international workshop on multiple classifier systems, pp. 1–15.

  34. Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140.

    MATH  MathSciNet  Google Scholar 

  35. Freund, Y., & Schapire, R. E. (1996). Experiments with a new boosting algorithm. In Proceedings of the 13th international conference on international conference on machine learning, pp. 148–156.

  36. Schapire, R. E. (1990). The strength of weak learnability. Machine Learning, 5(2), 197–227.

    Google Scholar 

  37. Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139.

    Article  MathSciNet  MATH  Google Scholar 

  38. Schapire, R. E. (2013). Explaining AdaBoost. In Empirical inference, Springer, pp. 37–52.

  39. Friedman, J., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: A statistical view of boosting. Annals of Statistics, 28(2), 337–407.

    Article  MathSciNet  MATH  Google Scholar 

  40. Ferreira, A., & Figueiredo, M. (2012). Boosting algorithms: A review of methods, theory, and applications. In Ensemble machine learning, Springer, pp. 35–85.

  41. Wolpert, D. H., & Macready, W. G. (1997). No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation, 1(1), 67–82.

    Article  Google Scholar 

  42. Htike, K. K. (2017). Efficient determination of the number of weak learners in adaboost. Journal of Experimental & Theoretical Artificial Intelligence, 29(5), 967–982.

    Article  Google Scholar 

  43. Razavi-Far, R., Zanjani, M., & Saif, M. (2017). An integrated class-imbalance learning scheme for diagnosing bearing defects in induction motors. IEEE Transactions on Industrial Informatics, 13(6), 2758–2769.

    Article  Google Scholar 

  44. Sun, Y. M., Kamel, M. S., & Wang, Y. (2006). Boosting for learning multiple classes with imbalanced class distribution. In Proceeding of the 6th international conference on data mining, pp. 592–602.

  45. Fawcett, T. (2003). ROC graphs: Notes and practical considerations for researchers. Technical report HPL-2003-4, HP Laboratories.

Download references

Acknowledgements

This research was supported by the Chung-Ang University Research Scholarship Grants in 2017. This work was also supported by the Technology Innovation Program (10073196, Development of Prognostics and Quality Improvement Technologies for Enhancing Productivity and Yield Rate of Automobile Components by Minimizing Downtime of Manufacturing Robots/Process Equipment) funded By the Ministry of Trade, industry and Energy (MOTIE, Korea).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hae-Jin Choi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Han, S., Choi, HJ., Choi, SK. et al. Fault Diagnosis of Planetary Gear Carrier Packs: A Class Imbalance and Multiclass Classification Problem. Int. J. Precis. Eng. Manuf. 20, 167–179 (2019). https://doi.org/10.1007/s12541-019-00082-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12541-019-00082-4

Keywords

Navigation