Journal

Authors: Maragoudakis E., Loukis E.
Title: Using Ensemble Random Forests Classifiers for the Extraction and Exploitation of Knowledge on Gas Turbine Blading Faults Identification
Journal: Operational Research Insight (Association of Busness Schools (ABS) List)
Volume: 25
Number: 2
Pages: 80-104
Year: 2012
Publisher: Palgrave Macmillan
To appear: No
Link:
ISI: No
Impact Factor:
File name: OR_Insight_Maragoudakis_Loukis_final.pdf##^^&&518997931.pdf
Abstract: The extraction and exploitation of existing knowledge assets for supporting decision making and increasing the effectiveness of various internal and external interventions is of critical importance for the success of modern organizations. The use of advanced Operational Research based quantitative methods in combination with high capabilities information systems can be very useful for this purpose. In this paper we are investigating the use of Ensemble Random Forests for extracting, codifying and exploiting existing organizational knowledge on gas turbine blading faults identification, in the form of a large number of decision trees (called a ‘forest’); each of them has internal nodes corresponding to various tests on features of signals acquired from the gas turbine and leaf nodes corresponding to classifications to the healthy condition or particular faults. Two heterogeneous kinds of inserting randomness to the development of these forest trees, based on different theoretical assumptions, have been examined (Random Input Forests and Random Combination Forests). Using data from a large power gas turbine the performance of Ensemble Random Forests has been evaluated, and also compared against other machine learning classification methods, such as Neural Networks, Classification and Regression Trees and K-Nearest Neighbor. The Ensemble Random Forests reached a level of 97% in terms of precision and recall in engine condition diagnosis from new signals acquired from the gas turbine, which was higher than the performance of all the other examined classification methods. These results provide some first evidence that Ensemble Random Forest can be an effective tool for the extraction, codification and exploitation of the technological knowledge assets of modern organizations, and contribute significantly to the improvement of organizations’ decision making and interventions in this area. The extraction and exploitation of existing knowledge assets for supporting decision making and increasing the effectiveness of various internal and external interventions is of critical importance for the success of modern organizations. The use of advanced Operational Research based quantitative methods in combination with high capabilities information systems can be very useful for this purpose. In this paper we are investigating the use of Ensemble Random Forests for extracting, codifying and exploiting existing organizational knowledge on gas turbine blading faults identification, in the form of a large number of decision trees (called a ‘forest’); each of them has internal nodes corresponding to various tests on features of signals acquired from the gas turbine and leaf nodes corresponding to classifications to the healthy condition or particular faults. Two heterogeneous kinds of inserting randomness to the development of these forest trees, based on different theoretical assumptions, have been examined (Random Input Forests and Random Combination Forests). Using data from a large power gas turbine the performance of Ensemble Random Forests has been evaluated, and also compared against other machine learning classification methods, such as Neural Networks, Classification and Regression Trees and K-Nearest Neighbor. The Ensemble Random Forests reached a level of 97% in terms of precision and recall in engine condition diagnosis from new signals acquired from the gas turbine, which was higher than the performance of all the other examined classification methods. These results provide some first evidence that Ensemble Random Forest can be an effective tool for the extraction, codification and exploitation of the technological knowledge assets of modern organizations, and contribute significantly to the improvement of organizations’ decision making and interventions in this area.