Evolutionary Induction of Decision Tree Classifier Ensembles Using Class Density Structures

Inese Poļaka

Evolutionary Induction of Decision Tree Classifier Ensembles Using Class Density Structures

2014
Inese Poļaka

Defending
19.05.2014. 14:30, DITF, Meža ielā 1, 3.korpusā, 202.auditorijā

Supervisor
Arkādijs Borisovs

Reviewers
Zigurds Markovičs, Jānis Zuters, Aleksandr Božeņuk

The problem analyzed in the thesis is biomedical diagnostics. Its specifics is several thousand biological indicators of a patient status (genes, proteins and antibodies) that have to be analyzed simultaneously in order to find disease markers. This problem is formalized in the thesis as data mining classification task, in which patient status is described by vectors made up of biological indicator values and the diagnosis of the patient is the class label. A methodology is developed to solve the defined task, using two methods developed for the solution of this task – class decomposition and a hybrid classification method that is based on genetic algorithms and decision tree classifier ensembles. Class decomposition allows improving classification accuracy by describing the inner structure properties of the data and using the description in classification. The classification method that is based on genetic algorithms and decision tree classifier ensembles and that uses Random subspace method allows finding quasi-optimal and easily interpretable classifier ensembles that consist only of the most informative attributes and their relationships. The thesis is arranged so that confirming the initial hypotheses step-by-step it proves the efficacy of the developed methodology. As a result, the use of a smaller biomarker panel that is acquired due to the built-in feature selection of the developed method is justified, the usefulness of class decomposition application is proved, the accuracy of the developed classification method is confirmed and the advantages of using the developed methodology for the analysis of biomedical data are shown.

Keywords
Bioinformatics, data mining, classification, class decomposition

Poļaka, Inese. Evolutionary Induction of Decision Tree Classifier Ensembles Using Class Density Structures. PhD Thesis. Rīga: [RTU], 2014. 141 p.

Publication language
Latvian (lv)

Publication Type
Doctoral Thesis
Funding for basic activity
Unknown
Field of research
2. Engineering and technology
Sub-field of research
2.2 Electrical engineering, Electronic engineering, Information and communication engineering
ID: 18214