Fuzzy Classification Methodology for Processing and Analyzing Bioinformatics Data

Madara Gasparoviča-Asīte

Fuzzy Classification Methodology for Processing and Analyzing Bioinformatics Data

2015
Madara Gasparoviča-Asīte

Defending
01.02.2016. 14:30, Rīgas Tehniskās universitātes Datorzinātnes un informācijas tehnoloģijas fakultātē, Sētas ielā 1, 202. auditorijā

Supervisor
Ludmila Aleksejeva

Reviewers
Zigurds Markovičs, Peteris Grabusts, Vytautas Čyras

The main aim of Thesis is to develop a fuzzy classification methodology for the use in processing and analysis of bioinformatics data. The use of fuzzy algorithms is suggested for the task of bioinformatics data analysis due to their use of linguistic terms that are easily comprehensible for humans. The task of bioinformatics-based diagnostics approached in Thesis is formalized as a classification task in data mining, where one has access to a predefined class (diagnosis) for each observed case. The algorithms and methods to be included in any step of the methodology are determined experimentally by testing on twenty five actual bioinformatics data sets. The study focused on investigating different methods and evaluating for suitability for bioinformatics data. The methodology steps that needed improvement were identified in experimental examination. Therefore in this Thesis are also proposed a membership function construction method that is based on clustering approach, a modification of the accepted classification algorithm FuzzyBexa by introducing the principle of rule-stretching that is used in FURIA as well as a novel rule fuzzification method that applies information from the constructed membership functions. The experimental validation of the fuzzy classification system that implements the proposed methodology proves its efficacy in the bioinformatics classification task. The developed methodology can be easily adapted for other fuzzy classification algorithms. The methodology can also be applied to other areas with the necessity to process and analyze similar data structures (tens of thousands of attributes and comparatively few, approximately hundred, records). The work contains 160 p. (include appendixes), 31 tables, 47 figures,6 appendixes and 133 referances

Keywords
Bioinformatics, data mining, classification, fuzzy logic

Gasparoviča-Asīte, Madara. Fuzzy Classification Methodology for Processing and Analyzing Bioinformatics Data. PhD Thesis. Rīga: [RTU], 2015. 160 p.

Publication language
Latvian (lv)

Publication Type
Doctoral Thesis
Funding for basic activity
Unknown
Field of research
2. Engineering and technology
Sub-field of research
2.2 Electrical engineering, Electronic engineering, Information and communication engineering
ID: 21548