Please use this identifier to cite or link to this item:
https://dspace.uzhnu.edu.ua/jspui/handle/lib/66547
Title: | Comparison Analysis of the Pearson’s Phi-Square Test and Correlation Metric Effectiveness to Form the Subset of Differently Expressed and Mutually Correlated Genes |
Authors: | Yasinska-Damri, Lyudmyla Babichev, Sergii Liakh, Igor |
Keywords: | Gene expression profiles, correlation metric, Pearson’s chi-square test, gene expression profiles classification, classification quality criteria, Gene expression profiles, correlation metric, Pearson’s chi-square test, gene expression profiles classification, classification quality criteria |
Issue Date: | 25-Mar-2022 |
Publisher: | International Workshop on Intelligent Information Technologies & Systems of Information Security |
Citation: | The development of patients' health monitoring systems based on gene expression data is a very important direction of current bioinformatics. In this instance, the allocation of both differently expressed and mutually correlated gene expression profiles (GEP) which allow monitoring in real-time the patients' health with high accuracy is a very important step of this problem solution. There are various types of similarity metrics to identify the level of GEP proximity. In this research, we compare the Pearson chi-square test and correlation metric to evaluate the gene expression profiles proximity. The evaluation of appropriate metric effectiveness has been executed by applying the object's classification quality criteria such as accuracy, f-score and Matthews correlation coefficient (MCC). The simulation results have shown that the metric based on Pearson’s phi-square coefficient is significantly effective in comparison with the correlation metric to allocate the mutually similar gene expression profiles and, this metric can be used when the differently expressed and mutually correlated GEP will be extracted using various clustering algorithms. Keywords 1 Gene expression profiles, correlation metric, Pearson’s chi-square test, gene expression profiles classification, classification quality criteria |
Series/Report no.: | Computer Science, Biology;3156 |
Abstract: | The development of patients' health monitoring systems based on gene expression data is a very important direction of current bioinformatics. In this instance, the allocation of both differently expressed and mutually correlated gene expression profiles (GEP) which allow monitoring in real-time the patients' health with high accuracy is a very important step of this problem solution. There are various types of similarity metrics to identify the level of GEP proximity. In this research, we compare the Pearson chi-square test and correlation metric to evaluate the gene expression profiles proximity. The evaluation of appropriate metric effectiveness has been executed by applying the object's classification quality criteria such as accuracy, f-score and Matthews correlation coefficient (MCC). The simulation results have shown that the metric based on Pearson’s phi-square coefficient is significantly effective in comparison with the correlation metric to allocate the mutually similar gene expression profiles and, this metric can be used when the differently expressed and mutually correlated GEP will be extracted using various clustering algorithms. |
Description: | The development of patients' health monitoring systems based on gene expression data is a very important direction of current bioinformatics. In this instance, the allocation of both differently expressed and mutually correlated gene expression profiles (GEP) which allow monitoring in real-time the patients' health with high accuracy is a very important step of this problem solution. There are various types of similarity metrics to identify the level of GEP proximity. In this research, we compare the Pearson chi-square test and correlation metric to evaluate the gene expression profiles proximity. The evaluation of appropriate metric effectiveness has been executed by applying the object's classification quality criteria such as accuracy, f-score and Matthews correlation coefficient (MCC). The simulation results have shown that the metric based on Pearson’s phi-square coefficient is significantly effective in comparison with the correlation metric to allocate the mutually similar gene expression profiles and, this metric can be used when the differently expressed and mutually correlated GEP will be extracted using various clustering algorithms. |
Type: | Text |
Publication type: | Тези до статті |
URI: | https://dspace.uzhnu.edu.ua/jspui/handle/lib/66547 |
Appears in Collections: | Наукові публікації кафедри інформатики та фізико-математичних дисциплін |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
paper4.pdf | Stattja | 950.05 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.