Predicting protein function based on homology
by Malyugin E. V. | Mustafin Z. S. | Pronozin A. Yu. | Genaev M. A. | Afonnikov D. A. | Novosibirsk State University, NSU | Institute of Cytology and Genetics; Kurchatov Genomic Center of the Institute of Cytology and Genetics | Institute of Cytology and Genetics; Kurchatov Genomic Center of the Institute of Cytology and Genetics | Institute of Cytology and Genetics; Kurchatov Genomic Center of the Institute of Cytology and Genetics | Institute of Cytology and Genetics; Kurchatov Genomic Center of the Institute of Cytology and Genetics
Abstract ID: 203
Event: BGRS-abstracts
Sections: [Sym 6] Section “Genomics, genetics and systems biology of plants”

Our work aims to create a gene function annotation method that maintains high accuracy regardless of gene age. We propose an algorithm for gene function annotation that combines a search for similar sequences using the k nearest homologs method, analysis of orthogroups in the OrthoDB database, and filtering of GO terms by sequence similarity using logistic regression. The method was tested on sequences of protein-coding genes from a number of plant genomes, including Arabidopsis thaliana. Gene ages were estimated using the Orthoscape program. We also propose a measure of prediction efficiency (PE) for evaluating the accuracy of gene function prediction methods; it avoids the bias that arises when methods relying on different annotations are compared.
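
The k nearest homologs step and the similarity-based filtering of GO terms can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the value of k, the logistic coefficients, the 0.5 cutoff, and the toy data are all assumptions made for illustration, and the OrthoDB orthogroup analysis is left out. In practice the logistic regression coefficients would be fitted on known annotations rather than fixed by hand.

```python
import math
from collections import defaultdict


def knn_go_candidates(hits, annotations, k=3):
    """Collect GO terms from the k most similar homologs.

    hits: list of (homolog_id, similarity) pairs, similarity in [0, 1].
    annotations: dict mapping homolog_id -> set of GO term IDs.
    Returns a dict mapping each GO term to the highest similarity
    among the k nearest homologs that carry it.
    """
    nearest = sorted(hits, key=lambda hit: hit[1], reverse=True)[:k]
    best = defaultdict(float)
    for homolog_id, similarity in nearest:
        for term in annotations.get(homolog_id, ()):
            best[term] = max(best[term], similarity)
    return dict(best)


def logistic(similarity, intercept=-4.0, slope=8.0):
    """Logistic model with hypothetical, hand-picked coefficients."""
    return 1.0 / (1.0 + math.exp(-(intercept + slope * similarity)))


def filter_terms(candidates, threshold=0.5):
    """Keep GO terms whose modeled probability reaches the threshold."""
    scored = {term: logistic(sim) for term, sim in candidates.items()}
    return {term: p for term, p in scored.items() if p >= threshold}


# Toy data (invented for illustration only).
hits = [("h1", 0.9), ("h2", 0.4), ("h3", 0.8), ("h4", 0.1)]
annotations = {
    "h1": {"GO:0003677"},
    "h2": {"GO:0005634"},
    "h3": {"GO:0003677", "GO:0006355"},
    "h4": {"GO:0016020"},
}

candidates = knn_go_candidates(hits, annotations, k=3)
predicted = filter_terms(candidates)
print(sorted(predicted))
```

With these toy inputs, the fourth homolog is excluded by k = 3, and the term carried only by the low-similarity homolog h2 is dropped by the logistic filter.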