RGFinder: A system for determining semantically related genes using GO graph minimum spanning tree

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Biologists often need to know the set S′ of genes that are the most functionally and semantically related to a given set S of genes. For determining the set S′, most current gene similarity measures overlook the structural dependencies among the Gene Ontology (GO) terms annotating the set S, which may lead to erroneous results. We introduce in this paper a biological search engine called RGFinder that considers the structural dependencies among GO terms by employing the concept of existence dependency. RGFinder assigns a weight to each edge in GO graph to represent the degree of relatedness between the two GO terms connected by the edge. The value of the weight is determined based on the following factors: 1) type of the relation represented by the edge (e.g., an "is-a" relation is assigned a different weight than a "part-of" relation), 2) the functional relationship between the two GO terms connected by the edge, and 3) the string-substring relationship between the names of the two GO terms connected by the edge. RGFinder then constructs a minimum spanning tree of GO graph based on these weights. In the framework of RGFinder, the set S′ is annotated to the GO terms located at the lowest convergences of the subtree of the minimum spanning tree that passes through the GO terms annotating set S. We evaluated RGFinder experimentally and compared it with four gene set enrichment systems. Results showed marked improvement.

Original languageBritish English
Article number6926848
Pages (from-to)23-36
Number of pages14
JournalIEEE Transactions on Nanobioscience
Volume14
Issue number1
DOIs
StatePublished - 1 Jan 2015

Keywords

  • Biological search engine
  • Gene Ontology
  • GO term
  • related genes
  • related terms
  • semantic similarity

Fingerprint

Dive into the research topics of 'RGFinder: A system for determining semantically related genes using GO graph minimum spanning tree'. Together they form a unique fingerprint.

Cite this