We present CAFA-evaluator, a powerful Python program designed to evaluate the performance of prediction methods on targets with hierarchical concept dependencies. It generalizes multi-label evaluation to modern ontologies where the prediction targets are drawn from a directed acyclic graph and achieves high efficiency by leveraging matrix computation and topological sorting. The program requirements include a small number of standard Python libraries, making CAFA-evaluator easy to maintain. The code replicates the Critical Assessment of protein Function Annotation (CAFA) benchmarking, which evaluates predictions of the consistent subgraphs in Gene Ontology. Owing to its reliability and accuracy, the organizers have selected CAFA-evaluator as the official CAFA evaluation software.
Cite
@article{arxiv.2310.06881,
title = {CAFA-evaluator: A Python Tool for Benchmarking Ontological Classification Methods},
author = {Damiano Piovesan and Davide Zago and Parnal Joshi and M. Clara De Paolis Kaluza and Mahta Mehdiabadi and Rashika Ramola and Alexander Miguel Monzon and Walter Reade and Iddo Friedberg and Predrag Radivojac and Silvio C. E. Tosatto},
journal= {arXiv preprint arXiv:2310.06881},
year = {2024}
}