Newer
Older

nina.marthe_ird.fr
committed
GrAnnoT is an annotation transfer tool for pangenome graphs. It can transfer genome annotations to a pangenome graph containing the genome, and also transfer the pangenome graph's annotations on the genomes it contains. It also outputs complementary information such as the alignments of the transfered genes, or a presence-absence matrix.
This project is young and in development, some errors will be corrected, improvements will be made and new features will appear in the near future.

nina.marthe_ird.fr
committed
GrAnnoT requires the graph to be in GFA format with W-lines or P-lines that describe the walks or paths of the genomes. For example minigraph-cactus or VG give that kind of output. If the graph has P-lines, the names of the paths must have the form IRGSP#0#Chr1 with three fields being the genome name, the haplotype, and the chromosome/contig name. The graphs must be full (option -gfa full for minigraph cactus).
Please note that graphs build with PGGB are not supported. Improvements will be made in the future to include these graphs.
You can run GrAnnoT via the script main.py with the following arguments :
- annotation file of a genome embedded in the graph
- the name of the source genome, that the annotation file refers to
- optionnaly the names of the target genomes (these names must be included in the walks or paths of the graph). If no name is specified, the annotation will be transfered on all the genomes in the graph.
The other options can be found if you run "main.py -h".
- a GFF and a GAF file with the annotation of the graph
- the GFF file with the annotation of the target genome
- a text file with the details of the variations within the annotated regions
- a text file with the alignments of the genes from the source genome and from the target genome
For all the target genomes :
- A PAV matrix recapitulating the transfers
If you have any question or suggestion, feel free to contact me by email : nina.marthe@ird.fr
If you use this program please cite Nina Marthe, Francois Sabot and Matthias Zytnicki.