universal_grass_peps

Mitchell, RowanORCID logo (2023) universal_grass_peps. [Data Collection]
Copy

This dataset is the output from a bioinformatics pipeline developed by Rowan Mitchell during 2018-2024 that seeks to identify all universal protein-coding genes in grasses and to estimate how specific they are to grasses. The dataset has 5 components: (1) universal_grass_peps.xlsx contains summary information on all the universal groups of peps identified. (2) files in genBlastG/* are genome annotation files for each novel gene model generated by the genBlastG files in the pipeline. (3) hmms/*.msa.fa are the multiple alignment sequence fasta files, one for each group. (4) files hmms/final_db.hmms* are for use to search the database with query sequences using the HMMER package. (5) files in lookup/* allow users to find which groups a grass query pep ID is a member of, or associated to, for 16 different grass species.

list_of_files.txt
subject
Other
Available under Creative Commons: Attribution 4.0
description
text/plain
folder_info
1MB

Download
files_v1.4.tar.gz
subject
Other
Available under Creative Commons: Attribution 4.0
folder_zip
application/x-gzip
folder_info
49MB

Download

Atom BibTeX OpenURL ContextObject in Span OpenURL ContextObject Dublin Core MPEG-21 DIDL Data Cite XML EndNote HTML Citation METS MODS RIOXX2 XML Reference Manager Refer ASCII Citation
Export

Downloads