Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species

Kersey, P. J., Allen, J. E., Allot, A., Barba, M., Boddu, S., Bolt, B. J., Carvalho-Silva, D., Christensen, M., Davis, P., Grabmueller, C., +36 more...Kumar, N., Liu, Z., Maurel, T., Moore, B., Mcdowall, M. D., Maheswari, U., Naamati, G., Newman, V., Ong, C. K., Paulini, M., Pedro, H., Perry, E., Russell, M., Sparrow, H., Tapanari, E., Taylor, K., Vullo, A., Williams, G., Zadissia, A., Olson, A., Stein, J., Wei, S., Tello-Ruiz, M., Ware, D., Luciani, A., Potter, S., Finn, R. D., Urban, Martin

, Hammond-Kosack, Kim

, Bolser, D. M., De Silva, N., Howe, K. L., Langridge, N., Maslen, G., Staines, D. M. and Yates, A. (2017) Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species. Nucleic Acids Research, 46 (D1). D802-D808. 10.1093/nar/gkx1011

Copy

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.

Item Type	Article
Open Access	Gold
Project	Designing Future Wheat (DFW) [ISPG], Wheat, DFW - Designing Future Wheat - Work package 2 (WP2) - Added value and resilience, Pathogen-Host Interactions Database: PHI Database [2012-2017], PHI-base National Capability [2017-2019], PhytoPath, an infrastructure for hundreds of plant pathogen genomes, [20:20 Wheat] Protecting yield potential of wheat
Date Deposited	05 Dec 2025 09:55
Last Modified	19 Dec 2025 14:37