Publishing FAIR data: an exemplar methodology utilizing PHI-base

Rodriguez-Iglesias, A., Rodriguez-Gonzales, A., Irvine, A. G., Sesma, A., Urban, Martin

and Wilkinson, M. D. (2016) Publishing FAIR data: an exemplar methodology utilizing PHI-base. Frontiers in Plant Science, 7. p. 641. 10.3389/fpls.2016.00641

Copy

Pathogen-Host interaction data is core to our understanding of disease processes and their molecular/genetic bases. Facile access to such core data is particularly important for the plant sciences, where individual genetic and phenotypic observations have the added complexity of being dispersed over a wide diversity of plant species vs. the relatively fewer host species of interest to biomedical researchers. Recently, an international initiative interested in scholarly data publishing proposed that all scientific data should be “FAIR”—Findable, Accessible, Interoperable, and Reusable. In this work, we describe the process of migrating a database of notable relevance to the plant sciences—the Pathogen-Host Interaction Database (PHI-base)—to a form that conforms to each of the FAIR Principles. We discuss the technical and architectural decisions, and the migration pathway, including observations of the difficulty and/or fidelity of each step. We examine how multiple FAIR principles can be addressed simultaneously through careful design decisions, including making data FAIR for both humans and machines with minimal duplication of effort. We note how FAIR data publishing involves more than data reformatting, requiring features beyond those exhibited by most life science Semantic Web or Linked Data resources. We explore the value-added by completing this FAIR data transformation, and then test the result through integrative questions that could not easily be asked over traditional Web-based data resources. Finally, we demonstrate the utility of providing explicit and reliable access to provenance information, which we argue enhances citation rates by encouraging and facilitating transparent scholarly reuse of these valuable data holdings.

Item Type	Article
Open Access	Gold
Keywords	FAIR data, Linked Data, Pathogen-Host Interactions, PHI-base, Semantic Web, Semantic PHI-base, SPARQL, data integration
Project	Wheat, Pathogen-Host Interactions Database: PHI Database [2012-2017], PhytoPath: an Integrated resource for comparative phytopathogen genomics [2011-2014], PhytoPath, an infrastructure for hundreds of plant pathogen genomes, [20:20 Wheat] Protecting yield potential of wheat
Date Deposited	05 Dec 2025 09:52
Last Modified	19 Dec 2025 14:36

Explore Further

Biotechnology and Biological Sciences Research Council

Frontiers in Plant Science

10.3389/fpls.2016.00641 (DOI)

picture_as_pdf: fpls-07-00641.pdf
subject: Published Version
Creative Commons Attribution: Available under Creative Commons: Attribution 4.0

View

Download

EndNote

BibTeX

Reference Manager

Refer

Atom

Dublin Core

RIOXX2 XML

HTML Citation

OpenURL ContextObject

OpenURL ContextObject in Span

MODS

OPENAIRE

MPEG-21 DIDL

ASCII Citation

Data Cite XML

METS

Export

Downloads