DeepCanola: Phenotyping brassica pods using semi-synthetic data and active learning

A - Papers appearing in refereed journals

Van Vliet J. J., Atkins, K., Kurup, S., Siles, L., Hepworth, J, Corke, F. M. K., Doonan, J. H. and Hu, C. 2025. DeepCanola: Phenotyping brassica pods using semi-synthetic data and active learning. Computers and Electronics in Agriculture. 237 (Part B), p. 11047. https://doi.org/10.1016/j.compag.2025.110470

AuthorsVan Vliet J. J., Atkins, K., Kurup, S., Siles, L., Hepworth, J, Corke, F. M. K., Doonan, J. H. and Hu, C.
Abstract

Phenotyping, the measurement of attributes or traits, is crucial in selecting superior cultivars for specific environmental situations. This is a time-consuming process when applied to large populations but can be accelerated through the use of deep learning, resulting in an algorithm that can phenotype images of specimens in negligible amounts of time. The primary issue with deep learning is the large quantities of high-quality training data required to make a viable phenotyping pipeline. To address this, we present a semi-synthetic training data generation system which significantly reduces the amount of human effort spent on data collection. We use active learning alongside this system to create DeepCanola, an instance segmentation model that successfully segments and measures the valves from Brassica napus pods. We demonstrate that the model accurately estimates the effect of different winter cold treatments on a range of different cultivars and crop types as effectively as manually curated measurements. Furthermore, the resulting model is effective on data from various experimental settings and on different, but related, species such as Arabidopsis thaliana, Allaria petiolate (garlic mustard) and Raphanus raphanistrum subsp. sativus (radish). This robust tool could be easily scaled, thereby accelerating breeding or fundamental research programs. Code and model weights: https://github.com/kieranatkins/deepcanola.

KeywordsDeep learning ; Plant phenotyping ; Semi-synthetic data ; Active learning ; Human-in-the-loop ; Pod length
Year of Publication2025
JournalComputers and Electronics in Agriculture
Journal citation237 (Part B), p. 11047
Digital Object Identifier (DOI)https://doi.org/10.1016/j.compag.2025.110470
Open accessPublished as ‘gold’ (paid) open access
FunderBiotechnology and Biological Sciences Research Council
Funder project or codeTailoring Plant Metabolism (TPM) - Work package 1 (WP1) - High value lipids for health and industry
Brassica Rapeseed And Vegetable Optimisation (BRAVO)
Publisher's version
Output statusPublished
Publication dates
Online11 Jun 2025
Publication process dates
Accepted25 Apr 2025
PublisherElsevier
ISSN0168-1699

Permalink - https://repository.rothamsted.ac.uk/item/9940y/deepcanola-phenotyping-brassica-pods-using-semi-synthetic-data-and-active-learning

1 total views
0 total downloads
1 views this month
0 downloads this month
Download files as zip