A - Papers appearing in refereed journals
Van Vliet, J. J., Atkins, K., Kurup, S., Siles, L., Hepworth, J., Corke, F. M. K., Doonan, J. H. and Hu, C. 2025. DeepCanola: Phenotyping brassica pods using semi-synthetic data and active learning. Computers and Electronics in Agriculture. 237 (Part B), p. 110470. https://doi.org/10.1016/j.compag.2025.110470
Authors | Van Vliet, J. J., Atkins, K., Kurup, S., Siles, L., Hepworth, J., Corke, F. M. K., Doonan, J. H. and Hu, C. |
---|---|
Abstract | Phenotyping, the measurement of attributes or traits, is crucial in selecting superior cultivars for specific environmental situations. This is a time-consuming process when applied to large populations but can be accelerated through the use of deep learning, resulting in an algorithm that can phenotype images of specimens in negligible amounts of time. The primary issue with deep learning is the large quantities of high-quality training data required to make a viable phenotyping pipeline. To address this, we present a semi-synthetic training data generation system which significantly reduces the amount of human effort spent on data collection. We use active learning alongside this system to create DeepCanola, an instance segmentation model that successfully segments and measures the valves from Brassica napus pods. We demonstrate that the model accurately estimates the effect of different winter cold treatments on a range of different cultivars and crop types as effectively as manually curated measurements. Furthermore, the resulting model is effective on data from various experimental settings and on different, but related, species such as Arabidopsis thaliana, Alliaria petiolata (garlic mustard) and Raphanus raphanistrum subsp. sativus (radish). This robust tool could be easily scaled, thereby accelerating breeding or fundamental research programs. Code and model weights: https://github.com/kieranatkins/deepcanola. |
Keywords | Deep learning; Plant phenotyping; Semi-synthetic data; Active learning; Human-in-the-loop; Pod length |
Year of Publication | 2025 |
Journal | Computers and Electronics in Agriculture |
Journal citation | 237 (Part B), p. 110470 |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.compag.2025.110470 |
Open access | Published as ‘gold’ (paid) open access |
Funder | Biotechnology and Biological Sciences Research Council |
Funder project or code | Tailoring Plant Metabolism (TPM) - Work package 1 (WP1) - High value lipids for health and industry; Brassica Rapeseed And Vegetable Optimisation (BRAVO) |
Output status | Published |
Publication date (online) | 11 Jun 2025 |
Publication process date (accepted) | 25 Apr 2025 |
Publisher | Elsevier |
ISSN | 0168-1699 |
Permalink - https://repository.rothamsted.ac.uk/item/9940y/deepcanola-phenotyping-brassica-pods-using-semi-synthetic-data-and-active-learning