Unknown

Dataset Information

0

A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines.


ABSTRACT:

Background

The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for model training, which allows to validate or refute research hypotheses and to carry out comparisons between models. However, public datasets of images are scarce and ground truth images are surprisingly few considering the numbers required for training algorithms.

Results

We created a dataset of 1,283 multidimensional arrays, using berries from five different grape varieties. Each array has 37 images of wavelengths between 488.38 and 952.76 nm obtained from single berries. Coupled to each multispectral image, we added a dataset with measurements including, weight, anthocyanin content, and Brix index for each independent grape. Thus, the images have paired measures, creating a ground truth dataset. We tested the dataset with 2 neural network algorithms: multilayer perceptron (MLP) and 3-dimensional convolutional neural network (3D-CNN). A perfect (100% accuracy) classification model was fit with either the MLP or 3D-CNN algorithms.

Conclusions

This is the first public dataset of grape ground truth multispectral images. Associated with each multispectral image, there are measures of the weight, anthocyanins, and Brix index. The dataset should be useful to develop deep learning algorithms for classification, dimensionality reduction, regression, and prediction analysis.

SUBMITTER: Navarro PJ 

PROVIDER: S-EPMC9197681 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel ground truth multispectral image dataset with weight, anthocyanins, and Brix index measures of grape berries tested for its utility in machine learning pipelines.

Navarro Pedro J PJ   Miller Leanne L   Díaz-Galián María Victoria MV   Gila-Navarro Alberto A   Aguila Diego J DJ   Egea-Cortines Marcos M  

GigaScience 20220601


<h4>Background</h4>The combination of computer vision devices such as multispectral cameras coupled with artificial intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised artificial intelligence algorithms require large ground truth image datasets for model training, which allows to validate or refute research hypotheses and to carry out comparisons between models. However, public datasets of images are scarce and ground truth images are surpri  ...[more]

Similar Datasets

| S-EPMC6271032 | biostudies-literature
| S-EPMC9587524 | biostudies-literature
| PRJNA736205 | ENA
| S-EPMC6971370 | biostudies-literature
| S-EPMC10973596 | biostudies-literature
2022-10-11 | PXD034968 | Pride
2018-08-31 | GSE117010 | GEO
| S-EPMC11666237 | biostudies-literature
2023-12-21 | PXD044451 | Pride
| S-EPMC7898798 | biostudies-literature