Open Access
Issue
BIO Web Conf.
Volume 56, 2023
43rd World Congress of Vine and Wine
Article Number 02003
Number of page(s) 3
Section Oenology
DOI https://doi.org/10.1051/bioconf/20235602003
Published online 24 February 2023

© The Authors, published by EDP Sciences, 2023

Licence Creative CommonsThis is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0 (http://creativecommons.org/licenses/by/4.0/).

1 Introduction

There are high complexities in the attempt to define wine quality. However, physical-chemometric techniques can objectively assess wine quality traits [1]. Sensory analysis using trained panels could also broaden the definition of wines' specific characteristics that may help define quality traits.

These techniques have also been implemented to assess the provenance and counterfeiting of wines or to detect mislabeling from wineries. Near-infrared spectroscopy can offer a complete chemical fingerprinting profile of wines depending on the spectral range and sensitivity of different instruments. This instrument has been used for various applications, from general quality assessment [2] to predicting aroma profiles in beverages [3]. Furthermore, the use of digital technologies and sensors, such as NIR, electronic noses, and tongues, allows the implementation of Artificial Intelligence (AI) and ML modeling for wine quality prediction [4], to the specific prediction of phenolic compounds in red wines and geographical provenance [5], sugar content [5] astringency [6], and wine authenticity [7].

However, most of these works are based on wine samples extracted by direct measurement from wines or by opening wine bottles for measurement purposes. There is limited research focused on measurements through the bottle that can offer non-invasive tools for physicochemical, sensory, and aroma profile assessment of wines. This work presents the implementation of NIR spectroscopy measured through the bottles of unopened wines in parallel with AI and ML techniques to assess wine quality traits, and provenance and as a potential method to evaluate counterfeiting and mislabeling.

2 Materials and methods

2.1 Samples description

Samples of 13 different vintages from 2000-2021 produced at the Dookie College Winery from The University of Melbourne, Victoria, Australia (36°38′S, 145°71′E) were used for this study.

2.2 Near-infrared spectroscopy and sensory analysis

Samples were measured to assess their chemical fingerprinting within 1596-2396 through the bottle at three different spots (top, middle, and bottom) and three different sides (three measurements per spot/side; n = 27) using a handheld NIR Micro PHAZIR™ RX Analyzer (Thermo Fisher Scientific, Waltham, MA, USA) with a custom-made attachment to cover any external light and reduce noise from the environment. The empty bottles were measured three times and averaged further to subtract the values from the wine through bottle data to remove the glass components.

A quantitative descriptive analysis (QDA®) sensory session was conducted with a trained panel of 12 participants. The study was approved by the Human Ethics Advisory Group from The University of Melbourne (ID: 1953926.4), and panelists signed a consent form to participate. A total of 20 descriptors, (i) color intensity, (ii) clarity, (iii) aroma truffle, (iv) aroma smoke, (v) aroma blackberry, (vi) aroma blackcurrant, (vii) aroma prune, (viii) aroma butter, (ix) aroma pepper, (x) aroma cedar, (xi) aroma violet, (xii) aroma redcurrant, (xiii) bitterness, (xiv) acidic, (xv) sweetness, (xvi) astringency, (xvii) body, (xviii) warmingmouthfeel, (xix) tingling mouthfeel, and (xx) perceived quality, were evaluated using a 15-cm non-structured scale.

2.3 Machine learning modelling

Two machine learning models were developed based on artificial neural networks (ANN) using a code written in Matlab® R2021a (Mathworks, Inc., Natick, MA, USA) by the Digital Agriculture, Food and Wine group from The University of Melbourne (DAFW-UoM) [8]. Model 1 was developed using pattern recognition with Bayesian Regularization algorithm and NIR data as inputs to predict the vintages (Fig. 1a). Model 2 was developed using regression ANN with the Levenberg Marquardt algorithm and NIR data as inputs to predict the intensity of 20 sensory descriptors (Fig. 1b). Both models were developed with interleaved data division as 70% for training and 30% for testing in Model 1 and random data division as 70% training, 15% validation, and 15% testing for Model 2. Model 1 performance was assessed using cross-entropy, while Model 2 used a means squared error algorithm.

thumbnail Figure 1

Diagrams of the artificial neural network (a) classification Model 1 to predict the wine vintages and (b) regression Model 2 to predict the intensity of 20 sensory descriptors using the near-infrared absorbance values measured through the bottle.

3 Results and Discussion

Table 1 shows that Model 1 had a very high accuracy (97%) in predicting the vintages. This model did not present signs of over-fitting since the performance (cross-entropy) value of the training stage (<0.01) was lower than the testing (0.01).

The ML models obtained, due to their accuracy and without signs of over fitting, are highly replicable since bottle material and thickness are usually consistent with the commercial source by the winery used in this study. Models would need to be retrained for wineries using different shapes of bottles or internal and external diameters. These ML models would not apply to sparkling wines since bottles are thicker to sustain pressures of around 8 bars or more. The latter application would require retraining the algorithm incorporating more data using these types of bottles.

Figure 2 shows the receiver operating characteristics curve where all vintages were close to the true-positive rates. This confirms the model accuracy and suitability to classify the samples according to their vintage.

It is well known that variability of weather conditions and water availability within seasonsdirectly impact the quality traits of grapes produced and the subsequent winemaking process. Seasonal variability made it possible for the ML models to establish a pattern of analysis to distinguish different vintages from the same winery, which can be used to establish consistency and traceability of different bottles to avoid mislabeling.

Table 2 shows that Model 2 had accuracy based on the correlation coefficient (R = 0.95; Fig. 3) to predict the intensity of 20 sensory descriptors. It can be observed that the slopes of all stages are high (>0.88), and the performance value of the testing stage (MSE = 0.26) was lower than the validation and testing stages (MSE = 0.56 and 0.63, respectively), being the last two values very close, which are signs of no over fitting of the models. Figure 3 shows the predicted (y-axis) and observed (x-axis) values of the 20 descriptors from the sensory analysis.

Sensory analysis of wines modeled through the bottles offers powerful tools for wineries and retailers to monitor the acceptability and quality traits of wines over time. These models can be used as quality assurance to evaluate storage conditions and the ageing process. Furthermore, these models can be used as an extra layer of information to assess wines' provenance and detect mislabeling.

Table 1

Results from the classification machine learning model to predict wine vintages.

thumbnail Figure 2

Overall receiver operating characteristics (ROC) curve showing the true-positive and false-positive rates of Model 1 for all vintages.

Table 2

The regression machine learning model results in predicting wine sensory descriptors using NIR absorbance values as input. Abbreviations: R: correlation coefficient; MSE: means squared error.

thumbnail Figure 3

Overall machine learning model to predict sensory descriptors using near-infrared (NIR) absorbance values measured through the bottle as inputs.

4 Conclusiones

Digital sensor technology, such as NIR spectroscopy, can effectively and accurately assess wine quality traits based on chemical fingerprinting and ML modeling with data obtained through the bottle. This novel application, in parallel with low-cost NIR devices, could be used from the winery to retailers to assess how these quality traits change during transport, aging, storage, and for the detection of tampering, adulteration, and counterfeiting. A further application can be used to assess wines' provenance and detect mislabeling.

References

All Tables

Table 1

Results from the classification machine learning model to predict wine vintages.

Table 2

The regression machine learning model results in predicting wine sensory descriptors using NIR absorbance values as input. Abbreviations: R: correlation coefficient; MSE: means squared error.

All Figures

thumbnail Figure 1

Diagrams of the artificial neural network (a) classification Model 1 to predict the wine vintages and (b) regression Model 2 to predict the intensity of 20 sensory descriptors using the near-infrared absorbance values measured through the bottle.

In the text
thumbnail Figure 2

Overall receiver operating characteristics (ROC) curve showing the true-positive and false-positive rates of Model 1 for all vintages.

In the text
thumbnail Figure 3

Overall machine learning model to predict sensory descriptors using near-infrared (NIR) absorbance values measured through the bottle as inputs.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.