Issue |
BIO Web Conf.
Volume 142, 2024
2024 International Symposium on Agricultural Engineering and Biology (ISAEB 2024)
|
|
---|---|---|
Article Number | 01005 | |
Number of page(s) | 8 | |
Section | Agricultural Economic Engineering and Market Management | |
DOI | https://doi.org/10.1051/bioconf/202414201005 | |
Published online | 21 November 2024 |
Enhancing Agriculture QA Models Using Large Language Models
Mathematics, University of California San Diego, La Jolla, USA
* Corresponding author: x6he@ucsd.edu
Agriculture is a complex process that requires a great deal of knowledge and experience. Yet, the complexity and often chaotic nature of web data presents a considerable challenge in obtaining suitable results. Given these production requirements, there is an urgent need to develop a model that enables machine reading comprehension and caters to automatic question-answering scenarios within the scope of agricultural production. In this paper, we construct a dataset for the experiments of document QA in agricultural scenarios. We import two model (ask-my-pdf and chat-pdf) as our baseline and use them to do the single and multiple document QA task. Then, we proposed several methods to improve the performance of the model in agriculture scenarios. At the end of the experiment, we achieved 33.3% improvement in F1 score compared to baseline and 92.8% overall answer accuracy in single document QA. For our final multiple documents QA model, we achieved 53% improvement in F1 score compared to baseline and 83.3% overall answer accuracy in the task.
© The Authors, published by EDP Sciences, 2024
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.