A study on time-series prediction and analysis of acidity of Daqu based on multivariate data fusion and KNN-Attention-LSTM-XGBoost modeling

  • 1School of Mechanical Engineering, Sichuan University of Science and Engineering, Yibin, 644000, China.
  • 2School of Mechanical Engineering, Sichuan University of Science and Engineering, Yibin, 644000, China. 13527447962@163.com.
  • 3Anhui Province Key Laboratory of Intelligent Solid-State Fermentation Technology, Bozhou, 236800, China.
  • 4The Liquor Making Biological Technology and Application of Key Laboratory of Sichuan Province, Yibin, 644000, China.

Abstract

Daqu is a traditional Chinese brewing ingredient that serves dual functions of saccharification and fermentation during the brewing process. The acidity content during the Daqu fermentation process directly affects the quality of the Daqu. Traditional methods for measuring Daqu acidity are complex and exhibit lag, making it difficult to monitor fermentation acidity in real time. Given the strong correlation between Daqu acidity and environmental variables, this paper proposes a time series prediction model for Daqu acidity based on the KNN-Attention-LSTM-XGBoost model. Upon collecting and analyzing the microenvironmental parameters of Daqu, the XGBoost model was used to select two optimal imputation methods (LFBI and KNN). Partial Least Squares Regression (PLSR) was employed to extract key parameters, and feature extraction using the lag and rolling window methods was performed to capture temporal trends and fluctuations. Comparative analysis revealed that KNN preprocessing combined with the Attention-LSTM-XGBoost model performed best in predicting Daqu acidity, with R2 values reaching 0.9790, 0.9768, and 0.9636 for the upper, middle, and lower Daqu layers, respectively. This combination outperformed the LSTM-XGBoost and XGBoost models, with improvements of 3.87%, 1.11%, and 2.84% compared to LSTM-XGBoost, and 4.70%, 4.37%, and 8.46% compared to XGBoost. This study addresses the challenge of predicting Daqu acidity during fermentation and provides insights into the optimization of the Daqu fermentation process.

Related Concept Videos

End Point Prediction: Gran Plot 01:07

300

A Gran plot is used to predict the equivalence volume or endpoint of a potentiometric or acid-base titration without reaching the endpoint. Typically, titration data is collected as a function of the titrant's volume up to a point less than the equivalence volume and then transformed into a linear format. The straight line is extended to the x-axis, indicating the necessary titrant volume to achieve the equivalence point.
For potentiometric titration, the Gran plot is created by plotting...

Acid–Base Titration: Overview 01:26

9.4K

An acid-base titration is a technique used to determine the concentration of an unknown acid or base, using a titrant of known concentration–either a base for acid titration or an acid for base titration. The process involves gradually adding the titrant, leading to a predictable change in the pH of the solution. This change is plotted on a titration curve, showing how a solution's pH varies with the amount of titrant added. Such curves are instrumental in monitoring the...

Acid-Base Titration Curves 02:23

127.4K

A titration curve is a plot of some solution property versus the amount of added titrant. For acid-base titrations, solution pH is a useful property to monitor because it varies predictably with the solution composition and, therefore, may be used to monitor the titration’s progress and detect its endpoint. Acid-base titration can be performed with a strong acid and a strong base, a strong acid and a weak base, or a strong base and a weak acid.
For a titration carried out for 25.00 mL of...

Acidity and Basicity of Carboxylic Acid Derivatives 01:25

3.4K

Carboxylic acids are the strongest among organic acids, as they readily lose the hydroxyl proton to form a resonance-stabilized carboxylate ion. In comparison, the acid derivatives lack acidic hydrogens directly attached to a functional group. In these compounds, the acidic nature arises from their ability to lose α hydrogens, making them weakly acidic.
The relative acidic strength of the derivatives can be explained based on the extent of resonance stabilization of the conjugate base. The...

Acidity of Carboxylic Acids 01:21

6.9K

Carboxylic acids are the strongest organic acids. However, their acidic strength is much less than mineral acids like HCl. Carboxylic acids ionize in water and readily lose the hydroxyl proton to form a resonance-stabilized carboxylate ion.

The acid dissociation constant (Ka) or pKa value indicates the extent of ionization, reflecting the moderate acidic strength of carboxylic acids. For simple carboxylic acids, the Ka values are around 10−5, and the pKa values are in the range of...

Ladder Diagrams: Acid–Base Equilibria 01:32

466

Understanding the chemistry between the reagents is necessary for performing any experiment. To this end, scientists have designed a tool called a ladder diagram, which is a graphical representation that helps illustrate the chemistry of a system.
A ladder diagram for acid-base equilibria consists of a vertical axis that represents pH and horizontal bars (steps on the ladder) that help position all the pKa values in the system. At equilibrium, the pH value of the system corresponds to one of...