Explaining Data Preprocessing Methods for Modeling and Forecasting with the Example of Product Drying

Korkmaz, Cem; Kacar, Ilyas

Explaining Data Preprocessing Methods for Modeling and Forecasting with the Example of Product Drying

Tarih

2024

Yazarlar

Korkmaz, Cem

Kacar, Ilyas

Yayıncı

Univ Namik Kemal

Erişim Hakkı

info:eu-repo/semantics/openAccess

Özet

Although regression is a traditional data processing method, machine and deep learning methods have been widely used in the literature in recent years for both modelling and prediction. However, in order to use these methods efficiently, it is important to perform a preliminary evaluation to understand the data type. Therefore, preevaluation procedures are described in this study. Experimental uncertainty analysis was performed to determine the measurement uncertainties in the measurement devices and sensors used in the drying experimental setup. Significant and insignificant relationships between variables in the data set were determined by Pearson correlation matrix. Autocorrelation and partial autocorrelation functions were used to determine the time series lag in the drying data and an AR(5) series with 5 lags was determined. The data were found to have variable variance due to peaks and troughs in the raw data resulting from the natural behaviour of the drying process. Modelling success was achieved with the normalisation pre -evaluation process performed without distorting the raw data. Thus, it has been shown that better models can be obtained compared to traditional models. In order to avoid unnecessary time and computational costs in the trial and error method used to determine the number of hidden layers and neurons in the machine learning method, various formulas proposed in the literature were compared. It is shown that the correlation coefficient alone is not sufficient to determine the goodness of the model. In modelling the data in this study, the NARX model was found to converge to the desired value faster and with less error than ANFIS and LSTM models. In the simulation of a rotary drum dryer, the optimum number of mesh elements was determined as 1137 by mesh independence analysis. In this way, unnecessary over -calculations were also prevented. Of course, all these methods are already available in statistical science. However, in this study, the methods to be used for modelling and prediction purposes are carefully selected and explained with examples, especially for young researchers who are outside this field to gain speed and easy comprehension.

Anahtar Kelimeler

Data processing, Regression, Pearson, Autocorrelation function, Partial autocorrelation function, Heteroscedasticity, Convergence, Validation

Kaynak

Journal of Tekirdag Agriculture Faculty-Tekirdag Ziraat Fakultesi Dergisi

WoS Q Değeri

N/A

Scopus Q Değeri

Q3

Cilt

21

Sayı

2

Bağlantı

https://doi.org/10.33462/jotaf.1300122
https://search.trdizin.gov.tr/tr/yayin/detay/1231254
https://hdl.handle.net/11480/14883

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu
TR-Dizin İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Explaining Data Preprocessing Methods for Modeling and Forecasting with the Example of Product Drying

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon