Predictive analytics of crude oil prices by utilizing the intelligent model search engine

Korkut Bekiroglu, Okan Duru, Emrah Gulay, Rong Su, Constantino Manuel Lagoa

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

This paper proposes an intelligent model search engine (IMSE), an integrated model selection algorithm, subject to the out of sample predictive performance and given set of explanatory variables for forecasting crude oil prices. In the conventional applications of energy price forecasting, models are selected based on preliminary assumptions on causality and model structure (e.g. lag length in lagged variables). Relaxation of those assumptions would cause over-fitting and reduce the degree of freedom. Considering the ultimate objective of forecasting models, any variations of models may be tested in the out-of-sample period, and the optimization problem can be redefined as minimization of post-sample error metric in a validation set. By this, data mining would be a legitimate operation for economic forecasting, and it also proves required conditions usually tested by diagnostic tests such as Akaike Information Criterion for model quality. IMSE is a multi-input/single output difference equation based approach which allows users to test various models (for given set of explanatory variables) as well as various order of lagged inputs (lag length) without a priori assumption or theoretical basis except defining set of potential inputs. Finally, it selects the best model subject to predictive accuracy in a validation set. Empirical results indicated that the proposed algorithm significantly outperformed a broad range of benchmark methodologies as well as proving that certain assumptions of econometric approach (e.g. statistical significance of explanatory variables) are independent of predictive performance.

Original languageEnglish (US)
Pages (from-to)2387-2397
Number of pages11
JournalApplied Energy
Volume228
DOIs
StatePublished - Oct 15 2018

Fingerprint

Search engines
crude oil
engine
Crude oil
price
Predictive analytics
Akaike information criterion
data mining
Difference equations
Model structures
econometrics
model test
Data mining
Economics
methodology

All Science Journal Classification (ASJC) codes

  • Building and Construction
  • Energy(all)
  • Mechanical Engineering
  • Management, Monitoring, Policy and Law

Cite this

Bekiroglu, Korkut ; Duru, Okan ; Gulay, Emrah ; Su, Rong ; Lagoa, Constantino Manuel. / Predictive analytics of crude oil prices by utilizing the intelligent model search engine. In: Applied Energy. 2018 ; Vol. 228. pp. 2387-2397.
@article{9b4f556ba3cd4ffc81f6d3decdebcdbe,
title = "Predictive analytics of crude oil prices by utilizing the intelligent model search engine",
abstract = "This paper proposes an intelligent model search engine (IMSE), an integrated model selection algorithm, subject to the out of sample predictive performance and given set of explanatory variables for forecasting crude oil prices. In the conventional applications of energy price forecasting, models are selected based on preliminary assumptions on causality and model structure (e.g. lag length in lagged variables). Relaxation of those assumptions would cause over-fitting and reduce the degree of freedom. Considering the ultimate objective of forecasting models, any variations of models may be tested in the out-of-sample period, and the optimization problem can be redefined as minimization of post-sample error metric in a validation set. By this, data mining would be a legitimate operation for economic forecasting, and it also proves required conditions usually tested by diagnostic tests such as Akaike Information Criterion for model quality. IMSE is a multi-input/single output difference equation based approach which allows users to test various models (for given set of explanatory variables) as well as various order of lagged inputs (lag length) without a priori assumption or theoretical basis except defining set of potential inputs. Finally, it selects the best model subject to predictive accuracy in a validation set. Empirical results indicated that the proposed algorithm significantly outperformed a broad range of benchmark methodologies as well as proving that certain assumptions of econometric approach (e.g. statistical significance of explanatory variables) are independent of predictive performance.",
author = "Korkut Bekiroglu and Okan Duru and Emrah Gulay and Rong Su and Lagoa, {Constantino Manuel}",
year = "2018",
month = "10",
day = "15",
doi = "10.1016/j.apenergy.2018.07.071",
language = "English (US)",
volume = "228",
pages = "2387--2397",
journal = "Applied Energy",
issn = "0306-2619",
publisher = "Elsevier BV",

}

Predictive analytics of crude oil prices by utilizing the intelligent model search engine. / Bekiroglu, Korkut; Duru, Okan; Gulay, Emrah; Su, Rong; Lagoa, Constantino Manuel.

In: Applied Energy, Vol. 228, 15.10.2018, p. 2387-2397.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Predictive analytics of crude oil prices by utilizing the intelligent model search engine

AU - Bekiroglu, Korkut

AU - Duru, Okan

AU - Gulay, Emrah

AU - Su, Rong

AU - Lagoa, Constantino Manuel

PY - 2018/10/15

Y1 - 2018/10/15

N2 - This paper proposes an intelligent model search engine (IMSE), an integrated model selection algorithm, subject to the out of sample predictive performance and given set of explanatory variables for forecasting crude oil prices. In the conventional applications of energy price forecasting, models are selected based on preliminary assumptions on causality and model structure (e.g. lag length in lagged variables). Relaxation of those assumptions would cause over-fitting and reduce the degree of freedom. Considering the ultimate objective of forecasting models, any variations of models may be tested in the out-of-sample period, and the optimization problem can be redefined as minimization of post-sample error metric in a validation set. By this, data mining would be a legitimate operation for economic forecasting, and it also proves required conditions usually tested by diagnostic tests such as Akaike Information Criterion for model quality. IMSE is a multi-input/single output difference equation based approach which allows users to test various models (for given set of explanatory variables) as well as various order of lagged inputs (lag length) without a priori assumption or theoretical basis except defining set of potential inputs. Finally, it selects the best model subject to predictive accuracy in a validation set. Empirical results indicated that the proposed algorithm significantly outperformed a broad range of benchmark methodologies as well as proving that certain assumptions of econometric approach (e.g. statistical significance of explanatory variables) are independent of predictive performance.

AB - This paper proposes an intelligent model search engine (IMSE), an integrated model selection algorithm, subject to the out of sample predictive performance and given set of explanatory variables for forecasting crude oil prices. In the conventional applications of energy price forecasting, models are selected based on preliminary assumptions on causality and model structure (e.g. lag length in lagged variables). Relaxation of those assumptions would cause over-fitting and reduce the degree of freedom. Considering the ultimate objective of forecasting models, any variations of models may be tested in the out-of-sample period, and the optimization problem can be redefined as minimization of post-sample error metric in a validation set. By this, data mining would be a legitimate operation for economic forecasting, and it also proves required conditions usually tested by diagnostic tests such as Akaike Information Criterion for model quality. IMSE is a multi-input/single output difference equation based approach which allows users to test various models (for given set of explanatory variables) as well as various order of lagged inputs (lag length) without a priori assumption or theoretical basis except defining set of potential inputs. Finally, it selects the best model subject to predictive accuracy in a validation set. Empirical results indicated that the proposed algorithm significantly outperformed a broad range of benchmark methodologies as well as proving that certain assumptions of econometric approach (e.g. statistical significance of explanatory variables) are independent of predictive performance.

UR - http://www.scopus.com/inward/record.url?scp=85050482612&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85050482612&partnerID=8YFLogxK

U2 - 10.1016/j.apenergy.2018.07.071

DO - 10.1016/j.apenergy.2018.07.071

M3 - Article

AN - SCOPUS:85050482612

VL - 228

SP - 2387

EP - 2397

JO - Applied Energy

JF - Applied Energy

SN - 0306-2619

ER -