Abstract

Capital structure is an integral part of the corporate finance that sources the funds to finance growth and operations. Managers always have to maintain value of the firm to be higher than the cost of capital in order to maximize the shareholders wealth. Empirical studies have used sources of finance like debt and equity as variables of capital structure. A choice between debt and equity finance analyzes the firm’s ability to perform under the financially constrained environment to attain the sustainable growth. Therefore, it gives rise to a dire need to estimate the cost of capital precisely. We examined the capital structure of top ten market capitalization of the stock markets included in MSCI Emerging index with the use of artificial neural networks, support vector regression, and linear regression in forecasting methods. The capital structure is measured as the proportion of total debt over total equity (Tang et al., 1991). Other financial ratios such as profitability, liquidity, solvent, and turnover ratios were considered as drivers of the capital structure. Applying logistic and hyperbolic tangent activation functions, it was concluded that ANN has a great potential of replacing other traditional forecasting models with the nonstationary data. This research contributes with a new dimension for estimation through different activation functions. There is a possibility of ANN dominance as compared to the other models applied for predictability in financial markets.

1. Introduction

Artificial intelligent systems (AI) and machine learning (ML) have transformed the decision-making from human with an aim to improve quality of business and investments [1]. The dynamic environment is accelerated with creation of complexity in systems resulting in increased competition for which there is a need for a quick and better information decision system. The application of AI tools like artificial neural network (ANN) serves as best alternative to estimate the returns and build a positive vision, deliver more value, and radically improve the quality of business. There are many advances of AI in business in past few years, but implementation in accounting and finance is at a nascent stage with high need for further extensive machine learning techniques [2].

Over the past few years, a progressive revolution in technology took a root in creating artificial intelligence system. With the COVID-19, AI growth is accelerated and aligned towards reorganization of the industries. The concept and applications of AI is subject of interest for the business and academicians. Neural networks consist of multiple simple processors arranged in a communicative network, each of which is programmed to perform one identical, elementary processing task [3]. ANNs are considered to be universal functional approximators and provide a continuous function for a desired accuracy. The real-world systems are nonlinear, and traditional existing model are limited in expressing the relationship for the data series. Thus, ANNs perform nonlinear modeling without prior knowledge between input and output variable serving a more general and flexible model forecasting tool [4].

Neural networks are considered to be promising tools for forecasting financial time series. The factors such as selection of input variables, architecture of the network, and quantity of training data have an impact on accuracy of the model. ANNs models are highly multicomputational and interconnected units where in each input unit is allocated with sets of weights, bias adjustment, and transfer function to transforms the bias and sum of the weighted inputs to determine the output value from the input unit [5].

Classification, clustering, predicting, forecasting, and pattern recognition problems are the basic features of ANN [6]. AIS domain holds a vibrant research and inhibits an advanced role of leadership. Few studies on research on accounting firms reported an extensive use of AI tools in the integrated audit support systems [7]. Elbashir et al. [8] emphasized on identification of management accounting practices by business intelligence vendors. AI research studies have gained tremendous momentum due to the ability to find the suitable patterns to predict the future financial events more accurately that will assist in managerial decision-making.

1.1. Motivation of the Research

The main motivation of the research is due to the explosive growth in the ML and AI tools in the advancement of the finance literature. In the multiple studies performed based on ANN, it was concluded that artificial intelligence had outperformed economic, statistical methods, and other general forecasting models.

The large number of datasets used in the study assist in predictions through training algorithms with high dimension and increase in the number of variables as compared to the traditional structures.

The main purpose of using MSCI index from emerging markets is due to availability of many growth opportunities, free trade agreements, increased investments, and the capital flow. Nature of capital as prior to financial crisis was characterized by low debt levels, while after the crisis, higher debt levels are adopted. The rationale is to set a model in academic research, business managers, policy makers, and investors for real-time implementation and adjustment of capital structure based on predictions to maximize the firm’s value [9].

Additionally, MSCI Emerging Markets Index (MSCI EMI) has accounted since 2000 for a Sharpe ratio of 0.39 and an annualized net return of 7.78%, which has outperformed the MSCI World and MSCI ACWI indexes. Within the MSCI EMI, China stands for a 35% of the country weights mean, while Brazil just 4% and Saudi Arabia less than 4%. Besides Brazil and Saudi Arabia show small weights within MSCI EMI, but in their index regions, they outstand with greater than 60% and 30%. So, despite country weights in one index, opportunities from companies of these countries cannot be misled as shown in the weights in other indexes [10].

This research addresses for a company’s market capitalization criteria instead of an industry type selection. If all financial ratios were not available on any company, then it was selected the next lower market capitalization company. In that sense, the research intends to assess the capital structure forecasting capability of AI models of the highest market capitalization companies in emerging markets with no distinction of the industry type [11].

1.2. Main Objectives

This study contributes to application of artificial intelligence tools with an objective:(i)To identify the best-fit model out of the three models namely ANN, SVR, and LR(ii)Predictive performance of the capital structure in the financial markets

2. Review of the Literature

Debt and equity are the two important variables of capital structure that provide the information to investors in the organization for supporting investment decisions. Research study by Pao [12] revealed the importance of debt with positive relationship with firm size and negative relationship with profitability, nondebt tax shield, and dividend. Large firms related with the more risk and more investment opportunity, so organizations need to develop the sustainable capital structure that maximize the firm growth and increase investment opportunity. A dynamic agency model was developed based on Lintner’s adjustment model based on which managers choose to smooth dividends because they are trying to smooth their managerial rents through use of debt (leverage) and investment (capital expansion) to make sure that payouts are smoothed of [13, 14]. Similarly, another work by Hoang and Hoxha [15] used a sample of firms from 94 countries was explored with the countries in this study, Brazil, China, and Saudi Arabia. Payout policy is influenced by investment and debt policies and cannot be determined independently. In addition, it was revealed that geographic/cultural/institutional variation influence the response of payout policy to other corporate financing decisions. Another pioneer work was by Gatchev et al [16] that explored the interdependent and intertemporal nature of these decisions.

According to the research work [1719] continuous audit, continuous monitoring with stream in the accounting field applies AI techniques. Artificial neural network (ANN) models drive uncertainty in finance through primarily involve in recognition of patterns in data and use these patterns to predict future events. ANNs predict currency movements and interest rate volatility for financial survival, thus claims to deal with the large noisy datasets.

According to Zahedi [20], ANNs translates the complexity of data into mathematical functions and provides analytical tools for business and economic systems that cannot be provided by traditional quantitative tools in statistics and econometrics.

Applications of ANN were classified into three categories like financial analysis prediction, classification of pattern, and optimization and control [21]. Sophisticated techniques employ time series model using ANN for forecasting future data points. Temporal effect determines the relationship of data patterns. Sequence of input data is considerable factor that depicts the relationship, and other types of ANN like recurrent and finite impulse response (FIR) are explored for dealing specific types of issues.

With the growing demand and transformation of the financial markets and the systems globally, the machine learning tools are extensively used, leading to extensive use of ANN technology in finance and accounting and literature for bankruptcy predictions, rating of the bond, forecasting in the stock markets, foreign exchange, evaluation of credit, and fraud detection [22]. Research work of Zhang & Maringer [23] used financial report data for items in the financial balance sheet for ratios such as return on equity, return on assets, price to earnings ratio, and debt to equity ratio.

The artificial neural network (ANN) technology identifies and reproduces a phenomenon or approximates a function without making the parameters explicit. It does not require any assumptions further like noisy, inconsistent, missing, and distribution of data. A wide and extensive literature is available on ANN applications. The high potential of ANN is identified through its highspeed processing model by academicians, researchers, and policy makers reflecting and considered as the superior performance of ANN compared to other econometric tools [2428].

Four different technical indicators are used in the neural network architecture that include raw data, day of the week, prices of stocks are forecasted using the network that consists of input and output layer with the hidden layer between them [29, 30]. Back propagation of error is used to train the machine. Neural network topology is used for precise prediction. Its flexible characteristics make it universally applicable for financial analysis through technical indicators for the input data.

Recently, few studies have used artificial neural network (ANN) nonlinear models to resolve the forecasting problems [31] and focused on explaining capital structure by using time-series, cross-sectional tests, and panel data.

Financial economists predict increase in leverage with the fixed costs, nondebt tax shields, size of the firm, and investment opportunities [32]. According to research work of Moh'd et al. [33] employed time series data, and further results reveal that distribution of ownership equity is important in explaining overall capital structure, and managers try their best to reduce the level of debt to increase the owners wealth associated with the firm [34].

Extant literature is available on application of ANNs in classification and prediction problems in accounting and finance. ANN prediction in financial markets demonstrates the different techniques of the artificial intelligence tools that include technical and like fundamental analysis. A unique study on capital structure of G-7 countries except US related to factors similar to influence the capital structure of US firms by Rajan and Zingales [35] finds that leverage increases with asset structure and size but decreases with growth opportunities and profitability. ANN is based on certain biased functions where the machine is trained with simulated data to predict futuristic function by connecting with adaptive weights [3638].

According to A. V. Devadoss, T. Ligori [39], financial markets are affected by a large number of external factors like economic and political in a complex way and making it difficult to predict precisely. It was supported by [40] that ANN provides the ultimate solution to this. ANN and SVM were used [41] to compare stock price index movements and conclude that ANN is better than SVM model.

On investigation of the best modeling of time series using Neural Network with four different approaches, i.e., raw data, raw data with a time index, detrending, and differencing for modeling, Qi and Zhang [42] concluded that Neural network gives better results.

Heo et al. [43] examined the descriptive and predictive power of ANN and OLS (Ordinary Least Squares) model to evaluate household’s savings-to-income ratios and debt-to-asset ratios cross-sectional and across time. The results revealed ANN models serve as the best providers and serve as a better overall model fit to describe and forecast the financial ratios. The findings further indicate the importance of the machine learning procedures to provide a robust, efficient, and effective analytic method for lenders, financial service providers, academicians, and other researchers. Support Vector Machines (SVM) [45] and Artificial Neural Networks (ANN) are widely used for the prediction of stock price movements. Kuan and Liu (1995) investigated with empirical analysis on foreign exchange rates with recurrent and feed forward neural networks. According to Pao [12], analysis and forecasting of debt ratios achieved a greater accuracy using values of RMSE and ANN models as compared to other regression models [46]. Accordingly, it was suggested that ANN are capable of capturing varying nonlinear effects in the financial industries.

ANNs are more appropriate where the inputs are missing or nonlinear as compared to traditional statistical methods (Wang and Elhang, 2007). ANN model is a better choice for providing solutions involving complex practical problems because of their multidimensional analytical features as compared to other regression models. Factors such as number of observations, forecasting horizon, linear, or nonlinear form and data period effect the performance of ANN (Hill, O’Connor, and Remus [47]. According to the data period (annual, quarterly, monthly), functional form of the series (linear, nonlinear), forecasting horizon, and number of observations are the factors that affect the ANN performance. Based on the available literature, the ANN application in MSCI index for prediction of capital structure are least available. Most of the studies in this area have used a traditional regression model with one AI model either SVR or ANN. This study contributes with an objective to identify the best-fit model out of the three models namely ANN, SVR, and LR. Predictive performance of the capital structure is thereby assessed using the best model.

The paper is organized as follows: Section 2 presents review of the literature, Section 3 presents the data sources, variables, and methodology adopted. Sections 4 and 5 present the results of comparative study of the different models applied to rationalize the observed values. Section 5 includes the summary and conclusions.

3. Data and Methodology

Dataset consists of 46 units of observations that comprise of 15 financial ratios taken from quarterly financial statements during the period Q1-2010 to Q2-2021. Table 1 represents the details of the companies and the type of industry taken along with the market capitalization represented in their respective currencies. Nine companies belong to the top ten market capitalization within their stock markets and are listed in the MSCI Emerging Index. Equity index of Brazil, China and Saudi Arabia are taken from database of S&P Capital IQ.

Table 2 represents the details of the variables taken for the study of financial ratios to build our models. TDEQ represents the capital structure, and all the other ratios are taken with their respective formulae under description that can serve as reference to the readers.

(PetroChina) is China’s biggest oil and gas producer and distributor and one of the largest oil and gas companies in the world. China Petroleum & Chemical Corporation is a vertically integrated energy and chemical company that is engaged in oil and gas exploration and production. China Tourism Group operates as a tourism company and provides tourism, hotel, scenic, passenger transport, and other travel services [50].

Saudi Telecom Company offers landline and fixed infrastructure, mobile and data services. Saudi electric company provides on the generation, transmission and distribution of electric power. Saudi Arabian Mining Company is responsible for all the mining activities.

Support vector regression and linear regression are widely used to predict. In our study, we give emphasis to build an ANN-based model.

ANNs are based on the principle of neuron connecting and sending signals with an attempt to simulate the learning process of the human brain by using complex algorithms. Every connection has a weight attached either with a positive or negative value associated. Positive weights activate the neuron while negative weights inhibit it. Generally setting up of ANN is six-step procedures. The data input the financial ratios used are defined and presented to the ANN as a pattern indicated with the desired outcome or target. Data are categorized as 80% of it as the training set or validation and 20% as the testing set from the sample data taken. The validation set used will assist in predictive ability and recommends when to stop the training of the ANN. As neural network consists of an input layer, an output layer, and one more intervening layers also known as hidden layers. These hidden layers capture the nonlinear relationship between variables. Each layer consists of multiple neurons that are connected to neuron in adjacent layers. With these neurons in multiple layers, these networks can capture complex phenomenon. This process of training set helps in computation of output with minimizing errors in forecast due to iterations. The algorithms used will adjust the model parameters connecting weights and nodes. The output of each processing unit is put forward through each layer if the network (Liu, Kuo and Sastri, 1995).

In this study, the main variable for analysis is Capital Structure (CS) measured as the proportion of Total Debt against Total Equity is considered to be as dependent variable. Independent variables are categorized as profitability, liquidity, solvency, and turnover ratios. These ratios are considered to be the drivers of capital structure (15 ratios are taken).

Figure 1 represents the methodology and describes the entire approach followed to assess the three models. The data are split into 80% and 20% as stated before and are normalized. After the normalization, the data are fed into three models: SVR, LR, and ANN. After the models are trained on the 80% data, remaining 20% data are used to evaluate the models The root mean squared error (RMSE) and mean absolute error (MAE) are the metric errors approach to compare performance among models. Estimations were performed using R version 4.0.5.

The following section is used to set a detailed context on normalization, error metrics using ANN model as an example.

The normalization step helps to make sure that large values do not overcome features with small values. In that sense, the original financial ratio matrix is transformed to a matrix of range (0, 1). The following expression represents the normalized value:

After normalization, data are split into training (80%) and testing (20%) data. Different ANN structures were estimated based on number of hidden layers and number of nodes. The Logistic (sigmoid) and Hyperbolic Tangent (tanh) activation functions were applied as transfer functions from the weighted-sum inputs to determine the output.

Figure 2 represents a general model of an ANN structure with 3 inputs (X1, X2, and X3), one hidden layer, one node, and the output (Y); where f represents the activation function.

The following expression shows the logistic function:

However, the tanh function is shown in the following expression:

The performance comparison among algorithms is analyzed through the root mean squared error (RMSE) and mean absolute error (MAE) metrics. Expression (4) shows the RMSE metric.

And, the following expression shows the MAE metric:

4. Comparison of Models

The results obtained using different models are compared in this section using the error metrics defined in the previous section. The three models are now predicted for different regions and their RMSE, RSE, RAE, and MAE are evaluated. For ANN models, different combinations of layers and nodes are experimented along two functions. Hence, the discussion of results is made on two folds, first to discuss on the best suitable structure of ANN model and then compare it with rest of the two models.

4.1. Brazil

Tables 35 reveal the comparison of different applications and the metric errors of the three algorithm approaches used in forecasting the capital structure. The ANN results are based on the best algorithm structure with output that had the least RMSE (refer Table 6).

As per the ANN structures shown in Figure 3 for the Brazilian companies taken from equity index BOVESPA Vale, Petroleo Brasileiro, and Weg are shown with neural network structures.

As per algorithm, SVR has outperformed in Weg with RMSE value of 0.0464, while Vale and Petroleo Braileiro had LR with 0.0610 and LR of 0.0486, respectively.

4.2. China

With reference to Figure 4 the algorithmic metrics are validated; in Petro China ANN performed better with 0.0426 and SVR and LR were similar with 0.0759 and LR with 0.0758. China petroleum and chemical works well with LR with 0.0708 as its error. SVR (0.1117) outperformed against the ANN and LR in China Tourism.

Based on the ANN structures through activation functions of logistics and tanh for all the layers and all nodes, the best model fit is tanh function indicates that ANN can predict better in nonlinear values.

4.3. Saudi Arabia

The ANN structures algorithm metrics indicate 0.0328 for LR in Saudi Telecom as per RMSE followed by ANN and SVR with 0.0807 and 0.1274, respectively, as reflected in Figure 5. In Saudi Electricity, SVR 0.0618 is considered as best-fit followed by ANN with 01198 and LR with 0.1267. For Saudi Arabian Mining sector, LR model was best with 0.0283 compared to 0.0678 and 0.0705 for ANN and SVR, respectively.

Based on the ANN structures through activation functions of logistics and tanh for all the layers and all nodes, the best model fit indicates ANN can predict better in nonlinear values.

5. Results and Discussion

Empirical results of ANN structures are validated in the tables in terms of Algorithm metric errors using recognized statistical tools RMSE and MAE. Results indicate the capital structure prediction performance based on sole ANN structure. All ANN structures fitted well as one layer-one node structure with the same activation function and assists to transfer the inputs combination to the capital structure forecast output, thus considering ANN as an ideal promoting tool.

Equity index of Brazil, such as Vale, Petroleo, and Weg, were well fitted by the tanh function, which revealed best ANN network performance than logistic function.

When activation functions were applied for industry wise comparison in oil and gas companies of China, both logistic and tanh function performed well indicating that ANN is an ideal model for predictive performance.

Capital structure pattern of all three oil and gas companies, the Petroleo Brasileiro (Brazil) and Petroleum Chemical (China) and Petro China showed an average capital structure ratio, Petroleo Brasileiro and Petrochemical China were fitted by the logistic and tanh functions, respectively. The Saudi Arabian companies showed the best ANN structure based on the same activation function. Their average capital structures ratios were 24.55% (Telecom), 16.2% (Electricity), and 21.63% (Mining). It is concluded that when companies taken for the study are from same country then ideally same activation function could be fitted irrespective of their size of capital structure.

In Brazil and China, the sample consists of other industry group hence same activation function will not be applicable. The comparative analysis applied among the ANN estimated structures revealed that common features as those of country, industry, or capital structure level that could justify a sole activation function in forecasting the capital structure of companies belonging to emerging stock markets. Thus, based on the research work, it can be concluded that for the nonlinear data, ANN models are considered to be the best as compared to the other models applied in the study. In the sample taken for the study, LR models were identified as the best model for the linear data. When the data are nonlinear, ANN predictions were the most suitable models. Hence, common features as those of country, industry, or capital structure level may also indicate similar results.

6. Conclusions

With the application of algorithm, ANN model is considered to be the best-fit model in predictive performance as revealed and validate through low RMSE. The activation function of both logistic and tanh indicates low RMSE and support ANN structures. LR and SVR outperformed for the linear data. This research work supports the earlier studies that consider ANN network as superior in comparison with another model.

This study emphasized on the prediction of capital structure through input variables as financial ratios and algorithm error metrics applied in ANN, SVR, and LR models. The study was limited for predictive performance only. Although financial ratios were used as inputs, the relationship among the ratios is not measured and is not included. Authors plan to extend this work to evaluate and predict the capital structure in future research work with individual applications of ratios in terms of profitability, liquidity, solvency, and turnover ratios .

Data Availability

Dataset consist of 46 units of observations that comprise 15 financial ratios taken from quarterly financial statements during the period Q1-2010 to Q2-2021 for the research, and they will be provided upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.