1 Introduction

Hardfacing is applying filler metal onto a surface, edge, or a point of base-metal of different compositions. As stated by Ramesh et al. [1], in most cases, the deposited metal is harder than the substrate, resulting in the name “hardfacing.” As discussed by Chaidemenopoulos et al. [2], weld feed material composition for hardfacing purposes usually contains a high content of carbon and carbide forming elements. According to Bahoosh et al. [3], the concept increases hardness, improves tribological properties, and extends components service life in many engineering applications. Pawar et al. [4] also mentioned that the hardfacing process could improve surface wear resistance or even restore worn-out surfaces. However, as Ahn [5] discussed, hardfacing deposited by the welding process can result in considerable heat-affected zones and susceptibility to crack due to residual stresses imposed by the welding. As presented by [6], low-carbon steels are usually used as a substrate, which reduces the costs and supplies considerable amounts of iron during the welding, which can compromise the desired dilution. Therefore, the two main challenges involved in hardfacing deposition are obtaining crack-free overlay and low dilution.

Many different welding technologies can be implemented in weld surfacing processes. Among these various techniques, D’Oliveira et al. [7] stated that plasma transferred arc welding (PTAW) and gas tungsten arc welding (GTAW) processes are the most widely used in the industry. They provide some advantages, including high accuracy, superior quality weld, and low equipment cost. As observed by [8], another great advantage of these processes is their capability to produce weld beads with low dilution levels, desirable in the hardfacing layer. To get a suitable balance between the designed microstructure and the welded bead’s obtained integrity, dilution adjustment is crucial. Zhang et al. [9] deposited hardfacing using a fusion technique obtained by the arc between the tungsten electrode and the additional metal, successfully obtaining a coating with low dilution. Another successful technique for obtaining low dilution was tried by Silwal et al. [10] using hot-wire with resistive arc heat to preheat the wire. Wang et al. [11] presented an innovation in the process of depositing hot-wire GTAW coatings where a secondary bypass arc energy was used to heat the wire. In addition, a promising wire oscillation technique that is being developed by Silva et al. [12] to minimize welding defects can result in significant effects on dilution control. According to Lemke et al. [13], controlling the dilution and composition of the obtained final bead is crucial to ensure desired superficial properties. As observed by Balasubramanian et al. [14], when the GTAW welding process is used in metal deposition, the selection of the proper welding input parameters will define the quality and mechanical, chemical, and also geometric properties of the obtained weld bead. Jahanzaib et al. [15] stated that the welding current is an important parameter that most affects the bead geometry responses, and Giridharan and Murugan [16] added that the welding speed is also important, but with an inverse effect on the geometry of the bead. Traditionally, the input parameters of processing are selected based on practical experience and welding engineer experts. Pujari et al. [17] stated that one of the leading welding engineering problems is to use mathematical models that can simulate welding processes and identify the combination of input parameters capable of producing the weld beads with the best possible quality keeping good productivity. Nagesh and Datta [18] used mathematical modeling and design experiments to simulate and predict the welding process results and have proved its efficiency. As discussed by Zheng et al. [19], in industry 4.0, techniques for processing large datasets using artificial intelligence and the integration of artificial intelligence algorithms into automated production are becoming increasingly important. Abubakr et al. [20] also highlighted the importance of artificial intelligence techniques to optimize the manufacturing process, improve components quality, and achieve more sustainable processes.

In this trend, as reported by Fhale et al. [21], many authors have implemented the backpropagation Artificial neural network (ANN) and other artificial intelligence methods to simulate many different manufacturing processes, including welding. Muthu Krishnan et al. [22] used an ANN model to develop regression equations relating to response characteristics and process input parameters for friction stir welding. Vangalapati et al. [23] also simulated friction stir welding using an ANN model. Chang et al. [24] developed a backpropagation ANN model for predicting the penetration morphology of asymmetrical fillet welds and used a mind evolutionary algorithm to optimize the model. Joseph and Muthukumaran [25] used a genetic algorithm and simulated annealing technique to determine the optimal process parameters for activated tungsten inert gas. Sudhakaran and Sakthivel [26] developed neural network models for predicting bead parameters in the GTAW process, such as depth of penetration, bead width, and depth to width ratio. Ghanty et al. [27] also succeeded in using a backpropagation ANN to predict weld bead geometry confirming the method’s efficiency. Genetic algorithm (GA) is attracting many researchers’ attention to solve optimization problems since traditional optimization methods frequently end up in local minima. GA is beneficial in optimizing multi-objective problems, making it suitable to solve optimization problems related to the manufacturing process that involves many output parameters that must be controlled. Lei et al. [28] applied a GA to optimize the ANN used to determine the weld geometry during the thin-plate laser welding of Ti6Al4V. Dey et al. [29] used a GA combined with response surface methodology to minimize weld bead width and height and maximize the weld penetration in electron beam welding, producing a smaller bead without affecting the quality. Multi-objective optimization involves more than one function to be optimized simultaneously. Saha and Mondal [30] investigated the hardfacing process using multi-objective optimization. These authors processed a high volume fraction of metal carbides, metal borides, and boro-carbides finely distributed in a ferritic matrix, designed for wear-resistant components. They found optimum variables for hardfacing layers deposited by manual metal arc welding (MMAW). In the same sense, the multi-objective approach can be found for the GTAW process. For example, Prakash et al. [31] used hybrid optimization algorithms to deposition Inconel 625 alloy onto AISI-304 and 4140 low-alloy steel sheets. Rodriguez et al. [32] investigated the microstructural balance between ferrite and austenite in dissimilar joints of duplex stainless steel through a genetic algorithm. Both cases are aimed at the oil and gas field. Correia et al. [33] reported that GA’s capability found optimum welding parameters even when the process’s physical model is not available. Although many researchers have been studying the prediction of weld bead geometry and the optimization of GTAW, the literature remains with a lack in the multi-objective optimization of the GTAW hardfacing process. The present investigation used a five-factor, five-level, central composite design (CCD) matrix to run GTAW experiments that were used to train a backpropagation ANN. Five different welding current levels, welding speed, nozzle standoff distance, travel angle, and wire-feed pulse frequency were used in welding experiments. The aim of this work is to use the ANN model coupled to a GA to identify the welding parameters that can produce the optimum weld bead. A final experiment was performed to validate the results of the simulated optimum parameters.

2 Materials and methods

2.1 Welding experiments

Rectangular samples of AISI 1020 carbon steel with the dimensions of 150 mm × 200 mm × 6 mm (length × width × thickness) were used as substrate. The samples were cleaned and preheated to a temperature of 150° C before welding. The chemical composition of AISI 1020 carbon steel is given in Table 1. Weld beads were deposited by the mechanized GTAW process using a Lion 300 welding machine with Argon as shielding gas (at 15L/min). The non-consumable tungsten electrode was a 3.2-mm diameter and 1.5% Lanthanum, installed with direct current in negative polarity. The welding machine was equipped with two wire feeders containing tubular wires with different chemical compositions, as provided in Table 1. Both wires were simultaneously added to the melt pool with a constant wire feed of 2 m/min for all experiments. The experimental setup used in the current work, and the welding torch and wire feeders, is shown in Fig. 1.

Table 1 Chemical composition of the AISI 1020 carbon steel and wires
Fig. 1
figure 1

Experimental setup: a Welding machine; b welding electrode with other accessories; and c scheme of the welding torch and wire feeders

The following parameters—welding current, welding speed, nozzle standoff distance, travel angle, and wire feed pulse frequency—were changed as per a five-factor, five-level, central composite design (CCD) matrix. Six replicas were made to each set of parameters. Table 2 exhibits the tested GTAW input parameters and their associated levels.

Table 2 GTAW process parameters

2.2 Weld bead quality evaluation

After the weld bead deposition, samples were cut transversely by electrical discharge machine (EDM) and sanded and polished. Further, images used in the analysis of the width (W), reinforcement (R), penetration (P), and dilution of the bead deposited, which is calculated by the division of penetration cross-sectional area (A) by the total cross-sectional area (A+B), according to Fig. 2a. The images were obtained by the Stemmi-2000 stereoscope and analyzed using the Image-Pro Plus software (Fig. 2b). The experimental data with observed response values and their respective standard deviations are presented in Table 3.

Fig. 2
figure 2

Deposited weld bead: a Scheme of the geometry and dilution of the bead: W, width; R, reinforcement; P, penetration; A, melted substrate cross-sectional area; B, deposited filler cross-sectional area; b example of an image used to evaluate the dilution

Table 3 GTAW parameters with the experimental design and their results

2.3 Development and modeling of artificial neural network (ANN)

The ANN model was developed based on experimental data (Table 3). All input data (welding current, welding speed, nozzle standoff distance, travel angle, and wire feed pulse frequency) were normalized to implement ANN, ranging from 0 to 1 as per the following equation (Equation 1):

$$ {X}_{iN}=\frac{\left({X}_i-{X}_{\mathrm{min}}\right)}{\left({X}_{\mathrm{max}}-{X}_{\mathrm{min}}\right)} $$
(1)

Where XiN, Xi, Xmin, and Xmax are the values of normalized, measured, minimum, and maximum data for each input parameter, respectively. A backpropagation algorithm was used to train the ANN, and a non-linear sigmoid activation function was used. The topology of the used backpropagation neural network consists of five layers: the input layer, three hidden layers, and the output layer. The input layer corresponds to welding parameters having one neuron for each input parameter (welding current, welding speed, nozzle standoff distance, travel angle, and wire feed pulse frequency). The output layer presents four neurons corresponding to output data (width, reinforcement, penetration, dilution). Hidden layer topology was determined by an optimization process, which looked for the topology that provided a higher determination factor value (R2). This action is considering that the number of hidden layers and neurons plays a vital role in developing an ANN model, and an inadequate architecture can cause overfitting or underfitting of data. Lower root mean square error (RMSE) for all output data when comparing measured and predicted results. The best-identified topology presents three hidden layers having 23, 21, and 25 neurons, respectively. Each neuron of the input layer is connected to each neuron of the first hidden layer. All the following layers followed the same. Figure 3 depicts the topology of ANN according to the studied welding case model. In this study, data set was randomly divided into three sets: training, validation, and testing. The advantage of splitting data into three parts instead of two is to use validation to check when no overfitting in the ANN modeling, avoiding it to produce inferior results when data that were not part of the initial data set is used. The training step used 80% of the experiment data. After training, 10% of experimental data were used to validate ANN, and the rest 10% was used to test ANN.

Fig. 3
figure 3

Artificial neural network architecture

2.4 Development of genetic algorithm (GA) optimization

After fitting an appropriate ANN model, a genetic algorithm was adopted to optimize the GTAW process, identifying welding parameters that can improve the deposited weld bead’s characteristics, which are represented by an objective function. A binary approach was used for GA implementation; input data were coded in a 20-digit binary code chromosome, as illustrated in Fig. 4.

Fig. 4
figure 4

20-digit chromosome used for binary encoding

The adopted procedure in which an artificial neural network was coupled with a GA is shown in Fig. 5. The gray color can identify the coupling step. For each GA generation, the ANN was used to simulate the welding output data. The necessary steps shown in Fig. 5 are as follows:

  1. Step 1

    Create a first-generation chromosome randomly: First-generation chromosomes are randomly created. Twenty individuals are created as the first parents.

  2. Step 2

    Run ANN prediction for nth GA generation: In this step, the binary number of each input parameter is converted to a decimal number (0 to 15 for all input parameters) and then normalized, ranging from 0 to 1 and used as ANN inputs.

  3. Step 3

    Calculate multi-objective function for all populations: After running the ANN, the simulated responses are used to calculate an objective function. This objective function is determined by Equation 2 in which all output parameters were normalized, ranging from 0 to 1:

Fig. 5
figure 5

ANN and GA procedure

$$ OF=\frac{W-{W}_{\mathrm{min}}}{W_{\mathrm{max}}-{W}_{\mathrm{min}}}+\frac{R-{R}_{\mathrm{min}}}{R_{\mathrm{max}}-{R}_{\mathrm{min}}}-\frac{P-{P}_{\mathrm{min}}}{P_{\mathrm{max}}-{P}_{\mathrm{min}}}-\frac{Di-{Di}_{\mathrm{min}}}{Di_{\mathrm{max}}-{Di}_{\mathrm{min}}}+\left(\frac{\frac{W-{W}_{\mathrm{min}}}{W_{\mathrm{max}}-{W}_{\mathrm{min}}}}{\frac{R-{R}_{\mathrm{min}}}{R_{\mathrm{max}}-{R}_{\mathrm{min}}}}\right) $$
(2)

where OF is the objective function and W, R, P, and Di correspond to width, reinforcement, penetration, and dilution, respectively. The min and max index referred to the minimum and maximum values for each output and were used for normalization purposes. The objective function is the mathematical representation of the desired geometry of the weld bead, minimizing width, reinforcement, and their quotient at the same time that penetration and dilution are maximized. It is important to highlight that it consists in a multiobjective optimization and result will present a geometry that will combine and balance all these requirements. At the end of this step, the objective function value is compared with the previous generation value, and if it remains the same for 30 generations, the GA is stopped. The calculated value is determined as the optimum value. This procedure aims to ensure that the whole search space was examined and the global optimum was identified. On the other hand, if the objective function values decrease, the GA move to step 4.

  1. Step 4

    Select next-generation parents (elitism and tournament): In the selection step, 20 individuals are selected to be the next generation’s parents. In the first generation, all the 20 individuals are randomly created and parents, but 20 individuals are selected from the 40 available (previous and new generation) in the following generations. Elitism is responsible for the selection of 10% of next-generation parents. In this procedure, all individuals are ranked by the objective function (low to high). The twenty (10% of the total of 20 parents of next generation) individuals with the lowest objective function value are selected. The other 18 (90% of the total of 20 parents of next-generation) parents are selected using tournament selection described by [34]. The tournament selection consists of selecting randomly 5 individuals and choose the best (lower objective function value) to be a parent of the next generation. Bhunia and Samanta [35] stated that tournament selection is the most effective selection method for minimization problems, and it is essential to find the global solution, avoiding local minimums as mentioned by Kılıç and Yüzgeç [36].

  2. Step 5

    Reproduction (crossover and mutation): The crossover was used as the main reproduction procedure. In the crossover, two parents are randomly selected, and their chromosomes are split at a random point and then combined to produce two new individuals. This step is repeated ten times using twenty parents. After finishing the crossover, 20% of the generated individuals are submitted to mutation, which consists of changing the value of one digit of their chromosomes. Finally, the new individuals are submitted to step 2, and the GA continues until the stop criterion is reached. After identifying the optimum welding parameters, a new set of experiments was conducted to validate the optimized welding condition’s predicted outputs.

3 Results and discussion

3.1 Influence of GTAW parameters on the quality of weld bead

Figure 6 presents the effects of GTAW input parameters (welding current, welding speed, nozzle standoff distance, travel angle, and wire feed pulse frequency) on the deposited weld bead geometry (width, reinforcement, penetration, dilution). The graphs were constructed in Minitab17® and consist in the experimental data plotting joined by an interpolation function intending to evaluate the effects of GTAW variables on the measured values.

Fig. 6
figure 6

Main effects plot of GTAW input parameters

Higher welding currents increase heat input and the welding pool and substrate temperature, resulting in more deposited material fluidity. It is possible to note that increments in welding current increased width, penetration, and dilution. The reinforcement, however, decreased when higher welding currents were used. These characteristics can be observed in Fig. 7, which shows the cross-section of a sample welded in condition 18 of Table 3. Condition 18 used the higher tested welding current (270 A), resulting in a low-quality weld bead with high dilution. Shanmugam and Murugan [37] reported similar results.

Fig. 7
figure 7

Comparison of cross-sectional areas, illustrating the weld bead of tested conditions 18, 20, and 22

On the other hand, as Balasubramanian et al. [14] reported, welding speed increments reduce heat input generating the opposite effects. Tarng and Yang [38] observed that higher welding speed values resulted in lower response values for all outputs. Figure 7 illustrates the cross-section of a sample welded in condition 20 of Table 3 in which the higher welding speed was used. In these conditions, the obtained dilution is too low, leading to decreased ductility of the overlay near the fusion line resulting from the increasing carbide content [39]. The lowest heat input was not enough to melt all the feed wire, promoting regions rich in iron from the wire structure. The lack of melting can also result in voids formation [40], which can promote cracks when the overlayer is subject to the wear process. Modifications in nozzle standoff distance did not significantly impact reinforcement and penetration but changed width and dilution considerably. Higher values of nozzle standoff resulted in higher width and lower dilution. Figure 7 shows the cross-section of a sample welded in condition 22 of Table 3. This condition used the higher tested welding speed (25 cm/min) combined with the other input parameters generated a desired weld bead geometry. On the contrary, the lower tested welding speed (5 cm/min, condition 19, Table 3) produced a weld bead with dilution higher than 30%, which compromises its microstructure and integrity. Travel angle and wire feed pulse frequency did not influence the outputs as the other input parameters. For all the tested interval, they just caused small variations on the response values. A global comparison among Fig. 7 allows concluding that the wettability decreased from condition 18 to condition 22, being condition 20, an intermediate situation. It is possible to notice that the wettability was influenced by a combination effect of welding current and speed. Moreover, Fig. 8 presents the surface quality characteristics of different combinations of welding parameters. The results are in good agreement with Fig. 7. However, experiment no. 3 shows the poor quality characteristics as compared with all other conditions such as lack of fusion of the wire. In the experiments carried out by Han et al. [41], they also noticed the lack of fusion of the wire and its appearance on the surface of the bead. The authors tested the maximum deposition rate with a double electrode in the GTAW process. These defects were observed at high wire feed speeds where it was limited by the heat of the arc being insufficient to melt the wire, this causes the wire to hit the bottom of the fusion pool and be directed to the surface of the bead, as was also observed in Fig. 8. This analysis is currently made on single beads. However, during the processing of a hardfacing, multiple layers are interposed on top of each other. A consequence of low wettability is the possible creation of voids, giving rise to a poor adhesion among layers. Also, the resulting microstructure is less and less dependent on the dilution level because further interaction will occur among layers. Besides the bead quality, defined in the previous examples by the geometry and dilution, the resulting microstructures are crucial for abrasive wear situations where the studied hardfacing can be applied.

Fig. 8
figure 8

Surface characteristics at different welding parameters

Lack of fusion of the wire structure and microcracks can weaken the formed microstructure and impair the abrasion resistance. The resulting microstructure of condition 26 is presented in Fig. 9. The wire composition suggests a hypereutectic microstructure. It is composed of primary M7C3 carbides with a eutectic matrix (austenite plus M7C3 carbides). The SEM image and XRD diffractogram confirm this expectation. A local chemical composition performed through the energy-dispersive X-ray spectroscopy on the largest carbide showed the presence of chromium (37.8% wt.) and iron (48.2% wt.), meaning that it is a complex carbide, probably of (Fe, Cr)7C3 stoichiometry. Microcracks are observed along with the microstructure of Fig. 9. Some of them are bifurcated, but many of them are restricted to carbide areas. The nucleation of microcracks associated with the carbides, also described by Bahoosh et al. [42], is a combination of the residual stresses level imposed by the welding process and the low fracture toughness carbides [43], which is not supported by the metallic matrix.

Fig. 9
figure 9

SEM image revealing the microstructure of the welded layer of condition 26 (a). Note: microcracks associated with complex carbides. DRX pattern of condition 26 (b)

3.2 ANN validation

ANN model performance is shown in Figs. 10, 11, 12, and 13. It is possible to compare predicted and measured data for dilution, width, reinforcement, penetration, respectively. The determination coefficient (R2) and RMSE value of all response parameters are satisfactory, and the R2 of all the data remained higher than 0.65. Table 4 presents the comparison of replica experimental uncertainty and the relative errors between measured and predicted responses. It is possible to see that to all output parameters, and the ANN relative error is smaller than the experimental uncertainty, which means that the neural network model has a robust predictive performance.

Fig. 10
figure 10

ANN regression plot for width a training, b validating, c test, and d overall

Fig. 11
figure 11

ANN regression plot for reinforcement (a) training, (b) validating, (c) test, and (d) overall

Fig. 12
figure 12

ANN regression plot for penetration (a) training, (b) validating, (c) test, and (d) overall

Fig. 13
figure 13

ANN regression plot for dilution (a) training, (b) validating, (c) test, and (d) overall

Table 4 Comparison of mean replica experimental uncertainty and ANN relative errors (%)

3.3 GA implementation and optimum parameters validation

After training and verification, the ANN was coupled with the GA to identify the optimum welding parameters. The procedure illustrated in Fig. 14 shows the objective function value over all the sixty-three generations. It is possible to see that it reached its minimum after 33 generations and kept the same value for the following thirty generations, reaching the established stop criteria. The optimum welding parameters obtained after a complete evaluation of GA and their corresponding outputs are presented in Table 5. However, the optimized condition presented in Table 5 was used to obtain beads and verify the simulation’s reliability. Figure 15 shows the cross-section of a deposited bead with an optimized condition. It is possible to notice that the cord geometry was uniform with width, penetration, and adequate reinforcement, contributing to a bead with low dilution.

Fig. 14
figure 14

Objective function value convergence

Table 5 Optimum GTAW welding parameters
Fig. 15
figure 15

Cross-section of a bead deposited with an optimized condition

Table 6 exhibits a comparison between the predicted and measured outputs for the optimum welding condition. The results presented for the measured results correspond to the mean value of the 35 measures used to verify the predicted outputs. It is possible to see that the relative error is smaller than the observed experimental uncertainty for width, penetration, and dilution of the predicted results. Only the predicted results for reinforcement was beyond the expected error. However, the most relevant remark is that the observed error is similar to the experimental reinforcement standard deviations.

Table 6 Predicted and measured outputs for the optimum welding condition

4 Conclusions

In this paper, the effect of pulsed GTAW welding parameters in weld bead quality was investigated and a multi-objective optimization was used to identify the best input parameter combination. Based on experiments performed with two tubular wires with different chemical compositions deposited on an AISI 1020 steel substrate, the following conclusions can be put forward:

  • The experimental results allowed the comprehension of the influence of each input parameter in the quality of weld beads. Decreasing the welding speed and/or increasing the welding current leads to an increase in the dilution percentage compromising weld bead quality.

  • The welding current and speed are the processing parameters that most affect heat input and melting. Therefore, they were the most critical process parameters to determine the geometry and dilution of the weld beads deposited with two tubular wires by the GTAW process.

  • An artificial neural network (ANN) was used to simulate the GTAW process and then combined with a genetic algorithm (GA) to determine welding parameters that can provide an optimal weld, which can be integrated into automated production.

  • Using GA, optimal welding parameters were determined: welding current = 222 A, welding speed = 25 cm/min, nozzle stand-off distance = 8 mm, travel angle = 25 °, wire feed pulse frequency = 30 Hz.

  • The determination coefficient (R2), RMSE, and relative error values of all response parameters are satisfactory compared to experimental uncertainty, showing great adequacy to experimental data.