Abstract

An estimation methodology is presented to determine the breast tumor parameters using the surface temperature profile that may be obtained by infrared thermography. The estimation methodology involves evolutionary algorithms using artificial neural network (ANN) and genetic algorithm (GA). The ANN is used to map the relationship of tumor parameters (depth, size, and heat generation) to the temperature profile over the idealized breast model. The relationship obtained from ANN is compared to that obtained by finite element software. Results from ANN training/testing were in good agreement with those obtained from finite element model. After ANN validation, GA is used to estimate tumor parameters by minimizing a fitness function involving comparing the temperature profiles from simulated or clinical data to those obtained by ANN. Results show that it is possible to determine the depth, diameter, and heat generation rate from the surface temperature data (with 5% random noise) with good accuracy for the 2D model. With 10% noise, the accuracy of estimation deteriorates for deep-seated tumors with low heat generation. In order to further develop this methodology for use in a clinical scenario, several aspects such as 3D breast geometry and the effects of nonuniform cooling should be considered in future investigations.

1. Introduction

Breast cancer, which is common among women, is an international concern. There are about 200 000 cases a year, and it is estimated that there are more than 1 million women with undetected breast cancer. Among various techniques to detect breast cancer, infrared (IR) imaging has been widely used since the late 1950’s. In general, the temperature of the human body on the surface of the skin depends on the metabolic activity, the blood flow, and the temperature of the surroundings. Any abnormality in the tissue, such as the presence of a tumor, alters the normal temperature on the skin surface due to increased metabolic activity of the tumor. Therefore, abnormal skin temperature profiles are an indication of diseases such as mastitis, benign tumors, fibrocystic breast disease, and cancer. Lawson [1] was the first to propose the use of thermographic detection of breast cancer, when he observed that the local temperatures of the skin over a tumor were significantly higher (about 2-3 degrees) than normal skin temperatures. Lawson and Chughtai [2] established that the regional temperature difference over an embedded tumor was due to convection effects associated with increased blood perfusion, and increased metabolism around the tumor.

Thermography, also known as thermal or infrared imaging, is a procedure to determine if an abnormality is present in the breast tissue surface temperature distribution. This abnormality in temperature distribution might indicate the presence of an embedded tumor. Amalu [3] and Keyserlingk et al. [4] have reviewed the role of infrared thermography with respect to the role of IR imaging in the detection of breast cancer and risk indicator. They [3, 4] concluded that the IR imaging has a significant role to play in breast cancer screening due to its ability to image metabolic aspects of the breast and along with sensitivity, specificity, and prognostics. Further aspects of IR imaging techniques of breast cancer and detection methods from infrared images are described in detail and compiled in a recent book by Diakides and Bronzino [5].

In order to complement the IR imaging techniques of breast tumor detection, models have been proposed to estimate the temperature distributions over breast with and without tumors. For example, Sudarshan et al. [6] studied the temperature distributions in breast cancer by considering a two-dimensional model first proposed by Romrel and Bland [7]. In this model, Pennes bioheat equation was used to describe the flow of thermal energy. The breast model consisted of a subcutaneous fat layer, followed by a gland layer and a deep muscle layer adjacent to the thoracic wall. Using this model, the effects of the tumor size, depth, and the blood perfusion rate on the surface temperature magnitude and profile were determined. Evolutionary algorithms based on neural networks and genetic algorithms have been proposed to detect abnormalities in several applications (such as ECG, tomography, ultrasound); see, for example, Diakides and Bronzino [5]. Recently, Mital and Scott [8] developed a method to determine parameters of an embedded heat source representing a tumor using infrared imaging. It appears that based on literature there are no methods available to estimate the tumor parameters based on the temperature distribution obtained from IR imaging techniques.

The objective of this study is the development of a methodology to estimate the depth, size, and heat generation rate of an embedded tumor in the breast, based on temperature profile on the surface that may be obtained by infrared thermography. This study differs from other works related to breast cancer in that inverse methods are used in the estimation methodology. The methodology is demonstrated with a simple model in order to illustrate the details of the procedures involved. Several cases of various embedded tumor conditions were presented to illustrate the generality of the method.

2. Estimation Methodology and Procedure

The estimation methodology involves evolutionary algorithms using artificial neural network (ANN) and genetic algorithm (GA) as shown in Figure 1. The ANN is used to map the relationship of tumor depth, tumor size, and the heat generation to the temperature profile over the idealized breast model. The relationship obtained from ANN is compared and validated to that obtained by standard numerical model-based finite element analysis software (ANSYS). GA is used to estimate the tumor parameters (depth, size, and heat generation) by minimizing a fitness function involving the temperature profiles obtained from simulated or clinical data to those obtained by ANN. The fitness function is given as a sum of the squares’ function that relates the data from the observed thermographic temperature profile to the temperature profile for a given set of estimated parameters. It is defined as

where Y and T are vectors containing the observed and estimated temperatures, respectively, and superscript T denotes the transpose of the vector. The observed temperature vector Y contains surface temperatures that would be obtained experimentally from thermal imaging. The estimated temperatures contained in vector T are obtained by the outputs of the trained neural network model using estimates for the unknown parameters (from the population pool of the GA). The vector containing the estimates for unknown parameters is denoted by . In this study, the unknown parameter vector contains the depth, diameter, and the heat generation rate of the source. Once the objective function was formulated, the estimation scheme using GA was used to determine the unknown parameters.

3. Artificial Neural Network Model to Map Breast Temperature Profiles

A multilayer feed-forward ANN with backpropagation learning algorithm is developed to represent the surface temperature profile of the breast as a function of depth, diameter, and heat generation rate of the tumor. The network is developed with the MATLAB Neural Network Toolbox using the “newff” function. The learning rate was set reasonably low at 0.05 to ensure convergence of the algorithm. There are many transfer functions available in MATLAB software. After some experimentation, the “logsig” transfer function was chosen for the hidden layers and the output neurons, due to nature of the desired outputs. A schematic of the network is shown in Figure 2. The network has 3 input nodes for the three parameters, and 31 output nodes that define one half of the symmetrical temperature profile on the skin surface. There are 2 hidden layers in the network, the first with 21 nodes, and the second with 19 nodes. This configuration was reached after a few iterations of a single hidden layer network proved to give less than adequate results. Two layer deep-nested loops were used in an exhaustive search that varied the number of neurons in the first and second hidden layers, and found the appropriate combination (21-19) that best trained the network. Therefore, a 3-21-19-31 neural network architecture was developed and trained and tested to validate the model. More details about the neural networks and their concepts can be found in Russell and Norvig [9].

In this study, a simplified 2D model of the cross-section of the breast similar to that used by Sudershan et al. [6] was considered, as shown in Figure 3. The Pennes bioheat equation has to be solved to estimate the temperature distribution over the breast model. Due to the complexity of the breast geometry, there are no known analytical solutions to the bioheat equation. Therefore, solutions are usually obtained using computational techniques such as the finite element method. A finite element model created using a commercially available finite element package, ANSYS, solves the Pennes bioheat equation to find the surface temperatures, and this information is used in training/validating ANN model. In this study, the breast is assumed to be of 0.072 m radius hemispherical shape, with a subcutaneous fat layer of 0.005 m uniform thickness, followed by gland, muscle, and the thoracic wall. The tumor is situated symmetrically in the gland layer and assumed to be circular in shape for simplicity. Note that the entire breast is modeled even though the 2D geometry is symmetrical about the Y axis. This is done for future studies where an offcenter tumor location may be considered. The heat transfer in each layer is governed by (1). The values of thermal conductivity, metabolic heat generation rate, and the perfusion source term for each layer, adopted from Werner and Buse [10], are presented in Table 1. The thoracic wall is assumed to be maintained at a constant core body temperature of 37°C. Based on estimate by Osman and Afify [11], the surface of the breast is subjected to a combined heat transfer coefficient (convection, radiation, and evaporation) of 13.5 -K, a reasonable value in the absence of forced convection. The clinical environment under which the thermographic measurements are made is assumed to be at a constant temperature of 21°C.

Using the finite element analysis of the breast model, a total of 17 different cases were analyzed and used for training the ANN. The inputs parameters are scaled between 0 and 1 using upper and lower bounds for each parameter. The diameter of the tumor is allowed to vary between 0.005 and 0.015 m so that it is in mammary gland region. The bounds for depth are 0.014 and 0.041 m (based on the assumption that the tumor will most likely be present in the gland). The lower and upper bounds on the heat generation rate of the tumor are 20 times (14,000 W/m3) and 100 times (70,000 W/m3) that of normal tissue. The output surface temperatures of the network are scaled values between 0 and 1, using a lower limit of 26°C and upper limit of 33°C. The network was trained for 400 epochs or training cycles using the data generated from finite element analysis results. Three different cases were used for validating the trained network.

4. Genetic Algorithms in Breast Tumor Parameters Estimation

In the implementation of the GA, the potential solution (chromosome) contains the depth, diameter, and the tumor heat generation rate as the 3 genes and 10 bits are used to represent the value of each gene in binary format. Therefore, each chromosome has 30 bits. The population size has been chosen to be 20. The initial population pool was generated using a random number generator.

The parameters contained in each chromosome are changed from their binary representation to floating point representation using (for the ith parameter) where Pi is the decimal value of ith parameter, a and b are the lower and upper limits of the search interval, respectively, nb is the number of bits used to represent the parameter (e.g., 10 in this case), and m is the decimal value of the parameter in binary form. The lower limit a and upper limit b for each of the three parameters were mentioned earlier.

The fitness of each chromosome is evaluated using the sum of the squares objective function given by (2). The fitness f of a chromosome is related to the sum of the squares function using where was arbitrarily chosen to be 10 so it always stays greater than the maximum value of sum of squares possible, and S is the sum of the squares generated using a parameter set (chromosome) .

The fitness is determined for each parameter set (chromosome) and then ranked. Chromosomes of higher fitness are more likely to be selected to reproduce and contribute their genetic material to the next generation. A probability of mutation (random changing of a chromosome) of 0.10 and a probability of crossover (formation of children) equal to 0.90 are used to determine how members of the population will reproduce to bring forth the next generation. A flowchart of the solution method is presented in Figure 4. More details about GA procedures and applications can be found in Goldberg [12] and Davis [13].

Several cases were simulated in order to validate the ability of the estimation procedure to accurately predict the depth, size, and heat generation rate of the embedded tumor. For each parameter set, two runs of the GA were conducted by introducing 5% and 10% noises in the simulation data to see the sensitivity of the prediction of breast tumor parameters.

5. Results and Discussion

A convergence plot of the mean squared error of the neural network output during training is shown in Figure 5. In general, the results obtained from ANN training were in good agreement with those obtained from finite element analysis model. After the ANN is trained, the network is validated for three different cases as shown in Figure 6. It can be seen from Figure 6 that, for each validation case, the output of the neural network model shows good agreement with the simulation results with values of 0.936, 0.982, and 0.988, respectively. To further validate the ANN results, the temperature profiles obtained for a specific case 3 (depth of 0.029 m, diameter of 0.008 m, and heat generation of 30000 W/m3) are compared with the finite element and ANN models as shown in Figure 7. Once again a good comparison is seen.

The parameter estimates for the depth, diameter, and heat generation rate of the tumor for the four cases, corresponding to 5% and 10% random noise in the temperature rise on the skin surface, are presented in Tables 2 and 3, respectively. MATLAB built in function “rand” was used for generating random noise in the temperature rise on the skin surface. For each case, the GA was run twice with different random initial populations yielding similar results. With 5% random noise, the results show good agreement between actual and predicted parameters. In this case, the absolute errors in depth and size were within 0.001 m, and absolute errors in heat generation rate were within 3000 W/m3. With 10% noise, the accuracy of the estimated parameters deteriorates for deep seated tumors with low heat generation rate. In this case, the absolute errors in depth were within 0.005 m, error in size within 0.002 m, and in heat generation rate within 4870 W/m3.

6. Conclusions

An estimation methodology-based evolutionary algorithm using neural networks and genetic algorithms was developed to estimate the breast tumor parameters based on surface temperature profile that may be obtained by infrared thermography. The methodology is demonstrated with a simple model in order to illustrate the details of the procedures involved. For the cases studied, results from simulations show that it is possible to determine the depth, diameter, and heat generation rate from the surface temperature data (with 5% random noise) with good accuracy for the simplified 2D model. With 10% noise, the accuracy of the estimated parameters deteriorates for deep-seated tumors with low heat generation rate. With the recent advances in the accuracy of IR images and noise reduction techniques, it may be possible to reduce noise to within acceptable levels (5%) for the methodology to be effective. In general, the breast shape is complex, and this should be considered for realistic studies. However, the estimation methodology is general and can be applied even to realistic breast geometry. This may increase the computation time but the methodology can be easily extended. In order to further develop the proposed methodology so that it can be used in a clinical scenario, several aspects such as complexities of 3D breast geometry and the effects of nonuniform cooling should be considered in future investigations.