J Vet Sci. 2020 Mar;21(2):e25. English.
Published online Jan 17, 2020.
© 2020 The Korean Society of Veterinary Science
Original Article

A descriptive study of on-farm biosecurity and management practices during the incursion of porcine epidemic diarrhea into Canadian swine herds, 2014

Amanda M. Perri,1 Zvonimir Poljak,1 Cate Dewey,1 John CS. Harding,2 and Terri L. O'Sullivan1
    • 1Department of Population Medicine, Ontario Veterinary College, University of Guelph, Guelph, Ontario N1G 2W1, Canada.
    • 2Department of Large Animal Clinical Sciences, Western College of Veterinary Medicine, Saskatoon, Saskatchewan S7N 5A2, Canada.
Received April 11, 2019; Revised November 19, 2019; Accepted November 30, 2019.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Porcine epidemic diarrhea virus (PEDV) emerged into Canada in January 2014, primarily affecting sow herds. Subsequent epidemiological analyses suggested contaminated feed was the most likely transmission pathway. The primary objective of this study was to describe general biosecurity and management practices implemented in PEDV-positive sow herds and matched control herds at the time the virus emerged. The secondary objective was to determine if any of these general biosecurity and farm management practices were important in explaining PEDV infection status from January 22, 2014 to March 1, 2014. A case herd was defined as a swine herd with clinical signs and a positive test result for PEDV. A questionnaire was used to a gather 30-day history of herd management practices, animal movements on/off site, feed management practices, semen deliveries and biosecurity practices for case (n = 8) and control (n = 12) herds, primarily located in Ontario. Data was analyzed using descriptive statistics and random forests (RFs). Case herds were larger in size than control herds. Case herds had more animal movements and non-staff movements onto the site. Also, case herds had higher quantities of pigs delivered, feed deliveries and semen deliveries on-site. The biosecurity practices of case herds were considered more rigorous based on herd management, feed deliveries, transportation and truck driver practices than control herds. The RF model found that the most important variables for predicting herd status were related to herd size and feed management variables. Nonetheless, predictive accuracy of the final RF model was 72%.

Keywords
Swine; biosecurity; porcine epidemic diarrhea; random forests; Canada

INTRODUCTION

In May 2013, the first swine herd in the United States, located in Iowa, was diagnosed with porcine epidemic diarrhea virus (PEDV). Thirty of 50 states in the United States were affected by the virus within one year of PEDV emergence [1]. The first PEDV strain found in the United States was found to phylogenetically cluster within a subgroup containing a strain that was discovered in China in 2012 [2]. A second PEDV strain was also detected by Iowa State University and the Ohio Department of Agriculture [3]. This strain however clustered with a different Chinese strain reported from 2010 to 2012. This suggests that there may have been: 1) numerous ways the virus was introduced, or 2) a single introduction of multiple PEDV strains [4]. Transmission of PEDV between pigs on a farm occurs by direct and indirect fecal-oral routes [5]. Viral transmission between farms has been linked to animal movements containing PED-positive animals, feces contaminated fomites including trucks, barn boots, and equipment, along with the movement of people on and off farm [6, 7, 8, 9].

In January 2014, the first Canadian swine herd, located in Ontario, was diagnosed with PEDV. This case occurred approximately nine months following the initial case of PED in the United States [4]. Porcine epidemic diarrhea (PED) has a significant impact on animal health and profitability of individual farms [10], as the virus is highly contagious and is associated with mortality ranging between 80 and 100% in suckling pigs [8]. Unlike the dynamics of the virus in the United States, the emergence of the virus into Canada was thought to be from a common source. Spray-dried porcine plasma (SDPP), a feed ingredient added to nursery feed for improving growth and overall health [11] was thought to be the source of the infection. During the initial phase of the outbreak, a descriptive outbreak investigation and an experimental challenge study were undertaken to determine the likely source of infection. In a previous research study, a case-control study was designed, which utilized data from an extensive questionnaire to analyze a subset of the collected variables using logistic regression [12] and network analysis [13]. In both cases, the common source hypothesis (through feed) was confirmed [12, 13]. In addition, in the case-control study, a biosecurity score was created based on the adoption of four beneficial biosecurity practices: 1) washing and disinfecting livestock trailers prior to delivering pigs onto the site, 2) truck drivers not stepping out of the vehicle and entering the laneway on-site, 3) enforcement of a shower-in policy or bench entry was implemented on-site, and 4) requirement of staff and non-staff members to wear site specific clothing [12]. It was found that the biosecurity score was a risk factor for PED during univariable analysis, which was in agreement with previous reports claiming that PED-affected herds were the herds with a high level of biosecurity [14]. Nonetheless, the questionnaire administered as part of the case-control study contained an abundance of information regarding biosecurity and farm management practices that were in place at the time of the 2014 PED outbreak. Therefore, the primary objective of this study was to describe the general biosecurity and management practices that were in place in sow herds that contracted PEDV during the early phase of the 2014 outbreak, and to compare these herds to matched control herds. The secondary objective was to determine whether, after exclusion of the variables reflective of common source of infection, there were any general biosecurity and farm management practices that were important in explaining PEDV status. In order to address the latter objective, we used a non-parametric approach based on random forests (RFs) [15], which allowed assessment of variable importance and predictive accuracy based on cross-validation. The source of contaminated feed was excluded from the analysis of the current study, because the focus of this study was on general biosecurity practices potentially associated with the outbreaks.

MATERIALS AND METHODS

The study protocol and animal use were reviewed and approved by the University of Guelph's Animal Care (AUP# 3094) and Research Ethics Board (REB #14AP030) committees.

Herd selection

The early incursion of the PED outbreak in Canada affected swine herds predominantly located in the province of Ontario. To evaluate the early phase of the 2014 Canadian outbreak, the study period for recruitment of herds was from January 22, 2014 to March 1, 2014. During this time period, a total of 27 swine herds were positively diagnosed with PEDV from four provinces: Ontario (n = 24), Prince Edward Island (n = 1), Manitoba (n = 1) and Quebec (n = 1). Case and control herd recruitment and selection processes are described in depth in Perri et al. [12]. A herd with clinical signs that tested positive for PED using real-time reverse transcriptase polymerase chain reaction was classified as a case herd. The case herd definition was established based on the reported PED clinical signs from the outbreaks that occurred in the United States, which included acute onset of severe, watery, neonatal diarrhea with high mortality rates and depression [2]. The authors expected and assumed that the clinical presentation of the Canadian herds would be similar to the infected herds in the United States if the same PED strains (Chinese strain AH2012 and variant-INDEL strain OH581) were identified.

For selection of control herds, a list of Ontario swine herds was provided from the Ontario Pork Producers' Marketing Board under a data sharing agreement, and herds were categorized based on facility type and herd size. Control herds were then selected using computer-generated random sampling and matched to case herds based on province, facility type and approximate size.

Data collection

A questionnaire was used to gather farm management and biosecurity protocols from case and control herds. The questionnaire took approximately one-hour to complete and was completed during a face-to-face interview or by telephone. A detailed description of the questionnaire can be found in Perri et al. [12]. The questionnaire was designed to gather an in-depth description of all farm practices and to gather information that is not often recorded in the literature (i.e. feed storage locations and environmental conditions). The farm practices focused on included: herd demographics, movements on-site and movements off-site. The questionnaire data for case herds were based on their first day of clinical signs of PEDV, and a 30-day history prior to that day. The questionnaire data for control herds were based on the matched case herd's 30-day period of interest.

For herd demographics, the information collected included: the premises identification, name of the site owner, contact information, farm type, number of buildings on-site, number of pigs by production class, pig flow, distance to nearest pig site, number of pig sites within a 3-km radius, water source for pigs, premises washing, whether the water was used for treatment, dates of initial clinical signs (case herds only) and the laboratory name where PEDV was diagnosed (case herds only). A heat producing unit (HPU) variable was computed at the site level. The HPU accounts for production type and the number of pigs on site and was calculated as follows:

HPU = 0.17 × (sum of nursery and finisher pigs) + 0.3 × (sum of gilts and sows) [16, 17].

At the site level, information collected based on incoming movements included the number of pig movements, people (staff and visitors), feed deliveries, truck movements, semen deliveries, dates of movements and deliveries. The source (i.e. feed supplier, semen company, transportation company, etc.), name of the staff and visitors and whether the visitor was at a different swine site prior to the current site, the quantity of pigs, dates of other species shipments (i.e. beef and dairy cattle) and the quantity of feed and feed ingredients delivered were documented. The biosecurity protocols for truck drivers, trailers, feed storage, staff and non-staff members were recorded.

For movements off-site, information was gathered based on the number pig movements, deadstock and culled pig movements, feed deliveries (i.e. delivering feed or feed ingredients off-site within the same production system) and truck movements. The dates of all movements, the number of pigs shipped, the category of pigs, deadstock management protocols and biosecurity practices of trucks and trailers were reported. The truck companies used to deliver feed, pigs, other species and deadstock off site was recorded.

Descriptive analysis

Descriptive analysis was performed on all variables included in the study using Stata 13.1 (StataCorp LP, USA). For continuous variables, the mean, standard deviation, median, interquartile range, and the minimal and maximum (range) number of observations were recorded based on herd status (case and control herds). For binary variables, the proportions were calculated based on herd status.

Statistical analysis

The original data set available for statistical analysis consisted of 185 variables. After removing the variables with missing data (n = 22) and variables that had a zero variance (n = 12) the data set consisted of 151 numerical and binary variables. Correlation among the variables was then evaluated using Spearmen's correlation coefficient. Highly correlated variables (r > 0.75) were subsequently evaluated and removed (n = 37). Removal of highly correlated variables was based on manual selection, so that the most biologically most relevant variables from the correlated set remained as a possible predictor for statistical analysis. After removing variables as described above, the data set included PED herd status (case or control herd) with 112 predictor variables.

RFs were constructed using two steps, initially with 112 variables considered (referred to as the full model), and finally with 39 variables that were identified to be the most important variables using extent of decrease in predictive accuracy as the selection criterion (referred to as the final model). Variables were selected to be in the final model if the decrease in predictive accuracy was greater than zero. RFs were constructed using ‘randomForest’ package [18] in the statistical software R version 3.3.0 (The R Project for Statistical Computing, Austria). RFs are an ensemble non-parametric method based on iterative fitting of a number of classification trees that are constructed from a random sample of all variables at each node. In this study, the number of trees constructed was fixed to 100,000. A cross-validation approach was utilized, using the ‘caret’ package in R [19], to determine the ideal number of variables to be included in the full model. The number of randomly selected variables was set to 5 in the final model. The predictive accuracy of each model was determined by using a classification matrix based on the observed data and additionally by using 10x repeated 5-fold cross-validation, which is available in the caret package. Measures considered were accuracy, sensitivity and specificity. Assessment of predictive accuracy using the validation dataset was not attempted due to the low number of observations.

The RF model was also used to determine variable importance for predicting herd status using the ‘randomForestExplainer’ package in R [20]. Multiple measures of importance were evaluated including: selection frequency, total number of trees, total number of nodes, minimal depth, accuracy decrease, Gini decrease and p values. The concept of minimal depth is illustrated in Supplementary Fig. 1.

RESULTS

Descriptive analysis

Twenty sow herds (n = 8 case herds, n = 12 control herds) from the case-control study were used for the current study. For case herds, half of the production systems were farrow-to-finish sites (n = 4, 50%) followed by farrow-to-wean (n = 2, 25%) and farrow-to-feeder (n = 2, 25%). The control herds in the study were evenly distributed between farrow-to-finish sites (n = 4, 33.3%), farrow-to-wean (n = 4, 33.3%) and farrow-to-feeder (n = 4, 33.3%). Some case herds (n = 4, 50%) and control herds (n = 8, 66.7%) were part of a multi-site system. The mean and standard deviation of the HPU for case and control herds were 935.4 ± 428.4 and 518.2 ± 321.0, respectively. Tables 1 to 8 report the descriptive statistics of the variables computed from the questionnaire by herd status.

Table 1
The number and distance* of nearby herds from PED-affected and non-PED-affected Canadian swine herds

Table 2
The number of buildings by pig flow for PED-affected and non-PED-affected* Canadian swine herds

Table 3
The quantity and frequency of pigs and pig movements on- and off-site by production class identified in PED-affected and non-PED-affected* Canadian swine herds

Table 4
The biosecurity protocols of trailers and truck drivers when pigs were delivered on site PED-affected and non-PED-affected* Canadian swine herds

Table 5
The biosecurity protocols of trailers and truck drivers when pigs were delivered off site PED-affected and non-PED-affected* Canadian swine herds

Table 6
The biosecurity protocols of staff and non-staff members on site PED-affected and non-PED-affected* Canadian swine herds

Table 7
Descriptive feed deliveries and storage practices on farm for PED-affected and non-PED-affected* Canadian swine herds

Table 8
Feed mixing practices of PED-affected and non-PED-affected* Canadian swine herds

At the site level, 50% of case herds (n = 4) reported having only one building. One case herd (13%) reported having two buildings on-site and three case herds (38%) reported having more than three buildings on-site. At the site level, 50% of control herds (n = 6) reported having one building, 25% of control herds (n = 3) reported having two buildings and 25% of control herds (n = 3) reported having more than three buildings. Further descriptive statistics on the number and distance of nearby herds to participating case and control herds and the number of buildings by pig flow are reported in Tables 1 and 2.

Case herds had a greater number of pig movements on and off of the site compared to control herds over the 30-day period of interest. A detailed description of pig movements is summarized in Table 3. The biosecurity protocols of trailers and truck drivers for delivering pigs are outlined in Tables 4 and 5. Approximately 38% of case herds (n = 3) reported that trailers were washed and disinfected prior to delivering pigs on-site, compared to 33% of control herds (n = 4). It was reported that 38% of truck drivers helped unload pigs and stepped on the laneway on-site for case herds compared to 17% of controls herds (n = 2). The mean and standard deviation of the number of non-staff movements on-site for case and control herds were 4.4 ± 2.8 and 2.9 ± 5.6, respectively based on the 30-day study period of interest. The biosecurity measures implemented on-site for staff and non-members are reported in Table 6. Staff members and visitors were required to shower-in prior to entering the barn for 88% of case herds (n = 7) and 58% of control herds (n = 7). Fifty percent of case herds (n = 4) reported that they had a Danish entry (a separation between a dirty area containing outdoor boots and dirty coveralls and a clean area prior to entering the barn) compared to 58% of control herds (n = 7).

At the site level, 50% of case herds (n = 4) reported having only pigs. However, some case herds reported having beef cattle (n = 1, 13%), dogs (n = 2, 25%) and cats (n = 3, 38%) on-site. Similarly, half of the control herds only reported having pigs on-site (n = 7, 58%). Although, some control herds reported having dogs (n = 4, 33%), cats (n = 2, 17%), horses (n = 2, 17%) and rabbits (n = 1, 0.08%).

The mean and standard deviation of semen deliveries received on-site for case and control herds were 10.5 ± 4.2 and 7.0 ± 6.2, respectively over the 30-day period of interest. Fifty percent of case herds (n = 4) received semen deliveries on-site and the truck driver did not enter the barn. One case herd (13%) received a semen delivery to the house. Twenty-five percent of control herds (n = 3) received semen deliveries where the truck driver did not enter the barn. One control herd (8%) received a semen delivery to the house on-site and a separate herd (n = 1, 8%) received a semen delivery at a different location (other than the barn and house).

Case herds reported 2.5 ± 3.0 deadstock movements off-site within the study period of interest compared to control herds, 2.4 ± 2.9. The deadstock management practices reported by case herds include: deadstock was picked up by contracted company (n = 3, 38%), composted (n = 2, 25%), transported to another site (n = 2, 25%) and buried on-site (n = 1, 13%). For control herds, the deadstock management practices reported include: deadstock was picked up by contracted company (n = 5, 42%), composted (n = 4, 33%) and transported to another site (n = 3, 25%).

Descriptively, case herds received more feed deliveries, larger quantities of bagged and pelleted feed and smaller quantities of bulk feed compared to control herds. The number of deliveries case herds received of pelleted feed were 2.6 ± 2. For bagged and bulk feed deliveries, case herds received 4.0 ± 3.8 and 7.5 ± 8.1, respectively. The number of deliveries control herds received of pelleted feed were 2.1 ± 4.3. For bagged and bulk feed deliveries, control herds received 1.6 ± 2.3 and 8.1 ± 5.6, respectively. Case herds received fewer number of feed deliveries in which the feed was stored inside the barn in a cold area (< 15°C) as well as outside the barn in a storage bin. However, case herds stored more of their feed deliveries inside the barn in a warm (≥ 15°C) area compared to control herds. Tables 7 and 8 contain specific details on feed management and biosecurity practices including: the type of feed deliveries received by production class, how the feed was stored, and whether the feed was mixed on-site.

Random forests

The top ten variables that were determined using RFs for predicting herd status in the final model include: HPU, total feed quantity (tonnes) delivered to a herd, the number of feed deliveries a herd received from an off-site feed supplier, the total number of non-staff members on-site, the number of semen deliveries received and the number of on-site feed related deliveries including: bagged feed, bulk feed, pelleted feed, sow feed, and bio oil (feed ingredient). Seven out of 10 (70%) of the most important variables used to classify and predict PEDV were related to feed deliveries. The variable names and definitions used to predict the herd status during the incursion of PEDV into Canadian swine herds are reported in Supplementary Table 1. Additional farm management practices including the water source and water treatment used by case and control herds and the frequency of deliveries by feed ingredients received are reported in Supplementary Tables 2 and 3. A summary of the variable importance measures for the top 10 variables in the RF are reported in Supplementary Table 4. The distribution and mean minimal depth for the top ten variables was calculated using the function mean sample “top trees” and is illustrated in Fig. 1. This function calculates the mean minimal depth using a subset out of the number of trees, where the subset is equal to the maximum number of trees in which any Xj was used for splitting. The shorter the distance from the root of the tree to the variable of interest, the more predictive the variable is. To explain the concept of minimal depth, see Supplementary Fig. 1. Relationships between the importance measures were examined by plotting the measures pairwise against each other (Supplementary Fig. 2). This figure excludes p values and the number of trees since comparable information is obtained using the number of nodes. This plot was used to distinguish the three measures that were least correlated. However, all variable measures of importance were correlated and regardless of the measure selected, they all identified the same variables as important.

Fig. 1
The distribution and mean minimal depth for the top variables for predicting porcine epidemic diarrhea virus during the incursion of the virus in Canadian swine herds, 2014. The vertical black line represents the mean minimal depth. The x-axis ranges from zero to 30 000 trees in which is the maximum any variable was used for splitting on Xj. Variable HPU is the only variable that reaches the maximum number of trees.
HPU, heat producing unit.

The three importance measures that were least correlated were accuracy decrease, Gini decrease and the number of times as a root node. However, these 3 measures were still highly correlated (r > 0.75). A multi-importance plot using the top 3 importance measures to visually identify the top variables for predicting herd status is illustrated in Fig. 2. Variables of importance by category of herd management based on mean decrease in accuracy for predicting PEDV during the incursion of the virus in Canadian swine herds are presented in Supplementary Fig. 3. The top 3 variables of importance found using mean decrease in accuracy include HPU, total feed quantity (tonnes) delivered to a herd and the number of feed deliveries a herd received by contracted feed suppliers. Variables of importance by category of herd management based on Gini decrease for predicting PEDV during the incursion of the virus in Canadian swine herds are presented in Supplementary Fig. 4. The same top 3 variables of importance listed above were found when using Gini decrease for predicting PEDV during the incursion of the virus in Canadian swine herds.

Fig. 2
Multi-importance plot using the accuracy decrease, Gini decrease and number of times as a root node to visually identify variables for predicting porcine epidemic diarrhea virus during the incursion of the virus in Canadian swine herds using random forests. The blue circles represent the top variables (most important) for predicting herd status (case versus control herd) and the black variables are the remaining variables using in the random forest model.
HPU, heat producing unit.

Partial dependence plots for the top three importance variables are illustrated in Supplementary Fig. 5. The two importance variables that had the highest selection frequency (number of times as a root node), number of nodes, number of trees, Gini decrease, and the smallest mean minimal death was HPU and total feed quantity (tonnes) delivered to a herd. The HPU was the only variable which reached the maximum number of trees for splitting.

For the full RF model, the average accuracy for predicting herd status (case or control herd) based on repeated cross-validation was 55.3%. The specificity and sensitivity of the model were 78.3% and 20.6%, respectively. For the final RF model, the predictive accuracy was 72% and the specificity and sensitivity of the model were 90% and 45%, respectively.

DISCUSSION

Prior to January 2014, Canadian swine industry members, stakeholders and veterinarians were preparing for the emergence of PEDV, due to the rapid spread of the virus observed months prior in the United States and the high connectivity between the Canadian and United States industries. The virus has significantly affected the swine industry due to production losses as well as financial commitments for intervention and surveillance programs [21]. The production losses are due to reduced performance in breeding herds as a result of high morbidity and mortality that occurs in immunologically naïve neonatal piglets. Poor performance is also evident in finishing herds due to periods of increased clinical illness (i.e. diarrhea and vomiting). Given that the routes of transmission for PEDV have been established within and between herds, including: animal movements, truck movements, people, equipment, feed and herd biosecurity [6, 7, 8, 9], these were the areas of focus for the current study.

There are many truck movements that transport nursery pigs, market hogs, and cull sows from the province of Ontario (Canada) to the United States. In 2014, 3.9 million feeder pigs were exported from Canada to the United States, which accounted for 79.5% of the United States total imports [22]. It has been previously reported in the United States that transport trucks are typically shared between producers and sites and are often not disinfected or washed in between loads, which can enable the spread of PEDV [23]. After the trucks are emptied in the United States, they often return to Ontario without being washed, disinfected and dried [23]. Additionally, in a study conducted by Lowe et al. [24], it was found that animal movement and biosecurity practices related to animal transportation (trucks and trailers) played a role in viral transmission. Lowe et al. [24] reported that 4.9% PEDV-free trucks that delivered pigs to abattoirs were cross-contaminated after the pigs were unloaded. For this reason, it was initially speculated that the emergence of PED into Canada would be due to contaminated trucks from the United States. Although, the biosecurity practices based on transportation, truck drivers and herd management practices descriptively favour case herds over control herds in the current study. For instance, 38% of case herds received pig deliveries from transport trailers that were washed and disinfected prior to delivering pigs on-site compared to 33% of control herds. Thirty-eight percent of case herds reported that truck drivers helped unload pigs, stepped on the laneway and entered the barn when delivering pigs on-site compared to 17% of control herds. When pigs were delivered off-site to either a different farm, abattoir, deadstock or culled animal site, 63% of case herds reported that the pigs were delivered in transport trailers that were washed and disinfected compared to 42% of control herds. Also, 88% of staff and non-staff members were required to shower on-site prior to entering the barn compared to 58% of control herds. Case herds also had fewer numbers of deadstock movements picked up by contracted deadstock companies compared to control herds. Thus, case herds seemed to have better biosecurity practices related to herd management and animal movements including deadstock and culled animals with regards to transportation and truck driver protocols. These findings were also determined analytically by Perri et al. [12] and descriptively by Pasma et al. [14]. This indicates that biosecurity related to animal movements, transportation vehicles and deadstock removal likely did not play a role in the incursion of PEDV into Canada.

With regards to feed management practices, feed suppliers and potentially contaminated SDPP were not included in the current study. Viral infection based on feed is likely a multi-factorial process. For instance, specific feed ingredients capable of harbouring viable PEDV over a long period of time becomes contaminated, followed by the contaminated ingredients contacting a supportive matrix during the processing of complete feed [10]. Feed deliveries consisting of the PEDV-positive feed transmit the virus to pigs on farm. Dee et al. [10] found that PEDV survival is specific to certain ingredients. Dee et al. [10] also found that PEDV survived longer in soybean meal which may have been due to environmental temperatures since decreased titers were observed with increased temperature over time. This is because coronaviruses including PEDV can typically survive temperatures from 56°C for 10–15 min, 37°C for several days, 4°C for numerous months, and while frozen at −60°C for many years without losing infectivity [25]. In the current study, the case herds received a higher amount of feed deliveries on-site that consisted of pelleted and bagged feed compared to control herds. Control herds received more feed ingredient deliveries such as soybean meal. Regarding feed storage, case herds stored more feed inside the barn in warm areas than control herds. Whereas, control herds stored more of their feed ingredients indoors and outdoors in cold areas (i.e. in non-insulated areas) located in storage bins. The study period of interest occurred during the winter months in Canada and thus, feed storage (i.e. if stored outside) could have increased the risk of the viability of the virus if the complete feed or feed ingredients were contaminated. Although, if temperature (i.e. colder temperature) were to be associated with PEDV herd status during the incursion of the virus, this would be counterintuitive for the current study since the control herds stored their feed in colder temperatures. To further understand whether feed management practices influenced a herd's PEDV status, RF models were applied.

The machine learning tool RF was used to provide a unique perspective on determining the most important variables for predicting PEDV. The most important variables found to predict herd status (case versus control herds) included herd size and variables related to feed management. All 7 importance measures used to determine variable importance consistently found that following HPU, the total quantity of feed (tonnes) delivered as well as the number of feed deliveries received from off-site feed suppliers were the next top variables for explaining herd status. Although variables related to contaminated plasma were excluded from the model, the variables related to feed were still identified as the most important ones. It is possible that this is due to partial confounding between different feed-related variables. The final model based on a limited set of variables had a moderate predictive accuracy (72%) using cross-validation, although the sensitivity of this model was still poor and indicated that only 45% of case herds were expected to be correctly identified. This predictive accuracy suggests that it was expected that the PED status of 72% of study herds during this study period, (which were not used to train the data) could be accurately predicted using this model. As such, the model has modest accuracy and results should be interpreted with care. Modest accuracy was likely a consequence of withholding an important predictor (exposure to contaminated porcine plasma) from the model. The predictive accuracy further suggests that relying on biosecurity practices that were important predictors of PED status was not a strategy that resulted in great model performance, and we believe that this was likely due to positive correlation between feed-related variables and exposure to contaminated porcine plasma. An indication of this was the high importance of herd size and feed-related variables, among all the biosecurity variables considered in the model. External validity using herds from different study periods was not considered in this study. We expect that this model cannot be extrapolated to other study periods because of all feed-biosecurity measures that were implemented shortly after PED incursion.

The current study does present some limitations. Of the 27 identified case herds during the study period, only 8 sow herds participated in the study. This moderate response rate (30%) is likely due to producers not having enough time to participate especially in the immediate post-outbreak phase, as well as the sensitive nature of the early phase of the outbreak. It was found that the case herds that participated in the study were larger in production size based on HPU compared to control herds, despite efforts to match on the herd size category. This finding likely had an effect on why case herds had more pig movements, higher quantities of pigs delivered, feed deliveries, quantity of feed, non-staff movements on-site and semen deliveries. The next limitation of the study is the sample size. There were 20 herds included in the study with > 100 predictive variables evaluated. However, the RF tree-based machine learning tool splits nodes one step-at-a-time which permits trees (and thus forests) to implement regularization in data sets with large amounts of predictors and a small sample size [15]. Also, the grouping property of trees allows RF to competently account for correlated variables [15]. An advantage of using RF permutation accuracy as an importance measure rather than univariate screening models is that it accounts for each predictor variable individually as well as in multivariate interactions with all other predictor variables [15]. Using RF can increase the prediction accuracy compared to single classification trees because the ensemble method adjusts and accounts for instability of individual trees by small changes in the learning sample, which impairs the prediction accuracy in test samples [26]. Another limitation of using RF is that the interpretation of the model is not as straightforward compared to an individual classification tree. However, this study incorporated 7 measures of importance to determine the top variables for predicting the model. The importance measures were all highly correlated, however the results were consistent when comparing the top 10 importance variables.

In conclusion, the study provided a detailed descriptive analysis of general biosecurity and management practices implemented in PEDV-positive sow herds during the early phase of the 2014 outbreak and in matched control herds. The study focussed solely on the incursion of PEDV into Canadian swine herds during the period of January 22 and March 1, 2014. Detailed descriptions of biosecurity and feed management on case and matched control herds have added insight to the on-farm practices because most of the data obtained and recorded is not readily found in the literature and fills this knowledge gap. A RF machine learning tool determined that along with herd size, 70% of the importance variables for predicting herd status (case versus control herds) were feed management protocols. The final RF model provided a moderate predictive accuracy (72%) and poor sensitivity (45%). The poor sensitivity of this model is problematic due to the nature of the virus. Since the common source of infection was not included, the RF model determined herd size and feed management variables were of importance for predicting herd status, which is likely due to partial confounding. Overall, this study has provided novel insight into the role of biosecurity and feed management practices at the time PEDV emerged into Canadian swine herds in 2014.

SUPPLEMENTARY MATERIALS

Supplementary Table 1

Variable names and definitions used in the study to predict the herd status during the incursion of porcine epidemic diarrhea virus in Canadian swine herds using random forests

Click here to view.(35K, xls)

Supplementary Table 2

Water source and treatment used for PED-affected and non-PED-affected* swine herds

Click here to view.(29K, xls)

Supplementary Table 3

The frequency of deliveries by feed ingredients received by PED-affected and non-PED-affected* Canadian swine herds

Click here to view.(29K, xls)

Supplementary Table 4

A summary of the variable importance measures* for the top variables for predicting porcine epidemic diarrhea virus during the incursion of the virus in Canadian swine herds

Click here to view.(31K, xls)

Supplementary Fig. 1

A depiction to explain the concept of minimal depth in random forests. The top black node is referred to as the root node. The numbers represent the minimal depth of each node. The variables that split closest to the root node have a strong effect on the predictive accuracy of the random forest model and a higher importance measure.

Click here to view.(594K, ppt)

Supplementary Fig. 2

Relationship between measures of importance* for predicting porcine epidemic diarrhea virus during the incursion of the virus in Canadian swine herds.

Click here to view.(2M, ppt)

Supplementary Fig. 3

Variable importance using mean decrease in accuracy for predicting porcine epidemic diarrhea virus by type of management practice during the incursion of the virus in Canadian swine herds, 2014.

Click here to view.(1M, ppt)

Supplementary Fig. 4

Variable importance using Gini decrease for predicting porcine epidemic diarrhea virus by type of management practice during the incursion of the virus in Canadian swine herds, 2014.

Click here to view.(1M, ppt)

Supplementary Fig. 5

Partial dependence plots for the top three importance variables for predicting PEDV during the incursion of the virus in Canadian swine herds, 2014.

Click here to view.(1M, ppt)

Notes

Funding:This study relied on collaborative efforts with Ontario Pork, Animal Health Laboratory, the University of Guelph, the Ontario Ministry of Agriculture, Food and Rural Affairs (OMAFRA), and Ontario veterinarians and swine producers. This work was funded by OMAFRA (#030042), Ontario Pork (#052556), NSERC-Collaborative Research Development (CRD) (#401204) grants and OVC fellowship.

Conflict of Interest:The authors declare no conflicts of interest.

Author Contributions:

  • Conceptualization: O'Sullivan TL, Poljak Z.

  • Data curation: O'Sullivan TL, Poljak Z.

  • Formal analysis: Perri AM, O'Sullivan TL, Poljak Z.

  • Funding acquisition: O'Sullivan TL, Poljak Z.

  • Investigation: O'Sullivan TL, Poljak Z, Perri AM.

  • Methodology: Perri AM, O'Sullivan TL, Poljak Z.

  • Project administration: O'Sullivan TL, Poljak Z.

  • Supervision: O'Sullivan TL, Poljak Z.

  • Validation: Perri AM, O'Sullivan TL, Poljak Z.

  • Visualization: Perri AM, O'Sullivan TL, Poljak Z, Dewey C, Harding JCS.

  • Writing - original draft: Perri AM.

  • Writing - review & editing: O'Sullivan TL, Poljak Z, Dewey C, Harding JCS.

References

    1. EFSA Panel on Animal Health and Welfare (AHAW). Scientific opinion on porcine epidemic diarrhoea and emerging porcine deltacoronavirus. EFSA J 2014;12:3877.
    1. Stevenson GW, Hoang H, Schwartz KJ, Burrough ER, Sun D, Madson D, Cooper VL, Pillatzki A, Gauger P, Schmitt BJ, Koster LG, Killian ML, Yoon KJ. Emergence of Porcine epidemic diarrhea virus in the United States: clinical signs, lesions, and viral genomic sequences. J Vet Diagn Invest 2013;25:649–654.
    1. Wang L, Byrum B, Zhang Y. New variant of porcine epidemic diarrhea virus, United States, 2014. Emerg Infect Dis 2014;20:917–919.
    1. Pasick J, Berhane Y, Ojkic D, Maxie G, Embury-Hyatt C, Swekla K, Handel K, Fairles J, Alexandersen S. Investigation into the role of potentially contaminated feed as a source of the first-detected outbreaks of porcine epidemic diarrhea in Canada. Transbound Emerg Dis 2014;61:397–410.
    1. Kim YK, Lim SI, Cho IS, Cheong KM, Lee EJ, Lee SO, Kim JB, Kim JH, Jeong DS, An BH, An DJ. A novel diagnostic approach to detecting porcine epidemic diarrhea virus: The lateral immunochromatography assay. J Virol Methods 2015;225:4–8.
    1. Alvarez J, Goede D, Morrison R, Perez A. Spatial and temporal epidemiology of porcine epidemic diarrhea (PED) in the Midwest and Southeast regions of the United States. Prev Vet Med 2016;123:155–160.
    1. Ayudhya SN, Assavacheep P, Thanawongnuwech R, Thanawongnuwech R. One world--one health: the threat of emerging swine diseases. an Asian perspective. Transbound Emerg Dis 2012;59 Suppl 1:9–17.
    1. Pospischil A, Stuedli A, Kiupel M. Update on porcine epidemic diarrhea. J Swine Health Prod 2002;10:81–85.
    1. Sasaki Y, Alvarez J, Sekiguchi S, Sueyoshi M, Otake S, Perez A. Epidemiological factors associated to spread of porcine epidemic diarrhea in Japan. Prev Vet Med 2016;123:161–167.
    1. Dee S, Neill C, Clement T, Singrey A, Christopher-Hennings J, Nelson E. An evaluation of porcine epidemic diarrhea virus survival in individual feed ingredients in the presence or absence of a liquid antimicrobial. Porcine Health Manag 2015;1:9.
    1. Opriessnig T, Xiao CT, Gerber PF, Zhang J, Halbur PG, Halbur P. Porcine epidemic diarrhea virus RNA present in commercial spray-dried porcine plasma is not infectious to naïve pigs. PLoS One 2014;9:e104766
    1. Perri AM, Poljak Z, Dewey C, Harding JCS, O'Sullivan TL. An epidemiological investigation of the early phase of the porcine epidemic diarrhea (PED) outbreak in Canadian swine herds in 2014: A case-control study. Prev Vet Med 2018;150:101–109.
    1. Perri AM, Poljak Z, Dewey C, Harding JCS, O'Sullivan TL. Network analyses using case-control data to describe and characterize the initial 2014 incursion of porcine epidemic diarrhea (PED) in Canadian swine herds. Prev Vet Med 2019;162:18–28.
    1. Pasma T, Furness MC, Alves D, Aubry P. Outbreak investigation of porcine epidemic diarrhea in swine in Ontario. Can Vet J 2016;57:84–89.
    1. Chen X, Ishwaran H. Random forests for genomic data analysis. Genomics 2012;99:323–329.
    1. Flori J, Mousing J, Gardner I, Willeberg P, Have P. Risk factors associated with seropositivity to porcine respiratory coronavirus in Danish swine herds. Prev Vet Med 1995;25:51–62.
    1. Lambert ME, Arsenault J, Poljak Z, D'Allaire S. Epidemiological investigations in regard to porcine reproductive and respiratory syndrome (PRRS) in Quebec, Canada. Part 2: prevalence and risk factors in breeding sites. Prev Vet Med 2012;104:84–93.
    1. Liaw A, Wiener M. Classification and regression by randomForest. R News 2002;2:18–22.
    1. Kuhn M. Classification and Regression Training. [updated 2018]. [cited 2018 September 19].
    1. Paluszyńska A. Understanding Random Forests with randomForestExplainer. [updated 2017]. [cited 2018 September 19].
    1. Jung K, Saif LJ. Porcine epidemic diarrhea virus infection: Etiology, epidemiology, pathogenesis and immunoprophylaxis. Vet J 2015;204:134–143.
    1. United States Department of Agriculture Economics, Statistics and Market Information System. Overview of the United States Hog Industry. [updated 2015]. [cited 2018 October 1].
    1. Ojkic D, Hazlett M, Fairles J, Marom A, Slavic D, Maxie G, Alexandersen S, Pasick J, Alsop J, Burlatschenko S. The first case of porcine epidemic diarrhea in Canada. Can Vet J 2015;56:149–152.
    1. Lowe J, Gauger P, Harmon K, Zhang J, Connor J, Yeske P, Loula T, Levis I, Dufresne L, Main R. Role of transportation in spread of porcine epidemic diarrhea virus infection, United States. Emerg Infect Dis 2014;20:872–874.
    1. Casanova LM, Jeon S, Rutala WA, Weber DJ, Sobsey MD. Effects of air temperature and relative humidity on coronavirus survival on surfaces. Appl Environ Microbiol 2010;76:2712–2717.
    1. Strobl C, Boulesteix AL, Zeileis A, Hothorn T. Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinformatics 2007;8:25.

Metrics
Share
Figures

1 / 2

Tables

1 / 8

PERMALINK