Location: Egg and Poultry Production Safety Research Unit
Title: Comparison between LASSO and RT methods for prediction of generic E. coli concentration in pastured poultry farmsAuthor
XU, KINRAN - University Of Georgia | |
Rothrock, Michael | |
REEVES, JAXK - University Of Georgia | |
DEV KUMAR, GOVINDARAJ - University Of Georgia | |
MISHRA, ABHINAV - University Of Georgia |
Submitted to: Food Research International
Publication Type: Peer Reviewed Journal Publication Acceptance Date: 8/1/2022 Publication Date: 9/7/2022 Citation: Xu, K., Rothrock Jr, M.J., Reeves, J., Dev Kumar, G., Mishra, A. 2022. Comparison between LASSO and RT methods for prediction of generic E. coli concentration in pastured poultry farms. Food Research International. v.161, p.111860. https://doi.org/10.1016/j.foodres.2022.111860. DOI: https://doi.org/10.1016/j.foodres.2022.111860 Interpretive Summary: Though most strains of E. coli are non-pathogenic components of the intestinal microbiome, certain pathogenic E. coli strains are the cause of diseases and outbreaks. Poultry is identified as a common reservoir for pathogenic E. coli. It is important to identify farm practice factors associated with E. coli in the pastured poultry environment. The objective of this study is to develop models that can predict E. coli levels and to select farm practice factors contributing to E. coli concentration in pastured poultry farms. Five kinds of samples: feces, soil, whole carcass rinse after processing (WCR-P), final product after chilling and storage (WCR-F), and ceca samples were collected for E. coli counts from 11 pastured poultry farms in the southeastern US. The regression tree (RT) and least absolute shrinkage and selection operator (LASSO) methods were applied to data from each sample type. The farm management practices and processing factors such as source of eggs, type of feed used, appearance of other animals on farm, chilling method, and storage time and temperature were all considered as possible explanatory factors in the models. Models were developed to predict the levels of E. coli and to select the most important factors used in predicting E. coli population. Model performances were compared using root mean square error (RMSE). For feces samples, average number of birds and animal source were the two most important variables affecting E. coli population by LASSO. The RT selected animal source, brood feed, day of year, flock age in days, and flock size as the most important variables in predicting E. coli concentration. The RMSE (in log10 scale) under LASSO was 0.974, while under RT it was 1.032 for feces samples. The predictive models provide practical and effective tools to predict E. coli population and to identify farm practice factors that affect E. coli levels. Technical Abstract: Though most strains of E. coli are non-pathogenic components of the intestinal microbiome, certain pathogenic E. coli strains are the cause of diseases and outbreaks. Poultry is identified as a common reservoir for pathogenic E. coli. It is important to identify farm practice factors associated with E. coli in the pastured poultry environment. The objective of this study is to develop models that can predict E. coli levels and to select farm practice factors contributing to E. coli concentration in pastured poultry farms. Five kinds of samples: feces, soil, whole carcass rinse after processing (WCR-P), final product after chilling and storage (WCR-F), and ceca samples were collected for E. coli counts from 11 pastured poultry farms in the southeastern US. The regression tree (RT) and least absolute shrinkage and selection operator (LASSO) methods were applied to data from each sample type. The farm management practices and processing factors such as source of eggs, type of feed used, appearance of other animals on farm, chilling method, and storage time and temperature were all considered as possible explanatory factors in the models. Models were developed to predict the levels of E. coli and to select the most important factors used in predicting E. coli population. Model performances were compared using root mean square error (RMSE). For feces samples, average number of birds and animal source were the two most important variables affecting E. coli population by LASSO. The RT selected animal source, brood feed, day of year, flock age in days, and flock size as the most important variables in predicting E. coli concentration. The RMSE (in log10 scale) under LASSO was 0.974, while under RT it was 1.032 for feces samples. The predictive models provide practical and effective tools to predict E. coli population and to identify farm practice factors that affect E. coli levels. |