Machine learning methods for the prediction of antimicrobial resistance and identification of novel markers of resistance in Escherichia coli

dc.contributor.authorMoat, Janice
dc.contributor.authorUniversity of Lethbridge. Faculty of Arts and Science
dc.contributor.supervisorZovoilis, Athanasios
dc.contributor.supervisorLaing, Chad
dc.date.accessioned2024-01-22T22:59:17Z
dc.date.available2024-01-22T22:59:17Z
dc.date.issued2023
dc.degree.levelMasters
dc.description.abstractAntimicrobial resistant strains of pathogenic Escherichia coli are a burden on the healthcare system, causing longer hospital stays and increased treatment costs compared to nonresistant strains. The proportion of E. coli infections in Canada caused by resistant strains producing extended-spectrum beta-lactamase rose from 3.4% in 2007 to 11.1% in 2017. Use of most antimicrobials in the treatment of Shiga-toxin producing E. coli infection is not recommended due to their propensity to increase toxin production. Rapid detection of resistant strains would improve both treatment and prevention of this pathogen. With whole genome sequencing (WGS) now ubiquitous in the analyses of outbreak and surveillance samples, in silico methods can be both faster and cheaper than traditional wet-lab methods. In this work, machine learning (ML) classification methods were used for the prediction of antimicrobial resistance and the identification of potentially novel genomic markers of resistance in E. coli. There are four supplementary files to accompany Chapter 3. Supplementary Figure 1 is the Phylogenetic tree of the 4300 E. coli isolates with Salmonella as an outgroup; it is coloured by country of origin and serotype and paired with legends for each. Supplementary Table 1 contains the list of publicly available genomes collected and their corresponding metadata. Supplementary Table 2 contains the top features extracted from trained machine learning models and their annotations. Supplementary Table 3 contains the accuracy data for training machine learning models across a range of 100 to 5000 features. Supplementary Table 4 contains the complete performance data for the 11-mer and 25-mer machine learning models.
dc.identifier.urihttps://hdl.handle.net/10133/6660
dc.language.isoen
dc.proquest.subject0715
dc.proquest.subject0487
dc.proquest.subject0369
dc.proquestyesYes
dc.publisherLethbridge, Alta. : University of Lethbridge, Dept. of Chemistry and Biochemistry
dc.publisher.departmentDepartment of Biochemistry
dc.publisher.facultyArts and Science
dc.relation.ispartofseriesThesis (University of Lethbridge. Faculty of Arts and Science)
dc.subjectBioinformatics
dc.subjectMachine learning
dc.subjectAntimicrobial resistance
dc.subjectBacteria
dc.subjectE. coli
dc.subject.lcshEscherichia coli
dc.subject.lcshAnti-infective agents
dc.subject.lcshDissertations, Academic
dc.titleMachine learning methods for the prediction of antimicrobial resistance and identification of novel markers of resistance in Escherichia coli
dc.typeThesis
Files
Original bundle
Now showing 1 - 5 of 6
Loading...
Thumbnail Image
Name:
Supplementary-Figures.docx
Size:
5.53 MB
Format:
Microsoft Word XML
Description:
Loading...
Thumbnail Image
Name:
Supplementary-Table-1.xlsx
Size:
617.05 KB
Format:
Microsoft Excel XML
Description:
Loading...
Thumbnail Image
Name:
Supplementary-Table-2.xlsx
Size:
48.11 KB
Format:
Microsoft Excel XML
Description:
Loading...
Thumbnail Image
Name:
Supplementary-Table-3.xlsx
Size:
51.38 KB
Format:
Microsoft Excel XML
Description:
Loading...
Thumbnail Image
Name:
Supplementary-Table-4.xlsx
Size:
19.94 KB
Format:
Microsoft Excel XML
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.25 KB
Format:
Item-specific license agreed upon to submission
Description: