Machine learning in the classification of computer code

dc.contributor.authorTasnim, Nazia
dc.contributor.authorUniversity of Lethbridge. Faculty of Arts and Science
dc.contributor.supervisorRice, Jacqueline E.
dc.date.accessioned2020-09-09T20:08:19Z
dc.date.available2020-09-09T20:08:19Z
dc.date.issued2020
dc.degree.levelMastersen_US
dc.description.abstractMachine learning approaches are a well-established method to analyze natural language. Sociolinguistic characteristics, such as the author's gender, experience, and age, have compelling effects on natural language use. Previous research has shown that a computer program can be analyzed using similar linguistics-based approaches. In this research, we are using machine learning techniques to analyze computer programs based on the author's programming experience. We use machine learning and statistical approaches to determine which features are most significant in the classification of a computer program according to the author's programming experience. Several experiments have been carried out on a dataset consisting of computer programs written in C++, and the results are encouraging. The experimental results estimate that the author's programming experience can be predicted with an accuracy of 69%.en_US
dc.identifier.urihttps://hdl.handle.net/10133/5761
dc.language.isoen_USen_US
dc.proquest.subjectArtificial intelligence [0800]en_US
dc.proquest.subjectComputer science [0984]en_US
dc.proquest.subjectEducational technology [0710]en_US
dc.proquestyesYesen_US
dc.publisherLethbridge, Alta. : University of Lethbridge, Dept. of Mathematics and Computer Scienceen_US
dc.publisher.departmentDepartment of Mathematics and Computer Scienceen_US
dc.publisher.facultyArts and Scienceen_US
dc.relation.ispartofseriesThesis (University of Lethbridge. Faculty of Arts and Science)en_US
dc.subjectArtificial intelligenceen_US
dc.subjectClassificationen_US
dc.subjectComputer programmingen_US
dc.subjectMachine Learningen_US
dc.subjectProgramming languages (Electronic computers)en_US
dc.subjectSociolinguisticsen_US
dc.titleMachine learning in the classification of computer codeen_US
dc.typeThesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Tasnim_Nazia_MSc_2020.pdf
Size:
1.53 MB
Format:
Adobe Portable Document Format
Description:
MSc Thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.25 KB
Format:
Item-specific license agreed upon to submission
Description: