Computer program complexity and its correlation with program features and sociolinguistics
dc.contributor.author | Alam, Sowkat | |
dc.contributor.author | University of Lethbridge. Faculty of Arts and Science | |
dc.contributor.supervisor | Rice, Jacqueline E. | |
dc.date.accessioned | 2021-01-07T16:36:21Z | |
dc.date.available | 2021-01-07T16:36:21Z | |
dc.date.issued | 2021 | |
dc.degree.level | Masters | en_US |
dc.description.abstract | Machine learning techniques have been widely used to understand the use of various sociolinguistic characteristics. These techniques can also be applied to analyze artificial languages. This research focuses on the influence of socio-characteristics, especially region and gender, on an artificial language (programming language). Software complexity features, 103 programming features, and their correlations (using pearson correlation) are also explored in this work. Machine learning and statistical techniques are used to determine whether any dissimilarities or similarities exist in the use of C++ programming language. We show that machine learning models can predict the region of programmers with 78.36\% accuracy and the gender of programmers with 62.63\% accuracy. We hypothesize that feature frequency difference may be a reason for lower accuracy in the gender-based program classification. We also demonstrate that some features such as for-loops and if-else conditions are closely correlated to the complexity of a computer program. | en_US |
dc.description.sponsorship | Natural Sciences and Engineering Research Council (NSERC) of Canada and the University of Lethbridge, Alberta, Canada. | en_US |
dc.identifier.uri | https://hdl.handle.net/10133/5818 | |
dc.language.iso | en_US | en_US |
dc.proquest.subject | Computer science [0984] | en_US |
dc.proquest.subject | Artificial intelligence [0800] | en_US |
dc.proquest.subject | Computer engineering [0464] | en_US |
dc.proquestyes | Yes | en_US |
dc.publisher | Lethbridge, Alta. : University of Lethbridge, Dept. of Mathematics and Computer Science | en_US |
dc.publisher.department | Department of Mathematics & Computer Science | en_US |
dc.publisher.faculty | Arts and Science | en_US |
dc.relation.ispartofseries | Thesis (University of Lethbridge. Faculty of Arts and Science) | en_US |
dc.subject | Artificial intelligence | en_US |
dc.subject | Computer programming | en_US |
dc.subject | Dissertations, Academic | en_US |
dc.subject | Machine learning | en_US |
dc.subject | Programming (Computers) | en_US |
dc.subject | Programming languages (Computers) | en_US |
dc.subject | Sociolinguistics | en_US |
dc.title | Computer program complexity and its correlation with program features and sociolinguistics | en_US |
dc.type | Thesis | en_US |