AUN Digital Repository

Data classification using various learning algorithms

Show simple item record

dc.contributor.author AHMAD BABA, Usman
dc.date.accessioned 2019-05-08T14:41:12Z
dc.date.available 2019-05-08T14:41:12Z
dc.date.issued 2017-12
dc.identifier.issn A00018852
dc.identifier.uri http://hdl.handle.net/123456789/582
dc.description Graduate Research Thesis submitted to the School of Graduate Studies in partial fulfillment of the requirements for the award of the degree of Master of Science in Information Systems. en_US
dc.description.abstract Dimensionality reduction provides a compact representation of an original high-dimensional data, which means the reduced data is free from any further processing and only the vital information is retained. For this reason, it is an invaluable preprocessing step before the application of many machine learning algorithms that perform poorly on high-dimensional data. In this thesis, the perceptron classification algorithm – an eager learner - is applied to three two-class datasets (Student, Weather and Ionosphere datasets). The k-Nearest Neighbors classification algorithm - a lazy learner - is also applied to the same two-class datasets. Each dataset is then reduced using fifteen different dimensionality reduction techniques. The perceptron and k-nearest neighbor classification algorithms are applied to each reduced set and the performance (evaluated using confusion matrix) of the dimensionality reduction techniques is compared on preserving the classification of a dataset by the k-nearest neighbors and perceptron classification algorithms. This investigation revealed that the dimensionality reduction techniques implemented in this thesis seem to perform much better at preserving K-Nearest Neighbor classification than they do at preserving the classification of the original datasets using the perceptron. In general, the dimensionality reduction techniques prove to be very efficient in preserving the classification of both the lazy and eager learners used for this investigation. en_US
dc.language.iso en_US en_US
dc.publisher American University of Nigeria, School of Information, Technology and Computing en_US
dc.relation.ispartofseries Graduate Research Thesis;GRT 2017
dc.subject Classification, confusion matrix, dimensionality reduction, eager learner, k-nearest neighbors, lazy learner, perceptron, data-set, machine learning en_US
dc.title Data classification using various learning algorithms en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

  • School of Information Technology and Computing
    Collection of research theses and dissertations written by graduate students in the school of information technology and computing for the following programmes; Msc computer science, Msc information sysytems, PhD computer science, etc.

Show simple item record

Search AUN Digital Repository


Advanced Search

Browse

My Account