Publication:
On applications of rough sets theory to knowledge discovery

dc.contributor.advisor Acuña-Fernández, Edgar
dc.contributor.author Coaquira-Nina, Frida R.
dc.contributor.college College of Engineering en_US
dc.contributor.committee Macchiavelli, Raul
dc.contributor.committee Saito, Tokuji
dc.contributor.committee Vega, Fernando
dc.contributor.department Department of Electrical and Computer Engineering en_US
dc.contributor.representative Cordova, Mario
dc.date.accessioned 2019-02-12T15:30:46Z
dc.date.available 2019-02-12T15:30:46Z
dc.date.issued 2007
dc.description.abstract Knowledge Discovery in Databases (KDD) is the nontrivial extraction of implicit, previously unknown and potentially useful information from data. Data preprocessing is a step of the KDD process that reduces the complexity of the data and offers better conditions to subsequent analysis. Rough sets theory, where sets are approximated using elementary sets, is another approach for developing methods for the KDD process. In this doctoral Thesis, we propose new algorithms based on Rough sets theory for three data preprocessing steps: Discretization, feature selection, and instance selection. In Discretization, continuous features are transformed into new categorical features. This is required for some KDD algorithms working strictly with categorical features. In Feature selection, the new subset of features leads to a new dataset of lower dimension, where it is easier to perform a KDD task. When a dataset is very large, an instance selection process is required to decrease the computational complexity of the KDD process. In addition to that, we combine a partitioning clustering algorithm with the Rough sets approach obtaining comparable results to a hierarchical clustering algorithm used along with rough sets. The new methods proposed in this thesis have been tested on datasets taken from the Machine Learning Database Repository at the University of California at Irvine. en_US
dc.description.graduationYear 2007 en_US
dc.identifier.uri https://hdl.handle.net/20.500.11801/1791
dc.language.iso English en_US
dc.rights.holder (c) 2007 Frida R. Coaquira Nina en_US
dc.rights.license All rights reserved en_US
dc.subject Rough set theory en_US
dc.subject Knowledge discovery en_US
dc.title On applications of rough sets theory to knowledge discovery en_US
dc.type Dissertation en_US
dspace.entity.type Publication
thesis.degree.discipline Computing and Information Sciences and Engineering en_US
thesis.degree.level Ph.D. en_US
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
CIIC_CoaquiraNinaF_2007.pdf
Size:
619.74 KB
Format:
Adobe Portable Document Format
Description: