Search
Now showing items 1-10 of 11
Efecto de casos anómalos en máquinas de vectores de soporte
(2009-03)
Support Vector Machines (SVM) is a new technique of classification that has received much attention in recent years. In many applications, the SVM has shown better performance than machine learning methods, and it has been ...
Análisis sobre métodos de pruebas de hipótesis múltiple en la identificación de genes diferencialmente expresados
(2009-07)
The Human Genome Project is the most important reason for the surge of new technologies in the microarray area. These technologies facilitate the experimentation with a large number of genes simultaneously. These experiments ...
A comparison in cluster validation techniques
(2004)
Clustering may be defined as a process that aims to find partitions of similar objects. It is an unsupervised recognition procedure since there are no predefined classes that indicate grouping properties in the data set. ...
A computational environment for data preprocessing in supervised classification
(2004)
In this thesis, a data preprocessing environment has been created, for use in a supervised classification context, with the Windows platform of the R programming language and environment for statistical computing and ...
Clasificación noparamétrica en datos direccionales
(2004)
In a supervised classification problem, when the vectors of data are direction- al, it means, that they take values on a k-dimensional sphere, the application of the algorithms of pattern recognition as k-nearest-neighbour ...
Algorithms for non-parametric classifiers in multi-relational data mining
(2006)
Over the last decades, due to the advances in information technologies, both the industrial and scientific communities have acquired large volumes of data in digital form. Most of these data sets are stored using relational ...
Evaluación de métodos de imputación para datos de expresión genética
(2007)
The technology of microarrays introduced in the middle of the nineties allow the analysis of the gene expression levels of thousands of genes simultaneously. The identification of genes with an expression level very different ...
Unsupervised classification of text documents
(2007)
The automatic extraction of knowledge from very large document collections is becoming an important issue in order to exploit the increasing available information stored in text form. A significant aspect of this extraction ...
Componentes principales supervisados para clasificación de datos de expresión genética
(2005)
The gene expression data obtained through the technology of microarrays are characterized by its considerably greater amount of features in comparison to the number of observations. The direct use of traditional statistics ...
Regresión logística con penalidad ridge aplicada a datos de expresión genética
(2005)
Logistic regression analysis is used in classification to find out which group an individual belong from a predictor variables set. In classification sometimes we work with data sets with more variables than observations. ...