HASHING ADAPTATIVO BASADO EN REDES NEURONALES PARA BÚSQUEDA POR SIMILITUD

Thesis
HASHING ADAPTATIVO BASADO EN REDES NEURONALES PARA BÚSQUEDA POR SIMILITUD

Simple item page

dc.contributor.advisor	ÑANCULEF, RICARDO
dc.contributor.author	VELÁSQUEZ ARAYA, JOAQUÍN EDUARDO
dc.contributor.department	Universidad Tecnica Federico Santa Maria UTFSM INFORMATICA	es_CL
dc.coverage.spatial		es_CL
dc.date.accessioned	2017-08-08T19:08:58Z
dc.date.available	2017-08-08T19:08:58Z
dc.date.issued	2017
dc.description	Catalogado desde la version PDF de la tesis.	es_CL
dc.description.abstract	Este trabajo está enfocado en la búsqueda por similaridad de documentos de texto, porello se busca un modelo para obtener una representación binaria de documentos de texto querefleje la similitud semántica entre ellos y alcance una alta precisión.La búsqueda por similitud de documentos de texto corresponde a obtener los documentosdentro de una colección que resultan semánticamente similares respecto a un documentode consulta, es decir, que están relacionados con dicha consulta en función de su significadoo contenido. Una representación binaria de estos documentos que refleje su similitudSemántica permite operar en el espacio de Hamming, en donde las operaciones necesariaspara comparar las representaciones son de menor complejidad. En la recuperación de informaciónsuele ser relevante recuperar una baja cantidad de documentos pero alcanzando unaalta precisión.Se realizó una implementación en Python basado en el modelo propuesto en [28] paraGenerar representación binaria de documentos de texto y se evaluó su desempeño variandoParámetros del modelo. Este modelo fue modificado para experimentar con distintas arquitecturas,utilizar Constrained Poisson Model y se suprimió el ruido. De este modo, se encontraronmodelos con alta precisión y poco profundos para la recuperación de una baja cantidadde documentos.	es_CL
dc.description.abstract	This work is focus on similarity search of text documents, for this reason it shows a modelto obtain a binary representation of text documents that reflects the semantics similaritybetween them and reach a high precision.Similarity search of text documents corresponds to obtaining the documents within acollection that is semantically similar to a query document, that is, they are related to thequery according to its meaning or content. A binary representation of these documents thatreflects their semantic similarity allows to operate over Hamming space, where the operationsnecessary to compare the representations have less complexity. In information retrieval it isusual to recover a low number of documents but attaining high accuracy.A model baser on [28] was implemented on Python to generate binary representation oftext documents and its performance was evaluated by varying model parameters. This modelwas modified to experiment with dierent architectures to use Constrained Poisson Modeland the noise was suppressed. In this way, shallow models with high precision and for thelow amount recovery were found.v	eng
dc.description.degree	INGENIERO CIVIL INFORMÁTICO	es_CL
dc.description.program	INGENIERÍA CIVIL INFORMÁTICA
dc.format.extent	93 h.
dc.format.medium	CD ROM
dc.format.mimetype	application/pdf
dc.identifier.barcode	3560902038236
dc.identifier.uri	http://hdl.handle.net/11673/15561
dc.rights	info:eu-repo/semantics/openAccess
dc.rights.accessRights	A - Internet abierta www.repositorio.usm.cl y otros repositorios a la que la USM se adscriba
dc.subject	BUSQUEDA POR SIMILITUD	es_CL
dc.subject	RECONOCIMIENTO POR PATRONES	es_CL
dc.subject	REDES NEURONALES	es_CL
dc.title	HASHING ADAPTATIVO BASADO EN REDES NEURONALES PARA BÚSQUEDA POR SIMILITUD	es_CL
dc.type	Tesis Pregrado	es_CL
dspace.entity.type	Tesis
usm.date.thesisregistration	2016
usm.identifier.thesis	4500012524

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 3560902038236UTFSM.pdf
Size:: 3.36 MB
Format:: Adobe Portable Document Format
Description:

Collections

Tesis de Pregrado de Acceso Abierto