AUTOMATION OF RETRIEVAL, TRANSFORMATION AND UPLOADING OF GENOMIC DATA AND THEIR METADATA FOR THEIR INTEGRATION INTO A GDM REPOSITORY

VERA PENA, JORGE IGNACIO

dc.contributor.advisor	MOREIRA WENZEL, ANDRÉS
dc.contributor.author	VERA PENA, JORGE IGNACIO
dc.contributor.department	Universidad Técnica Federico Santa María. Departamento de Informática	es_CL
dc.contributor.other	MASSEROLI, MARCO
dc.coverage.spatial	Campus San Joaquín, Santiago	es_CL
dc.date.accessioned	2020-07-01T15:43:21Z
dc.date.available	2020-07-01T15:43:21Z
dc.date.issued	2018-10
dc.description.abstract	Thanks to next-generation sequencing (NGS) techniques, whole-genome sequences are produced more cheaply and quickly every year, so genomic data are being gathered at a pace never seen before. Processing NGS data is revealing new, meaningful relationships between genomic regions and answering fundamental biological questions; managing NGS data therefore now seems to be the most important big data problem of humankind. Because newly generated NGS data are mostly heterogeneous, they are not easily interoperable. The Genomic Data Model (GDM) describes NGS data in a homogeneous way so that they can interoperate. GMQL is a next-generation query language that operates on GDM data and gives biologists genomics-specific operations for processing large volumes of data to discover biological knowledge. This thesis studies how to improve NGS data analysis by automating and standardizing the integration of genomic data and their experimental metadata into a GDM repository.	es_CL
dc.description.abstract	Owing to NGS technologies, complete genome sequences are produced faster and more cheaply every year, which means genomic data are being obtained at a pace never seen before. By processing these NGS data, new relationships between different genomic regions are being discovered and answers to fundamental biological questions are being found. Managing NGS data therefore now seems to be humankind's most important big data problem. Because new NGS data are mostly heterogeneous, they are not easily interoperable. The Genomic Data Model (GDM) allows NGS data and their metadata to be described homogeneously for interoperability. GMQL is a next-generation query language that, using the GDM model, gives biologists tools specific to the genomic domain for processing large volumes of data and thereby generating new biological knowledge. This work studies the improvement of NGS data analysis by standardizing and automating the integration of experimental data and their metadata into a GDM repository.	es_CL
dc.description.degree	INGENIERO CIVIL INFORMÁTICO	es_CL
dc.description.program	UNIVERSIDAD TÉCNICA FEDERICO SANTA MARÍA UTFSM. DEPARTAMENTO DE INFORMÁTICA. INGENIERÍA CIVIL INFORMÁTICA	es_CL
dc.format.extent	142 h.	es_CL
dc.identifier.barcode	3560902039000	es_CL
dc.identifier.uri	https://hdl.handle.net/11673/49206
dc.subject	GENOMICS	es_CL
dc.subject	DATA PROCESSING	es_CL
dc.subject	SOFTWARE ENGINEERING	es_CL
dc.subject.other	INGENIERIA CIVIL INFORMATICA	es_CL
dc.title	AUTOMATION OF RETRIEVAL, TRANSFORMATION AND UPLOADING OF GENOMIC DATA AND THEIR METADATA FOR THEIR INTEGRATION INTO A GDM REPOSITORY	es_CL
dc.type	Undergraduate thesis
dspace.entity.type	Publication

Files

Original bundle

Name: 3560902039000UTFSM.pdf
Size: 3.04 MB
Format: Adobe Portable Document Format

Collections

TESIS de Pregrado de acceso ABIERTO
