Thesis IMPLEMENTACIÓN Y EVALUACIÓN DE ALGORITMOS DE MINERÍA DE DATOS SOBRE HADOOP MAPREDUCE
Loading...
Date
2014
Journal Title
Journal ISSN
Volume Title
Publisher
Universidad Tecnica Federico Santa Maria
Abstract
Dia a dIa la cantidad de nuevos datos aumenta de manera exponencial. Uno de los principales
problemas es cOmo manejar y extraer información relevante de estos. Hadoop MapReduce se
ofrece como una de las herramientas para solucionar dicho problema.
El objetivo del trabajo fue evaluar el desempeño de Hadoop MapReduce junto a grandes
cantidades de datos. mediante la implementación de algoritmos para verificar las ventajas y
desventajas de dicha herramienta.
Se dernostró que Hadoop MapReduce ofrece una gran ventaja en cuanto a rapidez de análisis
y resultados en comparación a lenguajes secuenciales y scripting con archivos de gran tamaflo.
Sin embargo, la desventaja es La dificultad de desarrollo de codigo a corto plazo.
Every day the amount of new data increase in an exponential manner. One of the principal problems is how we can manage and extract important information from these. A tool that can be a solution for this problem is Hadoop MapReduce. The aim of this paper was to carry out an assessment of Hadoop MapReduce's performance with huge amount of data by the algorithms implementation in order to check the advantages and disadvantages of this tool. Here was demonstrated that Hadoop MapReduce offers a great advantage in terms of quickly analysis and results in comparison to sequential languages and scripting with large files. However, a disadvantage is the difficulty of code development in a short time.
Every day the amount of new data increase in an exponential manner. One of the principal problems is how we can manage and extract important information from these. A tool that can be a solution for this problem is Hadoop MapReduce. The aim of this paper was to carry out an assessment of Hadoop MapReduce's performance with huge amount of data by the algorithms implementation in order to check the advantages and disadvantages of this tool. Here was demonstrated that Hadoop MapReduce offers a great advantage in terms of quickly analysis and results in comparison to sequential languages and scripting with large files. However, a disadvantage is the difficulty of code development in a short time.
Description
Digitalizado de su versión en papel
Keywords
MINERIA DE DATOS, ALGORITMOS PARA COMPUTADOR, RECUPERACION DE DATOS (CIENCIA DE LA COMPUTACIÓN), BASE DE DATOS EN MINERIA
Citation
Campus
Universidad Técnica Federico Santa María UTFSM. Campus San Joaquín