Bassett, Ian (2016) Benchmarking Hive on Spark and SQL Server with the Real Time Data Warehousing Chain. Masters thesis, Dublin, National College of Ireland.
PDF (Master of Science)
Download (759kB) | Preview
PDF (Configuration File)
Download (1MB) | Preview
The following paper focuses on the field of Data Warehousing in two aspects. The first aspect will review Big Data performance comparing the emerging Hive on Apache Spark with SQL Server to determine when it would be appropriate to switch to a big data platform. The other aspect will investigate current software in the industry and how the continuous support of communities are creating to solve current and future barriers in the profession. A current issue in Data Warehousing and Business Intelligence is the development of Real Time Data Warehousing. This paper documents the research and progress of tools in the automation process of Real Time Data Warehousing.
|Item Type:||Thesis (Masters)|
|Subjects:||Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
Q Science > QA Mathematics > Computer software
T Technology > T Technology (General) > Information Technology > Computer software
|Divisions:||School of Computing > Master of Science in Data Analytics|
|Depositing User:||CAOIMHE NI MHAICIN|
|Date Deposited:||03 Dec 2016 12:00|
|Last Modified:||03 Dec 2016 12:00|
Actions (login required)