Saturday, 10 January 2015, 19:18

Understanding Vertical Scalability of I/O Virtualization for MapReduce Workloads: Challenges and Opportunities

Abstract: As the explosion of data sizes continues to push the limits of our ability to efficiently store and process big data, next-generation big data systems face multiple challenges. One important challenge is the limited scalability of I/O, a determining factor in the overall performance of big data applications. Paradigms like MapReduce have long been used to take advantage of local disks and avoid data movement over the network as much as possible. However, as the core count per node increases, local storage itself comes under increasing I/O pressure, which prompts the need to equip nodes with multiple disks. Given the rising need to virtualize large datacenters in order to provide more flexible allocation and consolidation of physical resources (transforming them into public or private/hybrid clouds), the following questions arise: is it possible to take advantage of multiple local disks at the virtual machine (VM) level in order to speed up big data analytics? If so, what are the best practices to achieve a high virtualized aggregated I/O throughput? This paper aims to answer these questions in the context of I/O-intensive MapReduce workloads: it analyzes and characterizes their behavior under different virtualization scenarios in order to propose best practices for current approaches and speculate on future areas of improvement.

Keywords: I/O virtualization, big data, vertical I/O scalability

Document type:

Conference paper

BigDataCloud'13: 2nd Workshop on Big Data Management in Clouds, Aug 2013, Aachen, Germany

Domain:

Computer Science / Distributed, Parallel, and Cluster Computing

Source: https://hal.archives-ouvertes.fr/hal-00856877v1

Bogdan Nicolae 1, *

* Corresponding author

1 IBM Research - Ireland
