Checkpoint and Restart: An Energy Consumption Characterization in Clusters

The fault tolerance method currently used in High Performance Computing (HPC) is rollback-recovery using checkpoints. Like any other fault tolerance method, it adds energy consumption on top of that of the application's execution. The objective of this work is to...

Bibliographic Details
Main authors: Morán, Marina; Balladini, Javier; Rexachs, Dolores; Luque, Emilio
Format: Article (accepted version)
Language: English
Published: arXiv 2024
Online access: https://rdi.uncoma.edu.ar/handle/uncomaid/19173