Homepage

International peer-reviewed conference proceedings

Raghunath Rajachandrasekar, Xavier Besseron, and Dhabaleswar K. Panda. Monitoring and predicting hardware failures in HPC clusters with FTB-IPMI. In International Workshop on System Management Techniques, Processes, and Services (SMTPS'12), held in conjunction with IPDPS'12, Shanghai, China, May 2012.

Vilobh Meshram, Xavier Besseron, Xiangyong Ouyang, Raghunath Rajachandrasekar, Ravi Prakash Darbha, and Dhabaleswar K. Panda. Can a decentralized metadata service layer benefit parallel filesystems? In Workshop on Interfaces and Architectures for Scientific Data Storage (IASDS'11), held in conjunction with Cluster'11, Austin, Texas, USA, September 2011.

Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Hao Wang, Jian Huang, and Dhabaleswar K. Panda. CRFS: A lightweight user-level filesystem for generic checkpoint/restart. In International Conference on Parallel Processing (ICPP'11), Taipei, Taiwan, September 2011.

Raghunath Rajachandrasekar, Xiangyong Ouyang, Xavier Besseron, Vilobh Meshram, and Dhabaleswar K. Panda. Can checkpoint/restart mechanisms benefit from hierarchical data staging? In Workshop on Resiliency in High Performance Computing in Clusters, Clouds, and Grids (Resilience'11), held in conjunction with EuroPar'11, Bordeaux, France, August 2011.

Xavier Besseron and Thierry Gautier. Impact of over-decomposition on coordinated checkpoint/rollback protocol. In Workshop on Resiliency in High Performance Computing in Clusters, Clouds, and Grids (Resilience'11), held in conjunction with EuroPar'11, Bordeaux, France, August 2011.

Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, and Dhabaleswar K. Panda. High performance pipelined process migration with RDMA. In 11th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid'11), Newport Beach, California, USA, May 2011.

Xavier Besseron and Thierry Gautier. Optimised recovery with a coordinated checkpoint/rollback protocol for domain decomposition applications. In Modelling, Computation and Optimization in Information Systems and Management Sciences (MCO'08), Metz, France - Luxembourg, September 2008.

Thierry Gautier, Xavier Besseron, and Laurent Pigeon. Kaapi: A thread scheduling runtime system for data flow computations on cluster of multi-processors. In Parallel Symbolic Computation'07 (PASCO'07), London, Ontario, Canada, July 2007.

Xavier Besseron, Samir Jafar, Thierry Gautier, and Jean Louis Roch. CCK: An improved coordinated checkpoint/rollback protocol for dataflow applications in Kaapi. In IEEE Conference on Information and Communication Technologies: from Theory to Applications (ICTTA'06), Damascus, Syria, April 2006.

Scientific books and chapters

Xavier Besseron, Slim Bouguerra, Thierry Gautier, Érik Saule, and Denis Trystram. Fault tolerance and availability awareness in computational grids, chapter 5. Numerical Analysis and Scientific Computing. Chapman and Hall/CRC Press, December 2009.

National peer-reviewed journals

Xavier Besseron, Laurent Pigeon, Thierry Gautier, and Samir Jafar. Un protocole de sauvegarde / reprise coordonné pour les applications à flot de données reconfigurables. Technique et Science Informatiques (TSI), 27, 2008.

National peer-reviewed conference proceedings

Xavier Besseron, Christophe Laferrière, Daouda Traoré, and Thierry Gautier. X-kaapi : Une nouvelle implémentation extrême du vol de travail. In Proceedings Des 19èmes Rencontres Francophones Du Parallélisme (RenPar'19), Toulouse, France, September 2009.

Xavier Besseron, Laurent Pigeon, Thierry Gautier, and Samir Jafar. Un protocole de sauvegarde/reprise coordonné pour les applications à flot de données reconfigurables. In Proceedings Des 17èmes Rencontres Francophones Du Parallélisme (RenPar'17), Perpignan, France, October 2006.

Theses

Xavier Besseron. Tolérance aux fautes et reconfiguration dynamique pour les applications distribuées à grande échelle. PhD thesis, Université de Grenoble, Grenoble, France, April 2010.

Xavier Besseron. CCK : un protocole coordonné de sauvegarde/reprise pour la tolérance aux pannes des applications itératives en calcul numérique. Master's thesis, Université Joseph Fourier, Grenoble, France, June 2006.

Presentations

Xavier Besseron, Thierry Gautier, Gengbin Zheng, and Laxmikant V. Kalé. Kaapi / Charm++ preliminary comparison. In Third workshop of the Joint Laboratory for Petascale Computing, Bordeaux, France, June 2010.

Xavier Besseron. Reconfiguration dynamique et tolérance aux fautes pour les applications distribuées à grande échelle. In PhD defense, INRIA Grenoble Rhône Alpes, France, April 2010.

Xavier Besseron. Fault tolerance for a data flow model. In Visit at the Parallel Programming Laboratory, University of Illinois at Urbana-Champaign, USA, March 2010.

Xavier Besseron and Thierry Gautier. Optimized coordinated checkpoint/rollback protocol using a dataflow graph model. In Workshop APRETAF : Algorithmes Parallèles, Répartis Et Tolérance Aux Fautes, Grenoble, France, January 2009.

Xavier Besseron, Vincent Danjean, Thierry Gautier, Serge Guelton, Guillaume Huard, and Frédéric Wagner. IV Grid Plugtests: composing dedicated tools to run an application efficiently on Grid'5000. In 3rd EGEE User Forum, Clermont-Ferrand, France, February 2008.

