Abstract
[en] Recent high performance computing architectures come with more and more cores on a greater number of computational nodes. Memory buses and communication networks are reaching critical levels of use. Programming parallel codes for those architectures requires emphasizing these constraints when designing tailored algorithms. In this thesis, a plasma turbulence simulation code is analyzed and its parallelization is overhauled. The gyroaverage operator benefits from a new algorithm that is better suited to its data distribution and that uses a computation-communication overlap scheme. Those optimizations reduce both execution time and memory footprint. We also study new designs for the code by developing a prototype based on a task-based programming model and an asynchronous communication scheme. It allows us to reach better load balancing and thus to achieve better execution times by minimizing communication overheads. A new reduced mesh is introduced, shrinking the overall mesh size while keeping the same numerical accuracy, at the expense of more complex operators. This prototype also uses a new data distribution and twists the mesh to adapt to the complex geometries of modern tokamak reactors. The performance of the different optimizations is studied and compared to that of the current code. A scaling case on a large number of cores is presented. (author)
[fr, English translation]
The most recent high performance computing architectures include more and more compute nodes, which themselves contain more cores. Memory buses and communication networks are subject to a critical level of use. Parallel programming on these new machines requires paying particular attention to these issues when writing new algorithms. In this thesis we analyze a plasma turbulence simulation code and propose an overhaul of the parallelization of the gyroaverage operator that is better suited in terms of data distribution and that benefits from an efficient computation-communication overlap scheme. The optimizations yield a gain in communication costs and memory footprint. We also study how this code could evolve by designing a prototype that uses a task-based programming model and a suitable asynchronous communication scheme. This makes it possible to reach better load balancing, so as to maximize the time spent computing and minimize communication between processes. A spatially adaptive reduced mesh is proposed, decreasing the number of points without losing accuracy, but adding an extra layer of complexity. This prototype also explores a different data distribution as well as a mesh in complex geometry suited to the new tokamak configurations. The performance of the different optimizations is studied and compared with the pre-existing code, and a sizing case on a large number of cores is presented.
Original Title
Algorithmes a grain fin et schemas numeriques pour des simulations exascales de plasmas turbulents
Source
17 Dec 2018; 153 p; 75 refs.; Available from the INIS Liaison Officer for France, see the INIS website for current contact and E-mail addresses; Informatique
Record Type
Report
Literature Type
Thesis/Dissertation