MPI Collective Optimization

Felix, qui, quod amat, defendere fortiter audet

Home -> Research -> Older Projects -> CollOpt

  Publications
  Awards
  Research
    NB Collectives
    MPI Topologies
    MPI Datatypes
    Netgauge
    Network Topologies
    Ethernet BTL eth
    ORCS
    DFSSSP
    Older Projects
      CollOpt
      Abinit
    cDAG
    LogGOPSim
    CoMPIler
  Teaching

Miscellaneous

  Full CV [pdf]
  BLOG
  bio

Events

Past Events

MPI Collective Optimization

MPI Collective Operations are crucial for the running time of many parallel applications and algorithms. Our group tries to enhance the running time of Collective Operations on particular networks (e.g. InfiniBand, Ethernet). We are also testing the impact of non-blocking collectives on current applications (e.g. Linpack, Ab Initio Codes).

InfiniBand optimized collective operations

A collective component which enables optimized collective comunication over InfiniBand is under development at the TU Chemnitz. Currently, only an optimized InfiniBand barrier (see "Fast Barrier Synchronization for InfiniBand") is available. Other collective operations are currently under development by students under my advisory. We use different hardware sepcialities of InfiniBand, as the different transport types, Multicast and the hardware parallelism (see "LogfP - A Model for small Messages in InfiniBand") to optimize collective operations. The main goal is to prove that all collective operations can be implemented in a non-blocking manner without a performance penalty. Non-blocking collective operations can be used to overlap communication and computation efficiently.

Adaptive collective operations

We investigated also the possibility for self-optimization techniques for collective operations. A result of this is an adaptive barrier which benchmarks a set of different barrier algorithms during the creation of each communicator and chooses the fastest one to perform the operation afterwards. We used this framework to verify LogP predictions for all operations, see "A practical approach to the rating of barrier algorithms using the LogP model and Open-MPI".


serving: 216.73.216.0:61221	© Torsten Hoefler