i forget to say, that communicator::barrier is the first operation which is called when starting a prozess. i do this to synchronize the processes.<br><br>Stephan<br><br><div class="gmail_quote">2010/8/21 Stephan Hackstedt <span dir="ltr"><<a href="mailto:stephan.hackstedt@googlemail.com">stephan.hackstedt@googlemail.com</a>></span><br> <blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">I found out , that when i use a send -> recv communication on the nodes, before calling communicator::barrier, it works. maybe calling the point to point operatrion before the collective synchronizises both prozesses?<br> Now i need a solution for using communicator::barrier as first call.<br><br>Stephan<br><br><div class="gmail_quote">2010/8/20 Stephan Hackstedt <span dir="ltr"><<a href="mailto:stephan.hackstedt@googlemail.com" target="_blank">stephan.hackstedt@googlemail.com</a>></span><div> <div></div><div class="h5"><br> <blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">Hi there,<br><br>i have a big problem by running MPI programs which use the Boost.MPI library. When i'm trying to run programs on <b>more </b>than one node, collective operations like <code></code><a href="http://boost.org/doc/libs/1_44_0/doc/html/boost/mpi/communicator.html#id918378-bb" target="_blank">communicator::barrier</a> or <code></code><a href="http://boost.org/doc/libs/1_44_0/doc/html/boost/mpi/broadcast.html" title="Function broadcast" target="_blank">broadcast,</a> or even the <code><a href="http://boost.org/doc/libs/1_44_0/doc/html/boost/mpi/environment.html" title="Class environment" target="_blank">environment</a></code> destructor (cause of FINALIZE, which is colletive) causing the programm to crash. I got errors like this :<br><br><b>[1]terminate called after throwing an instance of 'boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::mpi::exception> >'<br> <br>[1]� what():� MPI_Barrier: Other MPI error, error stack:<br>[1]PMPI_Barrier(362).................: MPI_Barrier(MPI_COMM_WORLD) failed<br>[1]MPIR_Barrier_impl(255)............: <br>[1]MPIR_Barrier_intra(79)............: <br> [1]MPIC_Sendrecv(186)................: <br>[1]MPIC_Wait(534)....................: <br>[1]MPIDI_CH3I_Progress(184)..........: <br>[1]MPID_nem_mpich2_blocking_recv(895): <br>[1]MPID_nem_tcp_connpoll(1746).......: Communication error with rank 0: </b><br> <br>I also tested this with the simple broadcast example from the Boost.MPI tutorial - same errors..<br>But when using the original MPI equivalent without the Boost.MPI library, such as <a href="http://www.mpi-forum.org/docs/mpi-11-html/node66.html#Node66" target="_blank"><code><span>MPI_Barrier</span></code></a>, the programm runs well. I am using MPICH2 on Ubuntu 10.04 platforms.<br> Someone had problems like this or know a fix for that?<br><br>Regards,<br><font color="#888888"><br>stephan<br><br><br> </font></blockquote></div></div></div><br> </blockquote></div><br>