i forget to say, that communicator::barrier is the first operation which is called when starting a prozess. i do this to synchronize the processes.<br><br>Stephan<br><br><div class="gmail_quote">2010/8/21 Stephan Hackstedt <span dir="ltr">&lt;<a href="mailto:stephan.hackstedt@googlemail.com">stephan.hackstedt@googlemail.com</a>&gt;</span><br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">I found out , that when i use a send -&gt; recv communication on the nodes, before calling communicator::barrier, it works. maybe calling the point to point operatrion before the collective synchronizises both prozesses?<br>

Now i need a solution for using communicator::barrier as first call.<br><br>Stephan<br><br><div class="gmail_quote">2010/8/20 Stephan Hackstedt <span dir="ltr">&lt;<a href="mailto:stephan.hackstedt@googlemail.com" target="_blank">stephan.hackstedt@googlemail.com</a>&gt;</span><div>
<div></div><div class="h5"><br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">Hi there,<br><br>i have a big problem by running MPI programs which use the Boost.MPI library. When i&#39;m trying to run programs on <b>more </b>than one node, collective operations like <code></code><a href="http://boost.org/doc/libs/1_44_0/doc/html/boost/mpi/communicator.html#id918378-bb" target="_blank">communicator::barrier</a> or <code></code><a href="http://boost.org/doc/libs/1_44_0/doc/html/boost/mpi/broadcast.html" title="Function broadcast" target="_blank">broadcast,</a> or even the <code><a href="http://boost.org/doc/libs/1_44_0/doc/html/boost/mpi/environment.html" title="Class environment" target="_blank">environment</a></code>
                  destructor (cause of FINALIZE, which is colletive) causing the programm to crash. I got errors like this :<br><br><b>[1]terminate called after throwing an instance of &#39;boost::exception_detail::clone_impl&lt;boost::exception_detail::error_info_injector&lt;boost::mpi::exception&gt; &gt;&#39;<br>


<br>[1]� what():� MPI_Barrier: Other MPI error, error stack:<br>[1]PMPI_Barrier(362).................: MPI_Barrier(MPI_COMM_WORLD) failed<br>[1]MPIR_Barrier_impl(255)............: <br>[1]MPIR_Barrier_intra(79)............: <br>


[1]MPIC_Sendrecv(186)................: <br>[1]MPIC_Wait(534)....................: <br>[1]MPIDI_CH3I_Progress(184)..........: <br>[1]MPID_nem_mpich2_blocking_recv(895): <br>[1]MPID_nem_tcp_connpoll(1746).......: Communication error with rank 0: </b><br>


<br>I also  tested this with the simple broadcast example from the Boost.MPI tutorial - same errors..<br>But when using the original MPI equivalent without the Boost.MPI library, such as <a href="http://www.mpi-forum.org/docs/mpi-11-html/node66.html#Node66" target="_blank"><code><span>MPI_Barrier</span></code></a>, the programm runs well. I am using MPICH2 on Ubuntu 10.04 platforms.<br>


Someone had problems like this or know a fix for that?<br><br>Regards,<br><font color="#888888"><br>stephan<br><br><br>
</font></blockquote></div></div></div><br>
</blockquote></div><br>