 Research
 Open Access
 Published:
On the reliability of Nbody simulations
Computational Astrophysics and Cosmology volume 2, Article number: 2 (2015)
Abstract
The general consensus in the Nbody community is that statistical results of an ensemble of collisional Nbody simulations are accurate, even though individual simulations are not. A way to test this hypothesis is to make a direct comparison of an ensemble of solutions obtained by conventional methods with an ensemble of true solutions. In order to make this possible, we wrote an Nbody code called Brutus, that uses arbitraryprecision arithmetic. In combination with the BulirschStoer method, Brutus is able to obtain converged solutions, which are true up to a specified number of digits.
We perform simulations of democratic 3body systems, where after a sequence of resonances and ejections, a final configuration is reached consisting of a permanent binary and an escaping star. We do this with conventional doubleprecision methods, and with Brutus; both have the same set of initial conditions and initial realisations. The ensemble of solutions from the conventional simulations is compared directly to that of the converged simulations, both as an ensemble and on an individual basis to determine the distribution of the errors.
We find that on average at least half of the conventional simulations diverge from the converged solution, such that the two solutions are microscopically incomparable. For the solutions which have not diverged significantly, we observe that if the integrator has a bias in energy and angular momentum, this propagates to a bias in the statistical properties of the binaries. In the case when the conventional solution has diverged onto an entirely different trajectory in phasespace, we find that the errors are centred around zero and symmetric; the error due to divergence is unbiased, as long as the timestep parameter, \(\eta\le2^{5}\) and when simulations which violate energy conservation by more than 10% are excluded. For resonant 3body interactions, we conclude that the statistical results of an ensemble of conventional solutions are indeed accurate.
Introduction
Analytical solutions to the Nbody problem are known for \(N=2\), which are the familiar conic sections. Also, for several systems possessing symmetries, analytical solutions have been found, for example the equilateral triangle (Lagrange 1772). For a more general initial configuration, solutions have to be obtained by means of numerical integration. Given an initial Nbody realisation, one can calculate all mutual forces and subsequently the net acceleration of each particle. Different integration methods exist which take the accelerations, and update the positions and velocities to a time \(t+\Delta t\), with Δt the timestep size. This process is repeated until the end time is reached.
Miller (1964) recognised that obtaining the solution to an Nbody problem by numerical integration is difficult. This is caused by exponential divergence. Consider a certain Nbody problem, i.e. N pointparticles, each with a given mass, position and velocity. This system evolves with time in a definite and unique way. If one goes back to the initial state and slightly perturbs only one coordinate of a single particle, the perturbed Nbody problem will also have a definite and unique but different solution than the original one. When the two solutions are compared as a function of time, it is observed that differences can grow exponentially (Miller 1964; Dejonghe and Hut 1986; Goodman et al. 1993; Hut and Heggie 2002). If the initial perturbation is due to a numerical error, the calculated solution will also diverge away from the true solution.
Several authors have estimated the timescale of this divergence (Goodman et al. 1993; Hut and Heggie 2002), and arrived at an efolding timescale of the order a dynamical, crossing time. Simulation times of interest are typically much longer than a crossing time and therefore staying close to the true solution is numerically challenging.
If the result of a direct Nbody simulation of for example a star cluster, has diverged away from the true solution, the result may well be meaningless (Goodman et al. 1993). The general consensus however, is that statistically the results are representative for the true solution to the Nbody problem (Smith 1979; Heggie 1991; Goodman et al. 1993). The underlying idea is that the statistics of an ensemble of Nbody simulations are representative for the true statistics, obtained by an ensemble of true solutions, with the same set of initial conditions. We regard this the hypothesis we want to test.
One way to test this hypothesis is to directly compare statistics obtained by conventional methods, with the statistics obtained from an ensemble of true solutions. To obtain true solutions, we wrote an Nbody code which can solve the Nbody problem to arbitrary precision.
Such a code can be realised if the different sources of error are controlled. The error has contributions from the time discretisation of the integrator and the roundoff due to the limited precision of the computer (Zadunaisky 1979). Another possible source of error is in the initial conditions, for example the configuration of the Solar System is only approximately known (Ito and Tanikawa 2002). However, if the initial condition is a random realisation of a distribution function, this is less often a problem. Using the BulirschStoer method (Bulirsch and Stoer 1964; Gragg 1965), the discretisation error can be controlled to stay within a specified tolerance. Using arbitraryprecision arithmetic instead of conventional doubleprecision or singleprecision, the roundoff error can be reduced by increasing the number of digits.
We obtain converged solutions to the Nbody problem by decreasing the BulirschStoer tolerance and increasing the number of digits systematically. We define a converged solution in our experiments as a solution for which the first specified number of decimal places of every phasespace coordinate in our final configuration in the Nbody experiment becomes independent of the length of the mantissa and the BulirschStoer tolerance. We explain the method of convergence in Section 2 and we give examples of the procedure in Section 3.
Using this new, brute force Nbody code which we call Brutus, we test the reliability of Nbody simulations by a controlled numerical experiment which we describe in Section 4. In this experiment we perform a series of resonant 3body simulations, where the term resonant implies a phase or multiple phases during the interaction where the stars are more or less equidistant (Hut and Bahcall 1983). These phases are intermingled by ejections, where a binary and single star are clearly separated. We perform the simulations with conventional doubleprecision, and with arbitraryprecision to reach the converged solution. In Section 5, the solutions are compared individually to investigate the distribution of the errors. We also compare the global statistical distributions using twosample KolmogorovSmirnov tests (Kolmogorov 1933; Smirnov 1948).
Methods
The benchmark integrator
The gravitational Nbody problem aims to solve Newton’s equations of motion under gravity for N stars (Newton 1687). A popular integrator to perform this task is the fourthorder Hermite predictorcorrector scheme (Makino and Aarseth 1992), using doubleprecision arithmetic. The experiments we discuss in Section 4 will use this integrator as a benchmark test. We adopt a shared, adaptive timestepping scheme with the following criterion:
Here η is the timestep parameter and \(\Delta r_{ij}\) and \(\Delta a_{ij}\) are the relative distance and acceleration for the pair of particles i and j. We implement no further constraints on the timestep size.
To test how inaccurate we are allowed to integrate while still obtaining accurate statistics (Smith 1979; Quinlan and Tremaine 1992) we vary the timestep parameter η, to obtain statistics from conventional simulations with different precision.
The Brutus Nbody code
The results obtained with the benchmark integrator are compared to those obtained with Brutus, which uses an arbitraryprecision library.^{Footnote 1} With this library we can specify the number of bits, \(L_{\mathrm{w}}\), used to store the mantissa, while the exponent has a fixed wordlength. The length of the mantissa can be specified and increased, with the aim of controlling the roundoff error.
The integration of the equations of motion is realised using the Verletleapfrog scheme (Verlet 1967). The timestep is shared among all particles, but varies for every step according to equation (1).
To control the discretisation error, we implemented the BulirschStoer (BS) method, which uses iterative integration and polynomial extrapolation to infinitesimal timestep size (Bulirsch and Stoer 1964; Gragg 1965). An integration step is accepted, when two subsequent BS iterations have converged to below the BS tolerance level, ϵ.
The timestep parameter η and the BS tolerance ϵ, both influence the performance. If η is too big, convergence may not be achieved for any tolerance. If η is too small, the many integration steps will render the integration too expensive. There is an optimal value for η as a function of ϵ. We measured this relation empirically, which results in:
For \(\epsilon< 10^{50}\) the powerlaw converges to \(A=0.029\) and \(B=0.45\). Extrapolating this relation to \(\epsilon> 10^{50}\) will cause the timestep size to become larger than the time scale for the closest encounter in the system. Therefore this relation saturates to a flatter powerlaw for \(\epsilon> 10^{50}\) with \(A=0.012\) and \(B=0.40\). Compared to a fixed value for η, this relation speeds up the iterative procedure by about a factor three or more. The code is implemented as a community code in the AMUSE framework (Portegies Zwart et al. 2012) under the name Brutus.
Method of convergence
For every simulation we have to define the BS tolerance parameter ϵ and the wordlength \(L_{\mathrm{w}}\). In an iterative procedure we vary both parameters systematically, each time carrying out a simulation until \(t=t_{\mathrm{end}}\). We subsequently calculate the phase space distance, \(\delta^{2}_{\mathrm{A},\mathrm{B}}\), between two solutions A and B:
The first summation is over all particles and the second summation is over the six phasespace coordinates denoted by q (Miller 1964). We normalise by 6N, so that δ represents the average difference per phasespace coordinate between two solutions A and B. In our experiments we adopt Hénon units^{Footnote 2} (Hénon 1971; Heggie and Mathieu 1986), in which the typical values for the distance and velocity are of the same order. We will also use the distance in just position or just velocity space as they might behave differently.
We consider the solutions A and B to be converged when \(\delta _{\mathrm{A},\mathrm{B}}<10^{p}\) at all times during the simulation. Note that converged in this case means convergence of the total solution, contrary to convergence per integration step as in the previous section. This criterion for convergence is roughly equivalent to comparing the first p decimal places of the positions and velocities for all N stars, in two subsequent calculations A, B. In most of our experiments we adopt \(p=3\), i.e. all coordinates have to converge to about three decimal places or more. We perform a subset of simulations with \(p=15\) to investigate the effect of small errors (see Section 5.4.3).
Each simulation starts by specifying the initial positions and velocities of N stars in doubleprecision (see Section 4). The simulation is carried out with the parameter set \((\epsilon, L_{\mathrm{w}})\). We start each simulation with \(\epsilon=10^{6} \) and \(L_{\mathrm{w}}=56\mbox{ bits}\). This corresponds to a level of accuracy similar to what we reach with the conventional Hermite integrator. After this simulation, we increase \(L_{\mathrm{w}}\), for example to 72 bits (∼22 decimal places), redo the simulation and calculate \(\delta^{2}\). We repeat this procedure until \(\delta< 10^{p}\). When this is achieved, we have obtained a solution in which the roundoff error is below a specified number of decimal places for this particular value of ϵ.
We now reduce the tolerance parameter ϵ, for example by a factor of 100, and repeat the procedure of increasing \(L_{\mathrm{w}}\). This series will again lead to a converged solution, but this time it is obtained using a smaller ϵ, and is likely to be different than the previous converged solution. We continue decreasing the value of ϵ by factors of 100 and repeat the procedure, until two subsequent iterations in ϵ lead to a converged solution with a value of \(\delta< 10^{p}\). By this time we are assured of having a solution to the gravitational Nbody problem, that is accurate up to at least p decimal places.
In practice, we speed up the procedure by writing the wordlength as a function of BS tolerance. Consider for example a BS tolerance of 10^{−20}. To reach convergence up to this level, we need at least 20 decimal places. Adding an extra buffer of 10 digits gives a total of 30 digits, or equivalently a wordlength of about 112 bits. For this example, 112 bits turns out to be a good minimum wordlength. For a first estimate of the wordlength, we use:
With this relation, we will only have to specify a single parameter ϵ, which directly controls the discretisation error and indirectly controls the roundoff error. For most of the systems in our experiment the discretisation error turns out to be the dominant source of error and as a consequence ϵ has to decrease quite drastically. When ϵ decreases, \(L_{\mathrm{w}}\) increases, even up to the point that there are many more digits available than really needed to control the roundoff error. In the case when the discretisation error dominates, the above defined minimum wordlength for a given BS tolerance will result in the converged solution. When the roundoff error dominates the wordlength should be varied independently.
Validation and performance
The Pythagorean problem
To show that our method works, we adopt the Pythagorean 3body system (Burrau 1913). Previous numerical studies have shown that this system dissolves into a binary and an escaper (Szebehely and Peters 1967; Aarseth et al. 1994). After many complex, close encounters the dissolution happens at about \(t=60\) time units (Dejonghe and Hut 1986), or about 16 crossing times.
We adopt the initial conditions for the Pythagorean problem and integrate up to \(t=100\). To illustrate how the method works we start with a high tolerance and short wordlength (\(\epsilon=10^{2}\), \(L_{\mathrm{w}}=40\mbox{ bits}\)), which is less precise than doubleprecision. In Figure 1, this calculation is compared to a simulation with (\(\epsilon=10^{4}\), \(L_{\mathrm{w}}=48\mbox{ bits}\)), through the yellow (upper) curves in the first three panels. After the first BS integration step, δ obtains a value of the order of the BS tolerance, and continues to increase due to exponential divergence, to eventually exceed \(\delta \sim 10^{1}\), after which the errors become on the order of the typical distance and speed in the system.
In the following step, we repeat the calculation with a precision of (\(\epsilon=10^{6}\), \(L_{\mathrm{w}}=56\mbox{ bits}\)), and compare the result with the calculation using (\(\epsilon=10^{4}\), \(L_{\mathrm{w}}=48\mbox{ bits}\)). The comparison is represented by the orange curves (second from above) in Figure 1. The overall behaviour of δ is similar, but the system diverges at a later time due to a higher initial precision.
We continue the iterative procedure until a converged solution has been obtained. In the first three panels of Figure 1, it can be seen that subsequent simulations with higher precision shift the curve to lower values of δ. Superposed on the steady growth of the error are sharp spikes, where the error grows by several orders of magnitude, after which the error restores again (Miller 1964). These spikes are dominated by errors in the velocity, as can be deduced by comparing the magnitude of the spikes in position and velocityspace. Eccentric binaries which are out of phase when comparing two solutions cause large, periodic errors in the velocity. We finish the procedure when a solution is obtained for which the criterion for convergence is fulfilled, considering the magnitude of the error between the sharp spikes (bottom, black curves).
In the bottom right panel of Figure 1, we compare solutions obtained by the Hermite integrator to the converged solution. The different curves belong to different timestep parameters; η = 2^{−3}, 2^{−5}, 2^{−7} up to 2^{−13}. Note that for a timestep parameter \(\eta< 2^{9}\), the curve is not shifted to lower values of δ, but even increases again. At this point roundoff error becomes important, making the solution less accurate. The final close encounter in the Pythagorean problem occurs around 60 time units, after which a permanent binary and an escaper are formed. The Hermite integrator is able to accurately reproduce the evolution up to this point, but not subsequently, because δ has increased to values of order unity or higher. This can be explained by a small error in the final close encounter between all three stars, such that the direction of the escaper is slightly different.
To obtain the converged solution up to the first three decimal places, a tolerance of 10^{−14} and a wordlength of 88 bits were needed. The simulation was about twice as slow compared to the Hermite simulation with \(\eta=2^{9}\). The Hermite simulation, however, had a slightly different solution and a final, relative energy conservation of 10^{−8}. Decreasing the value of η will improve the level of energy conservation, but due to roundoff error δ will not decrease.
The equilateral triangle
As a second test case, we adopt the 3body equilateral triangle as an initial condition (Lagrange 1772). In the exact solution this configuration remains intact, but small perturbations, such as produced by numerical errors, quickly cause the triangle to fall apart. For this problem, we also have a source of error in the initial conditions. Whereas the Pythagorean problem can be set up using integers, the initial condition for the equilateral triangle contains irrational numbers. To control the error in the initial condition, we calculate the initial coordinates with the same wordlength as used for the simulation.
In the left panel of Figure 2, a similar diagram is shown as for the Pythagorean problem in the lower left panel of Figure 1. The starting precision is \(\epsilon=10^{10}\) and the wordlength is a function of ϵ as in equation (4). Subsequent simulations are performed with a 10 orders of magnitude higher precision. For a short initial phase of 5 time units, the rate of divergence follows a power law. At later time, the solutions start to diverge exponentially with a characteristic rate independent of the tolerance and wordlength. To investigate this transition, we redo the simulations with a large, fixed wordlength of 512 bits (green dotted curves). This way, we reduce the amount of roundoff error. As a consequence the rate of divergence is first dominated by the accumulation of discretisation errors and this phase lasts for a longer time, until the transition in the behaviour of the divergence, is reached, but now at ∼45 time units. The time of the transition depends on wordlength. Why the exponential divergence starts once the roundoff error has kicked in, is a question that is still under investigation.
The red dashed curves in the same diagram in Figure 2 give the results of the fourthorder Hermite, which are compared with the most precise Brutus simulation (with \(\epsilon=10^{80}\), \(L_{\mathrm{w}}=352\mbox{ bits}\)). The timestep parameter η = 10^{−1}, 10^{−2}, 10^{−3} and 10^{−4} for subsequent curves. The Hermite integrations show a similar behaviour as the Brutus results, which could imply that the rate of divergence is a physical property of the configuration, rather than a property of the integrator.
In the right panel of Figure 2 we show the duration for which the triangular configuration remains intact as a function of BS tolerance. For this experiment we halt the simulation when the distance between any two particles has increased or decreased by 10%, after which the triangle falls apart quickly. This diagram also illustrates the linear relation between accuracy and time in this system, which is caused by the constant number of digits being lost during every unit of time. The small scatter is due to the discrete times at which we check the triangular configuration. The solid, blue line is a fit to the data and its slope is \(0.52(3)\), which is equivalent to a loss of \(1.9(1)\) digits per cycle.
A Plummer distribution with \(N=16\)
As a third test we simulate the dynamical formation of the first hard binary in a small star cluster. We select a moderate number of sixteen equal mass stars and draw them randomly from a Plummer distribution (Plummer 1911). We integrate this system for about ten crossing times and apply the method of convergence. In Figure 3 we present how two solutions with the same initial conditions, but different precisions, diverge as a function of time. The rate of exponential divergence, on average, starts rather constant, with a loss of ∼2/3 digits per time unit. This is equivalent to an efolding time of \(t_{\mathrm{e}}=0.65\), which is consistent with the results of Goodman et al. (1993) (see their Figure 8). From \(t=20\) onwards, the rate of divergence experiences systematic changes, in particular a steep rise of the error of about 10 orders of magnitude between \(t=26\) and \(t=29\). Such rises are a signature for the presence of a hard binary interacting with surrounding stars.
The right panel in Figure 3 shows the energy conservation (black bullets, solid line) and the normalized phase space distance (red triangles, dashed line) versus ϵ. Energy conservation is proportional to ϵ, but the solutions only start to converge for \(\epsilon<10^{34}\). More generally, even if conserved quantities like total energy are conserved to machineprecision or better, it is not guaranteed that the solution itself has converged.
The highest precision Brutus simulation in this example (\(\epsilon=10^{50}\), \(L_{\mathrm{w}}=232\mbox{ bits}\)), took about a day of wallclock time, which is about 7,000 times slower than a simulation with Hermite using \(\eta=2^{9}\).
Scaling of the wallclock time
The use of arbitraryprecision arithmetic dramatically increases the CPU time of Nbody simulations. Also the BS method, which performs integration steps iteratively, makes an integration scheme more expensive by at least a factor two or more. To investigate for example how feasible it would be to run a converged Nbody simulation for 10^{3} stars through core collapse, we perform a scaling test in which we vary the number of particles and the precision, ϵ and \(L_{\mathrm{w}}\).
We randomly select positions and velocities for N equal mass stars from the virialised Plummer distribution (Plummer 1911), for \(N=2, 4, 8\), …, up to 1,024. The BS tolerance is fixed at a level of 10^{−6} and the wordlength at 64 bits. We integrate the systems for one Hénon time unit and measure the wallclock time. In the top left panel in Figure 4 we show the wallclock time as a function of N, which fit the relation \(t_{\mathrm{CPU}} \propto N^{2.6}\).
For \(N>32\), it becomes efficient to parallellise the code. Our version implements iparallellisation (Portegies Zwart et al. 2008) in the calculation of the accelerations. In the top right panel of Figure 4, we plot the speedup, S, against the number of cores. For \(N = 1\mbox{,}024\), we obtain a speed up of a factor 30 using 64 cores.
In the lower panels of Figure 4 we present the scaling of the wallclock time with BS tolerance and wordlength. To measure the dependence on BS tolerance, we simulated a 16body cluster for 1 Hénon time unit. We varied the BS tolerance while keeping the wordlength fixed at \(L_{\mathrm{w}} = 1\mbox{,}024\mbox{ bits}\). The relation obtained converges to \(t_{\mathrm{CPU}} \propto\epsilon^{0.032}\). A similar experiment was performed to measure the dependence on wordlength. This time we fixed the BS tolerance at \(\epsilon=10^{10}\) and varied the wordlength. For \(L_{\mathrm{w}} < 1\mbox{,}024\), the relation can be estimated as \(t_{\mathrm{CPU}} \propto L_{\mathrm{w}}^{0.33}\), while for \(L_{\mathrm{w}} > 1\mbox{,}024\), \(t_{\mathrm{CPU}} \propto L_{\mathrm{w}}\). This transition depends on the internal workings of the arbitraryprecision library which we will not discuss here.
Using a very long wordlength of 4,096 bits, i.e. ∼10^{3} digits, results in a slowdown of a factor \(f_{\mathrm{s}} \sim16\) compared to 64 bits. But for some simulations a BS tolerance smaller than 10^{−50} can easily be required to reach convergence, and this will result in a slowdown of a factor \(f_{\mathrm{s}} > 100\). The very small BS tolerance is often the main cause for the slowdown of the simulations, instead of the increased wordlength.
Using the above results, we can construct the following model to estimate the wallclock time for integrating 1 Hénon time unit with \(L_{\mathrm{w}} < 1\mbox{,}024\mbox{ bits}\):
Integrating \(N=1\mbox{,}024\) with standard precision (\(\epsilon =10^{6}\), \(L_{\mathrm{w}}=64\mbox{ bits}\)), up to core collapse at ∼300 time units, and taking into account a speed up of a factor 30 due to parallellisation, we estimate a total wallclock time of a week. Increasing the precision to (\(\epsilon=10^{20}\), \(L_{\mathrm{w}}=112\mbox{ bits}\)), will take about a month. A precision of (\(\epsilon=10^{50}\), \(L_{\mathrm{w}}=232\mbox{ bits}\)) will take roughly a year. To estimate how much precision is needed, we will assume that the rate of exponential divergence before the formation of the first hard binary is approximately constant. In the left panel of Figure 3, the initial slopes correspond to a loss of ∼2/3 digits per time unit. We construct the following approximate model for the initial BS tolerance needed to end up with a converged solution:
Here ϵ is the BS tolerance parameter, \(\delta _{\mathrm{final}}\) is the final precision of all the coordinates in the system, \(R_{\mathrm{div}}\) is the approximately constant rate of divergence, e.g. the number of accurate digits lost per unit of time, and \(t_{\mathrm{cc}}\) is the core collapse time. We set the final precision to 10^{−6}, i.e. convergence to the first 6 decimal places, and we set the core collapse time to ∼300 as before. If we adopt \(R_{\mathrm{div}} = 2/3\), we estimate that we need an \(\epsilon\sim10^{206}\). This would take about 10^{5} years to finish. It would be more practical to simulate a 256body cluster. If we set the core collapse to 100 time units we estimate \(\epsilon\sim 10^{73}\), which would take about a month on a cluster of 64 Intel^{®} Xeon^{®} E5530 cores.
For direct Nbody codes, the time for integrating up to core collapse usually scales as \(\mathcal{O}(N^{3})\). Using the analysis above, we estimate that the time for converged core collapse simulations scales approximately exponentially. This is effectively caused by the exponential divergence.
Precision of statistical results: experimental setup
In the previous section we demonstrated that it is possible to obtain a converged solution for a particular initial condition. We have also shown that a solution obtained by Hermite diverges from the converged solution, even up to the point that the microscopic solution given by Hermite is beyond recognition. We now perform a statistical study, to examine the hypothesis that doubleprecision Nbody simulations produce statistically indistinguishable results, from those obtained from an ensemble of converged solutions with the same set of initial conditions. Because it is computationally expensive to reach convergence, we start investigating the hypothesis above by exploring the accuracy of 3body statistics.
The \(N=3\) experiment is inspired by the Pythagorean problem, where after a complex 3body interaction, a binary and an escaper are formed. As a variation to this, we define four different sets of initial conditions as follows:

1.
Plummer distribution equal mass.

2.
Plummer distribution with masses 1:2:4.

3.
Plummer distribution equal mass with zero velocities.

4.
Plummer distribution with masses 1:2:4 and zero velocities.
The positions and velocities of the three stars are selected randomly from a virialised Plummer distribution (Plummer 1911; Aarseth et al. 1974). For the cold collapse systems, we set the velocities to zero. Then we rescale the positions and velocities to virialise the systems if the initial velocities are nonzero, or we set the total energy equal to \(E=0.25\) if the system starts out cold. We adopt standard Hénon units (Hénon 1971; Heggie and Mathieu 1986) throughout.
In the case of the cold initial conditions, the systems start democratically, i.e. the minimal distance between each pair of particles is greater than \(N^{1}\). We reject initial conditions in which this criterion is not satisfied. This is to prevent initial realisations where two stars which are very near, fall to each other radially causing very long wallclock times for the integration. When starting with a democratic configuration, there will also be an initial close triple encounter (Aarseth et al. 1994), which is hard to integrate accurately and is therefore a good test. A total of 10,000 random realisations are generated for each set of initial conditions and can be found in Additional files 1, 2, 3 and 4.
We stop the simulations when the system is dissolved into a permanent binary and an escaper. The criteria used to detect an escaper are the following:

1.
escaper has a positive energy, \(E >0\),

2.
is a certain distance away from the center of mass, \(r > 2 r_{\mathrm{virial}}\),

3.
is moving away from the center of mass, \(r \cdot v > 0\).
The energy of the escaper is calculated in the barycentric frame of the three particles and \(r_{\mathrm{virial}}\) is the virial radius of the system, which is of the order unity in Hénon units.
There may be situations in which a star is ejected without actually escaping from the binary. After a long excursion the star turns around and once again engages the binary in a 3body resonance (Hut and Bahcall 1983). Because these systems need to be integrated for a longer time, they also require higher precision to reach convergence, which takes a long time to integrate [see also Hut (1993)]. To deal with this issue, we perform the simulations iteratively by increasing the final integration time \(t_{\mathrm{end}}\). Starting with \(t_{\mathrm{end}}=50\) Hénon time units, we evolve every system and detect those that are dissolved. Then we increase \(t_{\mathrm{end}}\) to 100, 150, 200, etc., but only for those systems which have not yet dissolved. A complete ensemble of solutions is obtained up to \(t_{\mathrm{end}} \sim 500\), or equivalently ∼180 crossing times where the crossing time has a value of \(2\sqrt{2}\) in Hénon units (Hénon 1971; Heggie and Mathieu 1986). Systems which take a longer time to integrate are not taken into account in this research. The fraction of longlived systems is however a statistic we measure. We gathered the final, converged configurations in Additional files 1, 2, 3 and 4.
Each initial realisation is run with the Hermite code, using standard doubleprecision, and with Brutus, using arbitraryprecision until a converged solution is obtained. At the end of each simulation, we investigate the nature of the binary and the escaper. In addition to the BS tolerance, wordlength, CPU time and dissolution time, we record the mass, speed and escape direction of the escaping single star, and the semimajor axis, binding energy and eccentricity of the binary. In this way, we obtain statistics for \(N=3\) generated by a conventional Nbody solver and by Brutus.
Results
Before we perform a detailed comparison between results obtained by Hermite and Brutus, we first compare the Brutus results with analytical distributions from the literature in order to relate to previous studies. We compare Hermite and Brutus on a global level by performing twosample KolmogorovSmirnov tests (Kolmogorov 1933; Smirnov 1948) to see whether global distributions are statistically indistinguishable. We also compare the distribution of lifetimes of triples to see whether precision influences the stability and we measure the typical CPU time and BS tolerance needed to obtain a converged solution. After this, we compare Hermite and Brutus per individual system, with the aim of investigating the nature of the differences of every individual outcome. Finally, we define categories which classify a conventional simulation as a preservation or exchange, depending on whether the identity of the escaping star is consistent between Hermite and Brutus.
Brutus versus analytical distributions
In Figure 5, the distributions obtained by converged solutions are given for the following quantities: velocity and kinetic energy of the escaper in the barycentric reference frame, and semimajor axis, binding energy and eccentricity of the binary. We start by looking at the eccentricity distributions (bottom panel in Figure 5). These distributions can be estimated analytically by assuming that the probability of a certain configuration is proportional to the associated volume in phase space (Monaghan 1976; Valtonen and Karttunen 2006) or by considering an equilibrium distribution of binary stars in a cluster (Heggie 1975). The resulting thermal distribution in the threedimensional case is given by
and in the twodimensional case by
The 3body cold collapse problem is essentially a twodimensional problem. We compare the empirical and theoretical distributions by means of the KS test (see also next section). It turns out that the distributions in eccentricity are statistically distinguishable. By inspection by eye we observe that in the virialised case, there are slight deviations at high eccentricities. In the case of the equalmass, cold systems, there are more low eccentricity binaries compared to the theoretical prediction. They coincide at an eccentricity of about 0.7, after which they deviate again. For the cold systems with unequal masses, this behaviour is the other way around. The analytical predictions are able to capture the empirical distributions only in a qualitative manner.
The velocity distribution of the single escaping star can be estimated analytically in a similar way as was done for the eccentricities. The resulting distribution is predicted to be a double powerlaw given by (Monaghan 1976; Valtonen and Karttunen 2006):
We fit this model to the data (see Figure 5, first panel) and obtain values for α and β which are given in Table 1. The powerlaw indices vary with mass ratio and total angular momentum. To remove the dependence on mass ratio, we plot the kinetic energy of the escaper (see Figure 5, top right panel). Again, we fit a double powerlaw of a similar form as equation (9), and the powerlaw indices are given in Table 1. Both the escaper velocity and kinetic energy are consistent with a double powerlaw distribution.
The binary semimajor axis and binding energy are related quantities. We fit the binding energy distribution (see Figure 5, middle right panel) to a powerlaw (Heggie 1975; Monaghan 1976; Valtonen and Karttunen 2006):
The fitted powerlaw indices are given in Table 1. The empirical distributions are consistent with a powerlaw, although somewhat steeper than predicted (Heggie 1975; Monaghan 1976; Valtonen and Karttunen 2006). The slopes do tend to vary somewhat as a function of angular momentum (Monaghan 1976; Valtonen and Karttunen 2006).
The empirical distributions obtained by Brutus are in qualitative agreement with the analytical estimates present in the literature (Heggie 1975; Monaghan 1976; Valtonen and Karttunen 2006). Slight variations are present due to the dependence on total angular momentum, a limited statistical sampling and assumptions made in the derivation of the analytical distributions. Nevertheless, a similar qualitative agreement has been obtained between the analytical distributions discussed above and empirical distributions from an ensemble of conventional numerical solutions, e.g. not converged (Valtonen and Karttunen 2006, Chapters 78 and references therein). The question remains to what extend conventional and converged solutions agree quantitatively.
Brutus versus Hermite: global comparison
A quantitative way to compare global distributions is by performing twosample KolmogorovSmirnov tests (KS tests) (Kolmogorov 1933; Smirnov 1948). The KS test gives the likelihood that two samples are drawn from the same distribution, quantified by the value called p. When the pvalue is below five percent, the distributions are considered to be significantly different.
In Figure 6 we plot the pvalue obtained by comparing the Brutus distribution with the Hermite distribution versus timestep parameter η used for Hermite. In the panel showing the data for the binary semimajor axis, the distributions of the cold systems become significantly different for \(\eta> 2^{6}\). The distributions from the initially virialised systems start to differ for \(\eta> 2^{4}\). The cold systems are harder to model accurately, because of the close encounters that occur shortly after the start. The reason the distributions start to become significantly different at large timesteps is because at these large timesteps most simulations violate energy conservation by \(\vert \Delta E/E \vert > 0.1\). When this occurs, solutions might reach regions in 6Ndimensional phasespace, which theoretically are forbidden. The distribution then becomes biased by these outlier solutions.
Lifetime of triple systems
In Figure 7, we present the fraction of triple systems which are undissolved, i.e. still interacting, as a function of time. The results by Brutus are represented by the data points: equalmass Plummer (black bullets), Plummer with different masses (red triangles), equalmass cold Plummer (blue squares) and cold Plummer with different masses (green stars). The results by Hermite for a timestep parameter \(\eta= 2^{5}\) are represented by the curves appearing to go through the data points.
The initially cold systems dissolve faster than the initially virialised systems. This is somewhat expected due to the close triple encounter resulting from the initial cold collapse: the rate of energy exchange can be very high for these encounters (Johnstone and Rucinski 1991). After ∼180 crossing times, about 40% of the systems which started with an equalmass Plummer initial configuration, are undissolved, compared to about 10% for the cold Plummer with different masses. Systems which include stars with different masses dissolve faster than their equal mass counterparts. Energy equipartition tends to cause the lightest particle to quickly reach the escape velocity.
In Figure 7, the grey curves through the data points represent the interpolated Hermite results. Even though Hermite and Brutus use different algorithms and precisions to solve the equations of motion, we find that the lifetime of an unstable triple is statistically indistinguishable between converged Brutus and nonconverged Hermite solutions (but see also Section 6.3).
In Figure 8, we plot the maximum CPU time and minimum BS tolerance, both as a function of dissolution time. This is shown for the Brutus simulations, for the four different initial conditions. The longer it takes for a system to dissolve, the longer the CPU time and the higher the precision needed to reach a converged solution. To reach ∼180 crossing times, there are systems which require a BS tolerance of the order 10^{−100}, with the final converged run taking of the order a few days. The average CPU time as a function of time is about an order of magnitude smaller than the maximum CPU time. The average BS tolerance ranges from ∼10^{−20} to 10^{−30}. For systems which dissolve within 100 crossing times, Brutus is on average about a factor 120 slower than Hermite.
We were able to obtain a complete ensemble of systems dissolving within ∼180 crossing times. Simulations which take longer than this are not taken into account in this experiment. The fraction of longlived systems as obtained by Hermite and Brutus are consistent. For our purpose of comparing results from conventional integrators with the converged solution, integrating up to ∼180 crossing times is sufficient, in the sense that there is enough time for conventional solutions to diverge from the true solution (see Section 5.4.1). Including the longlived triple systems may however influence the statistical distributions and biases on the long term.
Brutus versus Hermite: individual comparison
For the individual comparison, we take a certain initial realisation and compare the solutions of Hermite and Brutus. In Figure 9 we show scatter plots of the Hermite solution (with timestep parameter \(\eta=2^{5}\)) versus the converged Brutus solution for the equalmass Plummer data set.
Data points on the diagonal represent accurate solutions, whereas the scatter around it represents inaccurate Hermite solutions. The diagonal is present in each panel and extends throughout the range of possible outcomes. The width of the diagonal is very narrow. When the normalized phasespace distance between the Hermite and Brutus solution \(\delta< 10^{1}\), then the coordinates are accurate enough to produce derived quantities accurate to at least one decimal place and Hermite and Brutus will give similar results. Once \(\delta> 10^{1}\), the solution has diverged to a different trajectory in phasespace leading to a different outcome. This outcome could in principle be any of the possible outcomes as can be derived from the amount of scatter in the Hermite solutions at a fixed Brutus solution.
In the scatter plot of the dissolution time, we observe that for small times (\(t < 10\)), Hermite and Brutus agree on the solution in the sense that the data points lie on the diagonal. Systems which dissolve after a short time don’t have sufficient time to accumulate enough error to diverge to another trajectory in phasespace. Once however this level of divergence is reached, the scatter immediately covers the entire, available outcome space. This randomisation is also observed in the other panels.
The fraction of accurate solutions
In Figure 10 we estimate the fraction of data points on the diagonal as a function of the Hermite timestep parameter, η. We only include the data points for which the normalized phasespace distance \(\delta< 10^{1}\). For the largest timestep parameters used (\(\eta> 10^{1}\)) the fraction on the diagonal, or the accurate fraction, varies from zero to about 0.2. By reducing the timestep parameter, the accurate fraction increases until it saturates at about 0.4 to 0.7 depending on the initial conditions. Even though by reducing η, the discretisation error decreases, the number of integration steps increases, which then increases the roundoff error. For the data sets with zero angular momentum, the maximum accurate fraction is obtained for \(\eta\sim2^{9}\). For the initially virialised systems this seems to occur between \(\eta\sim 10^{3}\mbox{}10^{4}\), although the actual saturation point is not visible yet. This dependence on angular momentum is due to the initial cold collapse and subsequent close encounters, which increases the roundoff error.
The error distribution
In Figure 11 we present statistics on the distribution of the errors, i.e. \(S_{\mathtt{Hermite}}S_{\mathtt{Brutus}}\), with S a statistic. For the dissolution time and the eccentricity, the average error converges to zero for \(\eta< 10^{1}\). For larger timesteps, simulations which grossly violate energy conservation (\(\vert \Delta E/E \vert > 0.1\)) cause biases in the average error. For the binary semimajor axis however, the data representing the cold collapse simulations also seem to be systematically biased for small timesteps, in the sense that Hermite makes fewer tight binaries.
The width of the error distributions converge to a nonzero value. This can be understood because with decreasing timestep, roundoff errors will become more important so that the standard deviation of the errors will never reach zero. For the dissolution time, the width of the error distribution for the smallest timestep parameter adopted, varies from 60 to 100 crossing times. For the eccentricities the width is on average ∼0.2. For the semimajor axis the width approaches ∼0.05 (in Hénon units). In the case of the semimajor axis, the data representing the cold collapse simulations behave differently, because the width is much larger than the width for the data representing the initially virialised systems.
If we regard the results given by Brutus and Hermite as random variables drawn from the same distribution, then we can write the variance in a certain statistic, in this example the eccentricity, as:
Here e stands for eccentricity and the subscripts for Brutus and Hermite. For a thermal eccentricity distribution (equation (7)), we obtain a standard deviation of \(1/3\). However, this only applies to inaccurate Hermite results, which had enough time to diverge through outcome space. If we multiply the theoretical standard deviation calculated above by the inaccurate fraction, we obtain a range in the standard deviation from 0.17 to 0.27, as η ranges from the most precise value to \(\eta= 10^{1}\).
Symmetry of the error distribution
To measure the symmetry of the error distribution, we count the fraction of positive errors (Figure 11, bottom panels). Again for an \(\eta< 10^{1}\), this fraction converges to 0.5. A more detailed comparison is given in Figure 12, where we compare distribution functions of positive and negative errors. In Section 2.3, we mentioned that in our experiment we define the Brutus solution to be converged when at least 3 decimal places of every coordinate have converged. To investigate the symmetry up to higher precision, we repeated a subset of 1,000 simulations. We did this only for the initial conditions with equalmass stars picked randomly from a virialised Plummer distribution and this time we obtain solutions converged up to the first 15 decimal places.
We observe that the majority of errors are larger than ∼10^{−3} and within the statistical error, the positive and negative errors have a similar distribution. For the smallest errors however, we observe an asymmetry in the sense that there are more negative, small errors. The magnitude of the error where this excess occurs is determined by the precision of the integration. For the smallest η, the excess is below doubleprecision and thus not observable anymore (see Section 6.2 for more explanation).
Escaper identity
In this section we compare the solutions obtained with Hermite and Brutus individually, by looking at which star eventually becomes the escaper and which form the binary. We define preservation if the Hermite and the Brutus solution both have the same star as the escaper. We define it as exchange if the escaping star is different. A further distinction can be made in the preservation category, if the Hermite simulation is also accurate. We can typify each Hermite simulation as follows:

Accurate: The coordinates are accurate, up to at least two digits.

Preservation: The coordinates are inaccurate, but same star escapes.

Exchange: Different star escapes.
In Figure 13 we present the fraction of each category as a function of time. As expected, systems which dissolve quickly, hardly have time to develop errors and are categorized as accurate simulations. In time however, because errors grow exponentially, the solutions become inaccurate. The fractions of preservation and exchange start to grow. For a small timestep parameter (\(\eta= 2^{11}\), top row in Figure 13), this growth starts after ∼20 crossing times for the initially virialised systems. For the initially cold systems, the inaccurate fractions already start to grow after a single crossing time.
The cold collapse with equalmass stars is the hardest problem to integrate as the accurate fraction is of comparable magnitude as the preservation and exchange fractions. The accurate fraction generally remains dominant, with a final fraction varying from about 0.4 for the equalmass cold Plummer to about 0.7 for the Plummer with different masses. For the lesser precision (\(\eta= 2^{3}\), bottom row in the figure), the accurate fractions decrease to below 0.2.
In the panels in Figure 13, which include the data for the systems with different masses, preservation is more common than exchange. This can be understood, because due to energy equipartition, the lightest particle will be more likely to escape and therefore the identity is more often correct than in the equal mass case. For the equal mass case, the fraction of preservation and exchange is comparable, except in the case of the equalmass cold Plummer with the low precision (\(\eta= 2^{3}\), the bottom row). If we regard the identity of the escaping star to be completely random once the solution has become inaccurate, we would expect the fraction of exchange to be twice the fraction of preservation. This is roughly what we observe in the equal mass cold collapse case with low precision. Because of the low precision and the initial close encounter, solutions will diverge very quickly. In the panel with the higher precision this trend is not observed because the solutions are less randomised. The preservation category includes solutions which slightly differ from the converged solution only in the escape angle of the escaper. Also the longlived triples are not taken into account here, which will alter these fractions.
Discussion
Energy conservation
In every ensemble of Hermite solutions there are some that grossly violate conservation of energy \(\vert \Delta E/E \vert > 0.1\). This deformation of the energy hypersurface in phasespace can allow solutions to reach parts of phasespace which are theoretically forbidden. This affects the global statistical distributions. In Figure 14, we replot the average error in the binary semimajor axis as a function of the timestep parameter. We produce similar diagrams as presented in Figure 11, but this time we introduce a maximum allowed error in the energy. If we filter out simulations with a relative energy conservation \(\vert \Delta E/E \vert > 1\), or \(\vert \Delta E/E \vert > 0.1\), we observe that the bias in the average error of the semimajor axis of the binaries vanishes. We conclude that this bias is caused by a few simulations which grossly violate energy conservation. A similar bias in the velocity of the escaping star is less pronounced.
Timereversible, symplectic integrators should in principle conserve energy to a better level than nonsymplectic integrators, since there is no drift present in the energy error. Therefore, by using a symplectic integrator, the number of simulations with large energy error could be reduced. Using a Leapfrog integrator with constant timesteps, we tested this assumption and we find that for resonant 3body interactions, it is challenging to obtain accurate solutions. The main reason is that, contrary to regular systems like, for example, the Solar System, resonant 3body interactions often include very close encounters, which need a very small timestep size to be resolved accurately. This is especially the case for the initially cold systems. Adopting such a small timestep size for the whole simulation, will increase the wallclock time to that of Brutus or beyond.
Asymmetry at small errors
In Section 5.4.3, we discussed an asymmetry at small errors. In Figure 15, we present similar diagrams as in Figure 12 for the positive and negative errors. This time we add the errors in the total energy and angular momentum of the system and the error in the velocity of the escaper.
We also vary the integration method because different methods produce different (biased) error distributions in energy and angular momentum. We use a standard Leapfrog integrator, a standard Hermite integrator and a Hermite integrator which uses the \(P(EC)^{n}\) method (we adopted \(n=3\)) (Kokubo et al. 1998). This last method adds an iterative procedure to the algorithm to improve the predictions and corrections, which improves the timesymmetry. For each method we implement a shared, adaptive timestep criterion as in equation (1), with a timestep parameter \(\eta= 2^{7}\). As a consequence they will not be timesymmetric nor symplectic.
We first look at the error distributions in the total energy and angular momentum. We observe that none of them are symmetric, in the sense that the positive and negative errors have identical distributions, except for the angular momentum in the Leapfrog simulations. The Leapfrog solutions tend to gain energy, whereas the standard Hermite loses energy. The Hermite with the \(P(EC)^{n}\) method produces both positive and negative errors in the energy, but not in a symmetric manner.
To investigate whether the bias in energy and angular momentum conservation propagates to a bias in the binary and escaper properties, we estimate what the errors should be if we regard the error in the energy and angular momentum as a small perturbation to the converged solution. For the error in the velocity of the escaper, using the derivative of the kinetic energy with respect to velocity, we obtain the following expression:
Here m is the mass of a star, v the velocity as obtained by Brutus, δE the energy error and δv the error in the velocity due to this energy error. For the binary semimajor axis we obtain:
Here a is the semimajor axis from the Brutus solution. For the eccentricity we obtain:
Here μ is the total mass of the binary, ϵ and l the specific energy and specific angular momentum of the binary as obtained by Brutus. The error in the eccentricity δe has contributions from errors in the energy δϵ and angular momentum δl.
If we compare the resulting error distributions to the actual error distributions, we find that the approximated error distribution is positioned at the asymmetry in the empirical error distribution. This is most clearly seen for the semimajor axis and eccentricity (see Figure 15).
The reason why the approximated error distribution overestimates the excess, is because not all errors are solely due to an error in the energy and angular momentum. In time, the numerical solution diverges from the true solution and this error due to divergence will become more dominant. With this in mind, we can approximate the error in a statistic as follows:
Here S is a statistic that is related to energy and/or angular momentum, \(\delta S_{\mathrm{conservation}}\) is the error due to a small perturbation in the energy and/or angular momentum and \(\delta S_{\mathrm{divergence}}\) is the error due to divergence of the solution. When the solution has not diverged appreciably yet, the first type of error will dominate and possible biases can be observed. When the second type of error dominates, we observe that the symmetry is restored to within the statistical error.
Upon inspection of the velocity data, we observe no asymmetry in the Hermite results. When we measure which fraction of the energy error is reserved for the binary and which fraction for the escaper, we find that in most cases the error propagates to the binary. For the Leapfrog however, the asymmetry is still present.
Preservation of the macroscopic properties
Valtonen et al. (2004) state that the final statistical distributions forget the specific initial conditions and only depend on globally conserved quantities. This assumption makes predictions which are verified by our experiment. The results show that for a timestep parameter \(\eta< 2^{5}\), the distributions are statistically indistinguishable, even though at least half of the solutions diverged from the converged solution. If however, energy conservation is grossly violated, biases are introduced in the statistics. In our experiment, a maximum level of relative energy conservation of \(\vert \Delta E/E \vert = 0.1\) was sufficient to remove the biases. This is a much milder constraint than the \(\vert \Delta E/E \vert \sim10^{6}\) usually adopted in collisional simulations. Whether 0.1 is also sufficient for systems with more stars, should be verified experimentally. Heggie (1991) for example, finds that the energy of escaping stars in higherN systems, depends sensitively on integration accuracy. The maximum required level of energy conservation should be such that it is below the energy taken away from the cluster by the escaping stars.
The chaoticity of the 3body problem is illustrated by the scatter diagrams in Figure 9. For a certain value of a statistic obtained by Brutus, any other value in the allowed outcome space is reachable for the Hermite integrator. For example, if the converged solution gives an eccentricity for the binary of 0.6, a diverged solution can produce any eccentricity between 0 and 1. Once the solution has diverged from the true solution, it will start a random walk through or near the allowed phasespace until the 3body system has dissolved. We observed that this randomisation happens in such a way that the available outcome space is still completely sampled and that it preserves global statistical distributions.
In Section 4, we discussed that the lifetime of an unstable triple does not depend on the integrator used nor on the accuracy of that integrator. This last point should be interpreted in the sense that when more effort is put into performing simulations with higher precision, that this does not change the global statistics, even though individual solutions will change with precision (see for example the Hermite results in Figure 1). If instead we continue to decrease the precision, there will be a point where biases start to appear. Urminsky (Urminsky 2008) analysed the 3body Sitnikov problem and showed that the precision of the integration influences the average lifetime of triple systems, contrary to our results. The integration times in our experiment however, are much shorter. Obtaining a converged solution for a resonant 3body system for longer than 200 crossing times, is still computationally challenging. Therefore any statistical difference on the long term will not be visible in our experiment.
Conclusion
Brutus is an Nbody code that uses the BulirschStoer method to control discretisation errors, and arbitraryprecision arithmetic to control roundoff errors. By using the method of convergence, where we systematically vary the BulirschStoer tolerance parameter and the wordlength, we can obtain a solution for a particular Nbody problem, for which the first p digits in the mantissa are independent of the timestep size and wordlength. We call this solution converged to p decimal places.
Obtaining the converged solution is computationally expensive, mainly because of the exponential divergence of the solution. In some cases, BulirschStoer tolerances of 10^{−100} are needed to reach convergence. We estimate that the time for simulating a star cluster up to core collapse, until convergence, scales approximately exponentially with the number of stars. Simulations with 256 stars however, may be performed within a year of computing time.
The motivation to obtain expensive, converged solutions is to test the assumption that the statistics of an ensemble of approximate solutions, are indistinguishable from the statistics of an ensemble of true solutions. To put this assumption to the test, we have investigated the statistics on the breakup of 3body systems. In our experiment, a bound triple system will eventually dissolve into a binary and an escaping star. Solutions to every initial realisation were obtained using the standard Hermite integrator and using Brutus.
For systems with a long lifetime it is challenging to obtain the converged solution. Due to repeated ejections and resonances, many accurate digits will be lost and so a very small BulirschStoer tolerance is required. Therefore, we have set an integration limit at ∼180 crossing times. For equalmass, virialised systems, ∼40% of the random initial realisations were not dissolved by this time. For the initially cold systems with different masses this was ∼10%. Hermite and Brutus are consistent on the average lifetime of an unstable triple system. However, possible differences on the long term are not visible in this experiment.
When we compare the results on an individual basis, we find that on average about half of the Hermite solutions give accurate results, i.e. at most a 1% relative difference compared to Brutus. For the inaccurate results, the error distribution becomes unbiased and symmetric for a timestep parameter \(\eta\le2^{5}\) and implementing a maximum level of relative energy conservation of \(\vert \Delta E/E\vert < 0.1\).
Once the conventional solution has diverged from the converged solution, it will start a random walk through or near the allowed region in phase space. such that any allowed outcome of a statistic is reachable. This randomisation process completely samples the available outcome space of a statistic and it also preserves the global statistical distributions.
KolmogorovSmirnov tests were performed to compare the global distributions produced by Hermite and Brutus. No significant differences were detected when using the criteria mentioned above for the timestep parameter η and relative energy conservation. This research for the 3body problem supports the assumption that results from conventional Nbody simulations are valid in a statistical sense. We observed however that a bias is introduced for the smallest errors, if the algorithm used to solve the equations of motion, is biased in the conservation of energy and angular momentum. In this research however, this bias did not have an appreciable effect. It is important to see whether this remains true for statistics of higherN systems or systems with a dominant mass. An example of a higherN system where precision might play a role is a young star cluster (without gas) going through the process of cold collapse (Caputo et al. 2014). At the moment of deepest collapse, a fraction of stars will obtain large accelerations, so that a small error in the acceleration can cause large errors in the position and velocity. The rate of divergence can increase up to about 5 digits per Hénon time unit for 128 particles and it increases with N.
Notes
 1.
We use the opensource library GMP: http://gmplib.org/.
 2.
Formerly known as Nbody units. Introduced by D. Heggie at MODEST14.
References
Aarseth, SJ, Anosova, JP, Orlov, VV, Szebehely, VG: Global chaoticity in the Pythagorean threebody problem. Celest. Mech. Dyn. Astron. 58, 116 (1994)
Aarseth, SJ, Anosova, JP, Orlov, VV, Szebehely, VG: Close triple approaches and escape in the threebody problem. Celest. Mech. Dyn. Astron. 60, 131137 (1994)
Aarseth, SJ, Henon, M, Wielen, R: A comparison of numerical methods for the study of star cluster dynamics. Astron. Astrophys. 37, 183187 (1974)
Bulirsch, R, Stoer, J: Fehlerabschätzungen und extrapolation mit rationalen funktionen bei verfahren vom richardsontypus. Numer. Math. 6, 413427 (1964)
Burrau, C: Numerische Berechnung eines Spezialfalles des Dreikörperproblems. Astron. Nachr. 195, 113 (1913)
Caputo, DP, de Vries, N, Portegies Zwart, S: On the effects of subvirial initial conditions and the birth temperature of R136. Mon. Not. R. Astron. Soc. 445, 674685 (2014)
Dejonghe, H, Hut, P: Roundoff sensitivity in the Nbody problem. In: Hut, P, McMillan, SLW (eds.) The Use of Supercomputers in Stellar Dynamics. Lecture Notes in Physics, vol. 267, p. 212. Springer, Berlin (1986)
Goodman, J, Heggie, DC, Hut, P: On the exponential instability of Nbody systems. Astrophys. J. 415, 715 (1993)
Gragg, WB: On extrapolation algorithms for ordinary initial value problems. SIAM J. Numer. Anal. 2, 384403 (1965)
Heggie, DC: Binary evolution in stellar dynamics. Mon. Not. R. Astron. Soc. 173, 729787 (1975)
Heggie, DC: Chaos in the Nbody problem of stellar dynamics. In: Roeser, S, Bastian, U (eds.) Predictability, Stability, and Chaos in NBody Dynamical Systems, pp. 4762 (1991)
Heggie, DC, Mathieu, RD: Standardised units and time scales. In: Hut, P, McMillan, SLW (eds.) The Use of Supercomputers in Stellar Dynamics. Lecture Notes in Physics, vol. 267, p. 233. Springer, Berlin (1986)
Hénon, MH: The Monte Carlo method (Papers appear in the Proceedings of IAU Colloquium No. 10 Gravitational NBody Problem (ed. by Myron Lecar), R. Reidel Publ. Co., DordrechtHolland.) Astrophys. Space Sci. 14, 151167 (1971)
Hut, P: Binarysinglestar scattering. III  Numerical experiments for equalmass hard binaries. Astrophys. J. 403, 256270 (1993)
Hut, P, Bahcall, JN: Binarysingle star scattering. I  Numerical experiments for equal masses. Astrophys. J. 268, 319341 (1983)
Hut, P, Heggie, DC: Orbital divergence and relaxation in the gravitational Nbody problem. J. Stat. Phys. 109, 10171025 (2002)
Ito, T, Tanikawa, K: Longterm integrations and stability of planetary orbits in our Solar system. Mon. Not. R. Astron. Soc. 336, 483500 (2002)
Johnstone, D, Rucinski, SM: Statistical properties of planar zeroangularmomentum equalmass triple systems. Publ. Astron. Soc. Pac. 103, 359367 (1991)
Kokubo, E, Yoshinaga, K, Makino, J: On a timesymmetric Hermite integrator for planetary Nbody simulation. Mon. Not. R. Astron. Soc. 297, 10671072 (1998)
Kolmogorov, A: Sulla determinazione empirica di una legge di distribuzionc. 1st. Ital. Attuari. G. 4, 111 (1933)
Lagrange, JL: Essai sur le Problème des Trois Corps. Prix de l’Académie Royale des Sciences de Paris 6, 292 (1772)
Makino, J, Aarseth, SJ: On a Hermite integrator with AhmadCohen scheme for gravitational manybody problems. Publ. Astron. Soc. Jpn. 44, 141151 (1992)
Miller, RH: Irreversibility in small stellar dynamical systems. Astrophys. J. 140, 250 (1964)
Monaghan, JJ: A statistical theory of the disruption of threebody systems. I  Low angular momentum. Mon. Not. R. Astron. Soc. 176, 6372 (1976)
Newton, I: Philosophiae Naturalis Principia Mathematica (1687)
Plummer, HC: On the problem of distribution in globular star clusters. Mon. Not. R. Astron. Soc. 71, 460470 (1911)
Portegies Zwart, S, McMillan, S, Groen, D, Gualandris, A, Sipior, M, Vermin, W: A parallel gravitational Nbody kernel. New Astron. 13, 285295 (2008)
Portegies Zwart, S, McMillan, S, Pelupessy, I, van Elteren, A: Multiphysics simulations using a hierarchical interchangeable software interface. In: CapuzzoDolcetta, R, Limongi, M, Tornambè, A (eds.) Advances in Computational Astrophysics: Methods, Tools, and Outcome. Astronomical Society of the Pacific Conference Series, vol. 453, p. 317 (2012)
Quinlan, GD, Tremaine, S: On the reliability of gravitational Nbody integrations. Mon. Not. R. Astron. Soc. 259, 505518 (1992)
Smirnov, N: Table for estimating the goodness of fit of empirical distributions. Ann. Math. Stat. 19(2), 279281 (1948)
Smith, H Jr.: The dependence of statistical results from Nbody calculations on N. Astron. Astrophys. 76, 192199 (1979)
Szebehely, V, Peters, CF: Complete solution of a general problem of three bodies. Astron. J. 72, 876 (1967)
Urminsky, D: On the calculation of average lifetimes for the 3body problem. In: Vesperini, E, Giersz, M, Sills, A (eds.) IAU Symposium, vol. 246, pp. 235236 (2008)
Valtonen, M, Karttunen, H: The ThreeBody Problem. Cambridge University Press, Cambridge (2006)
Valtonen, M, Mylläri, A, Orlov, V, Rubinov, A: Statistical approach to the threebody problem. In: Byrd, GG, Kholshevnikov, KV, Myllri, AA, Nikiforov, II, Orlov, VV (eds.) Order and Chaos in Stellar and Planetary Systems. Astronomical Society of the Pacific Conference Series, vol. 316, p. 45 (2004)
Verlet, L: Computer ‘Experiments’ on classical fluids. I. Thermodynamical properties of LennardJones molecules. Phys. Rev. 159, 98103 (1967)
Zadunaisky, PE: On the accuracy in the numerical solution of the Nbody problem. Celest. Mech. 20, 209230 (1979)
Acknowledgements
We thank Douglas Heggie, Piet Hut, Michiko Fujii and Guilherme Gonçalves Ferrari for useful discussions and comments on the manuscript. TB would also like to thank Ann Young and Lucie Jíková for carefully reading the manuscript and improving the presentation. The authors also thank the referees for providing useful improvements to our manuscript. This work was supported by the Netherlands Research Council NWO (grants #643.200.503, #639.073.803 and #614.061.608) and by the Netherlands Research School for Astronomy (NOVA). Part of the numerical computations were carried out on the Little Green Machine at Leiden University and on the Lisa cluster at SURFSara in Amsterdam.
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
TB wrote the Brutus Nbody code, participated in designing the experiments, performed the Nbody simulations, gathered the results from the simulations, interpreted the results and wrote the major part of the manuscript. SPZ thought of the concept of the Brutus code, participated in designing the experiments, interpreted the results and helped to draft the manuscript. All authors read and approved the final manuscript.
Electronic Supplementary Material
Below are the links to the electronic supplementary material.
Rights and permissions
Open Access This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
About this article
Cite this article
Boekholt, T., Portegies Zwart, S. On the reliability of Nbody simulations. Comput. Astrophys. 2, 2 (2015). https://doi.org/10.1186/s4066801400053
Received:
Accepted:
Published:
Keywords
 methods: numerical
 methods: Nbody simulations
 stars: dynamics
 binaries: formation