.. Initial Conditions
Josh Borrow, 5th April 2018
.. _Initial_Conditions_label:
Initial Conditions
==================
To run anything more than examples from our suite, you will need to be able to
produce your own initial conditions for SWIFT. We use the same initial
conditions format as the popular `GADGET-2
`_ code, which uses HDF5 for
its type 3 format. Note that we do not support the GADGET-2 types 1 and 2
formats.
One crucial difference is that whilst GADGET-2 can have initial conditions split
over many files SWIFT only supports initial conditions in one single file. **ICs
split over multiple files cannot be read by SWIFT**. See the
":ref:`multiple_files_ICs`" section below for possible solutions. In GADGET-2
having multiple files allows multiple ones to be read in parallel and is the
only way the code can handle more than 2^31 particles. This limitation is not in
place in SWIFT. A single file can contain any number of particles (well... up to
2^64...) and the file is read in parallel by HDF5 when running on more than one
compute node.
The original GADGET-2 file format only contains 2 types of particles: gas
particles and 5 sorts of collision-less particles that allow users to run with 5
separate particle masses and softenings. In SWIFT, we expand on this by using
two of these types for stars and black holes.
As the original documentation for the GADGET-2 initial conditions format is
quite sparse, we lay out here all of the necessary components. If you are
generating your initial conditions from python, we recommend you use the h5py
package. We provide a writing wrapper for this for our initial conditions in
``examples/KeplerianRing/write_gadget.py``.
You can find out more about the HDF5 format on their `webpages
`_.
Structure of the File
---------------------
There are several groups that contain 'auxiliary' information, such as
``Header``. Particle data is placed in separate groups depending of the type of
the particles. Some types are currently ignored by SWIFT but are kept in the
file format for compatibility reasons.
+---------------------+------------------------+----------------------------------------+
| HDF5 Group Name | Physical Particle Type | In code ``enum part_type`` |
+=====================+========================+========================================+
| ``/PartType0/`` | Gas | ``swift_type_gas`` |
+---------------------+------------------------+----------------------------------------+
| ``/PartType1/`` | Dark Matter | ``swift_type_dark_matter`` |
+---------------------+------------------------+----------------------------------------+
| ``/PartType2/`` | Background Dark Matter | ``swift_type_dark_matter_background`` |
+---------------------+------------------------+----------------------------------------+
| ``/PartType3/`` | Sinks | ``swift_type_sink`` |
+---------------------+------------------------+----------------------------------------+
| ``/PartType4/`` | Stars | ``swift_type_star`` |
+---------------------+------------------------+----------------------------------------+
| ``/PartType5/`` | Black Holes | ``swift_type_black_hole`` |
+---------------------+------------------------+----------------------------------------+
| ``/PartType6/`` | Neutrino Dark Matter | ``swift_type_neutrino`` |
+---------------------+------------------------+----------------------------------------+
The last column in the table gives the ``enum`` value from ``part_type.h``
corresponding to a given entry in the files.
Note that the only particles that have hydrodynamical forces calculated
between them are those in ``PartType0``. The background dark matter
particles are used for zoom-in simulations and can have different masses
(and as a consequence softening length) within the ``/PartType2`` arrays.
Necessary Components
--------------------
There are several necessary components (in particular header information) in a
SWIFT initial conditions file. Again, we recommend that you use the ``write_gadget``
script.
Header
~~~~~~
In the ``/Header/`` group, the following attributes are required:
+ ``Dimension``, an integer indicating the dimensionality of the ICs (1,2 or 3).
Note that this parameter is an addition to the GADGET-2 format and will be
ignored by GADGET. SWIFT will use this value to verify that the dimensionality
of the code matches the ICs. If this parameter is not provided, it defaults
to 3.
+ ``BoxSize``, a floating point number or N-dimensional (usually 3) array that
describes the size of the box. If only one number is provided (as per the
GADGET-2 standard) then the box is assumed have the same size along all the
axis. In cosmological runs, this is the comoving box-size expressed in the
units specified in the ``/Units`` group (see below). Note that, unlike GADGET,
we express all quantities in "h-free" units. So that, for instance, we express
the box side-length in ``Mpc`` and not ``Mpc/h``.
+ ``NumPart_Total``, a length 6 array of integers that tells the code how many
particles of each type are in the initial conditions file. Unlike traditional
GADGET-2 files, these can be >2^31.
+ ``NumPart_Total_HighWord``, a historical length-6 array that tells the code
the number of high word particles in the initial conditions there are. If you
are unsure, just set this to ``[0, 0, 0, 0, 0, 0]``. This does have to be
present but can be a set of 0s unless you have more than 2^31 particles and
want to be fully compliant with GADGET-2. Note that, as SWIFT supports
``NumPart_Total`` to be >2^31, the use of ``NumPart_Total_HighWord`` is only
here for compatibility reasons.
+ ``Flag_Entropy_ICs``, a historical value that tells the code if you have
included entropy or internal energy values in your initial conditions files.
Acceptable values are 0 or 1. We recommend using internal energies over
entropy in the ICs and hence have this flag set to 0.
You may want to include the following for backwards-compatibility with many
GADGET-2 based analysis programs:
+ ``MassTable``, an array of length 6 which gives the masses of each particle
type. SWIFT ignores this and uses the individual particle masses, but some
programs will crash if it is not included.
+ ``NumPart_ThisFile``, a length 6 array of integers describing the number of
particles in this file. If you have followed the above advice, this will be
exactly the same as the ``NumPart_Total`` array. As SWIFT only uses ICs
contained in a single file, this is not necessary for SWIFT-only ICs.
+ ``NumFilesPerSnapshot``, again a historical integer value that tells the code
how many files there are per snapshot. You will probably want to set
this to 1. If this field is present in a SWIFT IC file and has a
value different from 1, the code will return an error message.
+ ``Time``, time of the start of the simulation in internal units or expressed
as a scale-factor for cosmological runs. **SWIFT ignores this and reads it
from the parameter file**, behaviour that matches the GADGET-2 code. Note
that SWIFT writes the current time since the Big Bang, not scale-factor, to
this variable in snapshots.
+ ``Redshift``, the redshift at the start of the simulation. SWIFT checks this
(if present) against ``a_begin`` in the parameter file at the start of
cosmological runs. Note that we explicitly do **not** compare the ``Time``
variable due to its ambiguous meaning.
Particle Data
~~~~~~~~~~~~~
Now for the interesting part! You can include particle data groups for each
individual particle type (e.g. ``/PartType0/``) that have the following *datasets*:
+ ``Coordinates``, an array of shape (N, 3) where N is the number of particles
of that type, that are the cartesian co-ordinates of the
particles. Co-ordinates must be within the box so, in the case of a cube
within [0, L)^3 where L is the side-length of the simulation volume. In the
case of cosmological simulations, these are the co-moving positions.
+ ``Velocities``, an array of shape (N, 3) that is the cartesian velocities of
the particles. When running cosmological simulations, these are the **peculiar
velocities**. Note that this is different from GADGET which uses peculiar
velocities divided by ``sqrt(a)`` (see below for a fix).
+ ``ParticleIDs``, an array of length N that are unique identifying numbers for
each particle. Note that these have to be unique to a particle, and cannot be
the same even between particle types. The **IDs must be >= 0**. Negative
IDs will be rejected by the code.
Note, however, that if the parameters to remap the IDs upon startup is switched
on (see :ref:`Parameters_ICs`), the IDs can be omitted entirely from the ICs.
+ ``Masses``, an array of length N that gives the masses of the particles.
For ``PartType0`` (i.e. particles that interact through hydro-dynamics), you will
need the following auxiliary items:
+ ``SmoothingLength``, the smoothing lengths of the particles. These will be
tidied up a bit, but it is best if you provide accurate numbers. In
cosmological runs, these are the co-moving smoothing lengths.
+ ``InternalEnergy``, an array of length N that gives the internal energies per
unit mass of the particles. If the hydro-scheme used in the code is based on
another thermodynamical quantity (entropy or total energy, etc.), the
conversion will happen inside the code. In cosmological runs, this is the
**physical** internal energy per unit mass. This has the dimension of velocity
squared.
Note that for cosmological runs, all quantities have to be expressed in "h-free"
dimensions. This means ``Mpc`` and not ``Mpc/h`` for instance. If the ICs have
been generated for GADGET (where h-full values are expected), the parameter
``InitialConditions:cleanup_h_factors`` can be set to ``1`` in the
:ref:`Parameter_File_label` to make SWIFT convert the quantities read in to
h-free quantities. Switching this parameter on will also affect the box size
read from the ``/Header/`` group (see above).
Similarly, GADGET cosmological ICs have traditionally used velocities expressed
as peculiar velocities divided by ``sqrt(a)``. This can be undone by switching
on the parameter ``InitialConditions:cleanup_velocity_factors`` in the
:ref:`Parameter_File_label`.
.. _ICs_units_label:
Optional Components
-------------------
In the ``/Units/`` HDF5 group, you cans specify what units your initial conditions are
in. If this group is not present, the code assumes that you are using the same
units for your initial conditions as in your :ref:`Parameter_File_label`
(i.e. as the internal units system used by the code), but it is best to include
them to be on the safe side. You will need:
+ ``Unit length in cgs (U_L)``
+ ``Unit mass in cgs (U_M)``
+ ``Unit time in cgs (U_t)``
+ ``Unit current in cgs (U_I)``
+ ``Unit temperature in cgs (U_T)``
These are all floating point numbers. Note that we specify the time units and
not the velocity units.
If the units specified in the initial conditions are different from the internal
units (specified in the parameter file), SWIFT will perform a conversion of all
the quantities when reading in the ICs. This includes a conversion of the box
size read from the ``/Header/`` group.
Summary
-------
You should have an HDF5 file with the following structure:
.. code-block:: bash
Header/
BoxSize=[x, y, z]
Flag_Entropy_ICs=0
NumPart_Total=[0, 1, 0, 0, 4, 5]
NumPart_Total_HighWord=[0, 0, 0, 0, 0, 0]
Units/
Unit current in cgs (U_I)=1.0
Unit length in cgs (U_L)=1.0
Unit mass in cgs (U_M)=1.0
Unit temperature in cgs (U_T)=1.0
Unit time in cgs (U_t)=1.0
PartType0/
Coordinates=[[x, y, z]]
Velocities=[[vx, vy, vz]]
ParticleIDs=[...]
Masses=[...]
InternalEnergy=[...]
SmoothingLength=[...]
PartType1/
Coordinates=[[x, y, z]]
Velocities=[[vx, vy, vz]]
ParticleIDs=[...]
Masses=[...]
.. _multiple_files_ICs:
ICs split over multiple files
-----------------------------
A basic script ``tools/combine_ics.py`` is provided to merge basic GADGET-2
initial conditions split into multiple files into one single valid file. This
script can handle simple HDF5 files (GADGET-2 type 3 ICs) that follow the format
described above but split over multiple files.
The script can also convert ICs using a ``MassTable`` and create the
corresponding particle fields. Note that additional fields present in ICs beyond
the simple GADGET-2 specification will not be merged.
One additional option is to compress the fields in the files using HDF5's gzip
compression. This is very effective for the fields such as masses or particle
IDs which are very similar. A checksum filter is also applied in all cases to
help with data curation.
**We caution that this script is very basic and should only be used with great
caution.**