208 lines
10 KiB
ReStructuredText
208 lines
10 KiB
ReStructuredText
===============================
|
|
Building a Distribution of LLVM
|
|
===============================
|
|
|
|
.. contents::
|
|
:local:
|
|
|
|
Introduction
|
|
============
|
|
|
|
This document is geared toward people who want to build and package LLVM and any
|
|
combination of LLVM sub-project tools for distribution. This document covers
|
|
useful features of the LLVM build system as well as best practices and general
|
|
information about packaging LLVM.
|
|
|
|
If you are new to CMake you may find the :doc:`CMake` or :doc:`CMakePrimer`
|
|
documentation useful. Some of the things covered in this document are the inner
|
|
workings of the builds described in the :doc:`AdvancedBuilds` document.
|
|
|
|
General Distribution Guidance
|
|
=============================
|
|
|
|
When building a distribution of a compiler it is generally advised to perform a
|
|
bootstrap build of the compiler. That means building a "stage 1" compiler with
|
|
your host toolchain, then building the "stage 2" compiler using the "stage 1"
|
|
compiler. This is done so that the compiler you distribute benefits from all the
|
|
bug fixes, performance optimizations and general improvements provided by the
|
|
new compiler.
|
|
|
|
In deciding how to build your distribution there are a few trade-offs that you
|
|
will need to evaluate. The big two are:
|
|
|
|
#. Compile time of the distribution against performance of the built compiler
|
|
|
|
#. Binary size of the distribution against performance of the built compiler
|
|
|
|
The guidance for maximizing performance of the generated compiler is to use LTO,
|
|
PGO, and statically link everything. This will result in an overall larger
|
|
distribution, and it will take longer to generate, but it provides the most
|
|
opportunity for the compiler to optimize.
|
|
|
|
The guidance for minimizing distribution size is to dynamically link LLVM and
|
|
Clang libraries into the tools to reduce code duplication. This will come at a
|
|
substantial performance penalty to the generated binary both because it reduces
|
|
optimization opportunity, and because dynamic linking requires resolving symbols
|
|
at process launch time, which can be very slow for C++ code.
|
|
|
|
.. _shared_libs:
|
|
|
|
.. warning::
|
|
One very important note: Distributions should never be built using the
|
|
*BUILD_SHARED_LIBS* CMake option. That option exists for optimizing developer
|
|
workflow only. Due to design and implementation decisions, LLVM relies on
|
|
global data which can end up being duplicated across shared libraries
|
|
resulting in bugs. As such this is not a safe way to distribute LLVM or
|
|
LLVM-based tools.
|
|
|
|
The simplest example of building a distribution with reasonable performance is
|
|
captured in the DistributionExample CMake cache file located at
|
|
clang/cmake/caches/DistributionExample.cmake. The following command will perform
|
|
and install the distribution build:
|
|
|
|
.. code-block:: console
|
|
|
|
$ cmake -G Ninja -C <path to clang>/cmake/caches/DistributionExample.cmake <path to LLVM source>
|
|
$ ninja stage2-distribution
|
|
$ ninja stage2-install-distribution
|
|
|
|
Difference between ``install`` and ``install-distribution``
|
|
-----------------------------------------------------------
|
|
|
|
One subtle but important thing to note is the difference between the ``install``
|
|
and ``install-distribution`` targets. The ``install`` target is expected to
|
|
install every part of LLVM that your build is configured to generate except the
|
|
LLVM testing tools. Alternatively the ``install-distribution`` target, which is
|
|
recommended for building distributions, only installs specific parts of LLVM as
|
|
specified at configuration time by *LLVM_DISTRIBUTION_COMPONENTS*.
|
|
|
|
Additionally by default the ``install`` target will install the LLVM testing
|
|
tools as the public tools. This can be changed well by setting
|
|
*LLVM_INSTALL_TOOLCHAIN_ONLY* to ``On``. The LLVM tools are intended for
|
|
development and testing of LLVM, and should only be included in distributions
|
|
that support LLVM development.
|
|
|
|
When building with *LLVM_DISTRIBUTION_COMPONENTS* the build system also
|
|
generates a ``distribution`` target which builds all the components specified in
|
|
the list. This is a convenience build target to allow building just the
|
|
distributed pieces without needing to build all configured targets.
|
|
|
|
Special Notes for Library-only Distributions
|
|
--------------------------------------------
|
|
|
|
One of the most powerful features of LLVM is its library-first design mentality
|
|
and the way you can compose a wide variety of tools using different portions of
|
|
LLVM. Even in this situation using *BUILD_SHARED_LIBS* is not supported. If you
|
|
want to distribute LLVM as a shared library for use in a tool, the recommended
|
|
method is using *LLVM_BUILD_LLVM_DYLIB*, and you can use *LLVM_DYLIB_COMPONENTS*
|
|
to configure which LLVM components are part of libLLVM.
|
|
Note: *LLVM_BUILD_LLVM_DYLIB* is not available on Windows.
|
|
|
|
Options for Optimizing LLVM
|
|
===========================
|
|
|
|
There are four main build optimizations that our CMake build system supports.
|
|
When performing a bootstrap build it is not beneficial to do anything other than
|
|
setting *CMAKE_BUILD_TYPE* to ``Release`` for the stage-1 compiler. This is
|
|
because the more intensive optimizations are expensive to perform and the
|
|
stage-1 compiler is thrown away. All of the further options described should be
|
|
set on the stage-2 compiler either using a CMake cache file, or by prefixing the
|
|
option with *BOOTSTRAP_*.
|
|
|
|
The first and simplest to use is the compiler optimization level by setting the
|
|
*CMAKE_BUILD_TYPE* option. The main values of interest are ``Release`` or
|
|
``RelWithDebInfo``. By default the ``Release`` option uses the ``-O3``
|
|
optimization level, and ``RelWithDebInfo`` uses ``-O2``. If you want to generate
|
|
debug information and use ``-O3`` you can override the
|
|
*CMAKE_<LANG>_FLAGS_RELWITHDEBINFO* option for C and CXX.
|
|
DistributionExample.cmake does this.
|
|
|
|
Another easy to use option is Link-Time-Optimization. You can set the
|
|
*LLVM_ENABLE_LTO* option on your stage-2 build to ``Thin`` or ``Full`` to enable
|
|
building LLVM with LTO. These options will significantly increase link time of
|
|
the binaries in the distribution, but it will create much faster binaries. This
|
|
option should not be used if your distribution includes static archives, as the
|
|
objects inside the archive will be LLVM bitcode, which is not portable.
|
|
|
|
The :doc:`AdvancedBuilds` documentation describes the built-in tooling for
|
|
generating LLVM profiling information to drive Profile-Guided-Optimization. The
|
|
in-tree profiling tests are very limited, and generating the profile takes a
|
|
significant amount of time, but it can result in a significant improvement in
|
|
the performance of the generated binaries.
|
|
|
|
In addition to PGO profiling we also have limited support in-tree for generating
|
|
linker order files. These files provide the linker with a suggested ordering for
|
|
functions in the final binary layout. This can measurably speed up clang by
|
|
physically grouping functions that are called temporally close to each other.
|
|
The current tooling is only available on Darwin systems with ``dtrace(1)``. It
|
|
is worth noting that dtrace is non-deterministic, and so the order file
|
|
generation using dtrace is also non-deterministic.
|
|
|
|
Options for Reducing Size
|
|
=========================
|
|
|
|
.. warning::
|
|
Any steps taken to reduce the binary size will come at a cost of runtime
|
|
performance in the generated binaries.
|
|
|
|
The simplest and least significant way to reduce binary size is to set the
|
|
*CMAKE_BUILD_TYPE* variable to ``MinSizeRel``, which will set the compiler
|
|
optimization level to ``-Os`` which optimizes for binary size. This will have
|
|
both the least benefit to size and the least impact on performance.
|
|
|
|
The most impactful way to reduce binary size is to dynamically link LLVM into
|
|
all the tools. This reduces code size by decreasing duplication of common code
|
|
between the LLVM-based tools. This can be done by setting the following two
|
|
CMake options to ``On``: *LLVM_BUILD_LLVM_DYLIB* and *LLVM_LINK_LLVM_DYLIB*.
|
|
|
|
.. warning::
|
|
Distributions should never be built using the *BUILD_SHARED_LIBS* CMake
|
|
option. (:ref:`See the warning above for more explanation <shared_libs>`.).
|
|
|
|
Relevant CMake Options
|
|
======================
|
|
|
|
This section provides documentation of the CMake options that are intended to
|
|
help construct distributions. This is not an exhaustive list, and many
|
|
additional options are documented in the :doc:`CMake` page. Some key options
|
|
that are already documented include: *LLVM_TARGETS_TO_BUILD*,
|
|
*LLVM_ENABLE_PROJECTS*, *LLVM_BUILD_LLVM_DYLIB*, and *LLVM_LINK_LLVM_DYLIB*.
|
|
|
|
**LLVM_ENABLE_RUNTIMES**:STRING
|
|
When building a distribution that includes LLVM runtime projects (i.e. libcxx,
|
|
compiler-rt, libcxxabi, libunwind...), it is important to build those projects
|
|
with the just-built compiler.
|
|
|
|
**LLVM_DISTRIBUTION_COMPONENTS**:STRING
|
|
This variable can be set to a semi-colon separated list of LLVM build system
|
|
components to install. All LLVM-based tools are components, as well as most
|
|
of the libraries and runtimes. Component names match the names of the build
|
|
system targets.
|
|
|
|
**LLVM_RUNTIME_DISTRIBUTION_COMPONENTS**:STRING
|
|
This variable can be set to a semi-colon separated list of runtime library
|
|
components. This is used in conjunction with *LLVM_ENABLE_RUNTIMES* to specify
|
|
components of runtime libraries that you want to include in your distribution.
|
|
Just like with *LLVM_DISTRIBUTION_COMPONENTS*, component names match the names
|
|
of the build system targets.
|
|
|
|
**LLVM_DYLIB_COMPONENTS**:STRING
|
|
This variable can be set to a semi-colon separated name of LLVM library
|
|
components. LLVM library components are either library names with the LLVM
|
|
prefix removed (i.e. Support, Demangle...), LLVM target names, or special
|
|
purpose component names. The special purpose component names are:
|
|
|
|
#. ``all`` - All LLVM available component libraries
|
|
#. ``Native`` - The LLVM target for the Native system
|
|
#. ``AllTargetsAsmParsers`` - All the included target ASM parsers libraries
|
|
#. ``AllTargetsDescs`` - All the included target descriptions libraries
|
|
#. ``AllTargetsDisassemblers`` - All the included target dissassemblers libraries
|
|
#. ``AllTargetsInfos`` - All the included target info libraries
|
|
|
|
**LLVM_INSTALL_TOOLCHAIN_ONLY**:BOOL
|
|
This option defaults to ``Off``: when set to ``On`` it removes many of the
|
|
LLVM development and testing tools as well as component libraries from the
|
|
default ``install`` target. Including the development tools is not recommended
|
|
for distributions as many of the LLVM tools are only intended for development
|
|
and testing use.
|