20–24 Sept 2021
US/Pacific timezone

Session

Containers and Checkpoint/Restore MC

20 Sept 2021, 07:00

Description

The Containers and Checkpoint/Restore Microconference focuses on both userspace and kernel related work. The micro-conference targets the wider container ecosystem ideally with participants from all major container runtimes as well as init system developers.

Contributions to the micro-conference are expected to be problem statements, new use-cases, and feature proposals both in kernel- and userspace.

Presentation materials

There are no materials yet.

  1. Stéphane Graber (Canonical Ltd.)
    20/09/2021, 07:00
  2. Stéphane Graber (Canonical Ltd.), Christian Brauner
    20/09/2021, 07:05

    The user namespace currently relies on mapping UIDs and GIDs from the initial namespace (full uint32 range) into the newly created user namespace. This is done through the use of uid_map/gid_map with the kernel allowing mapping your own uid/gid and otherwise requiring a privileged process write a more complete map.

    As more and more software (not just container managers) are making use of...

    Go to contribution page
  3. Tom Hromatka
    20/09/2021, 07:30

    The Oracle database offers a long-term-stable version that is supported and
    maintained for many years. But as Linux distributions slowly transition
    from cgroup v1 to cgroup v2, this creates a challenge for the DB. cgroup v1
    and cgroup v2 have different interfaces and best practices.

    This talk is to discuss the current status of the cgroup abstraction layer, how
    applications like the...

    Go to contribution page
  4. Pavel Tikhomirov (Virtuozzo)
    20/09/2021, 08:10

    Mount checkpoint/restore is an important part of CRIU, it is responsible for
    consistency of the file system view of dumped processes. In current state we
    can only restore simple mount configurations, something more complex would
    either make us fail or, which is even worse, make us creating wrong file
    system view for restored processes.

    In CRIU we only see the final state, the result of...

    Go to contribution page
  5. Alexander Graf, Mr Adrian Catangiu
    20/09/2021, 08:35

    Starting things is slow. Even if only 1 second slow, saving 1s on a million container restores means we can save 11 days of useless work that every container will perform identically.

    That's where snapshots come in. Snapshots in theory allow us to save an initialized container once, but then restore it a million times at less overhead than cold starting it takes.

    Unfortunately, Linux...

    Go to contribution page
  6. Mr Rajneesh Bhardwaj (AMD), Mr Felix Kuehling (AMD), Mr David Yat Sin (Mr)
    20/09/2021, 09:10

    We recently announced our work to support Checkpoint and Restore with AMD GPUs. This was first time a device plugin is introduced and that deals with one of the most complex devices on the system i.e. GPU. We made several changes to CRIU, introduced new plugin hooks and reported some issues with CRIU.

    https://github.com/RadeonOpenCompute/criu/tree/amd-criu-dev-staging/plugins/amdgpu#readme...

    Go to contribution page
  7. Alexander Mikhalitsyn (Virtuozzo), Andrei Vagin
    20/09/2021, 10:00

    CRIU uses many different interfaces to get information about kernel resources,
    to extract sockets data sock_diag subsystem is used, for mounts/mount namespaces,
    procfs per-pid mountinfo files are used, to get some file type-specific info we
    use procfs fdinfo interface (which allows to get mnt_id from which file was opened,
    file flags and so on).

    One of the most important and...

    Go to contribution page
  8. Christian Brauner
    20/09/2021, 10:25
  9. CRIU can checkpoint and restore processes using standard kernel interfaces. However, out of the box, it cannot support processes using device driver interfaces for devices like GPUs or compute accelerators.

    CRIU already has a plugin architecture to support processes using device files. Using this architecture we added a plugin for supporting CRIU with GPU compute applications running on...

    Go to contribution page
  10. CRIU can checkpoint and restore processes using standard kernel interfaces. However, out of the box, it cannot support processes using device driver interfaces for devices like GPUs or compute accelerators.

    CRIU already has a plugin architecture to support processes using device files. Using this architecture we added a plugin for supporting CRIU with GPU compute applications running on...

    Go to contribution page
  11. The Containers and Checkpoint/Restore micro-conference brings together kernel developers, runtime maintainers, and developers working on container- and sandboxing related technologies in general to discuss current problems and agree on new features.

    Both userspace and kernel related work is acceptable. The micro-conference targets the wider container ecosystem ideally with participants from...

    Go to contribution page
  12. The Containers and Checkpoint/Restore micro-conference brings together kernel developers, runtime maintainers, and developers working on container- and sandboxing related technologies in general to discuss current problems and agree on new features.

    Both userspace and kernel related work is acceptable. The micro-conference targets the wider container ecosystem ideally with participants from...

    Go to contribution page
Building timetable...

Diamond Sponsor

Platinum Sponsor

Gold Sponsors

Silver Sponsors

Speaker Gift Sponsor

T-Shirt Sponsor

Conference Services provided by