OpenVZ Forum: Support » CUDA support inside containers

Home » General » Support » CUDA support inside containers (How can I run CUDA workloads in multiple containers?)

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

Re: OverlayFS (was Re: CUDA support inside containers) [message #52655 is a reply to message #52653]

Tue, 22 November 2016 08:42

abufrejoval
Messages: 21
Registered: November 2016
Location: Frankfurt

Junior Member

khorenko wrote on Mon, 21 November 2016 17:11

abufrejoval wrote on Mon, 21 November 2016 15:18
I guess OverlayFS is what I need to selectively replicate /proc and /sys contents from the host to make the CUDA runtime happy.

I remember reading about it on lwn.net and it looks like it's actually been backported from 3.18 to RHEL/CentOS/(OpenVZ?)7 3.10 kernels to support Docker, but it seems it's kernel only and missing support from the userland tools ("mount: unknown file system type 'overlayfs'...").

It's quite maddenning, because device access to /dev/nvidia* from inside the container in general seems to be supported in OpenVZ: I get a 'proper' "invalid argument" when doing a 'cat /dev/nvidia-uvm' and not the dreaded "permission denied" I get from LXC on the native CentOS.

You can use overlayfs inside a Virtuozzo 7/OpenVZ 7 Container, all you need is to load the appropriate kernel module on the host.

Did you try to get further with mounting /proc/modules inside a Container with the file with content from the host's /proc/modules?

I write too much, that's why the answers got lost Smile

Yes, I did try mounting /proc/modules via a copied file from the host which I then bind mounted as you suggested.

And then the runtime library just wanted the *next* file which wasn't there. I copied that, too but eventually I got stuck at /proc/devices, which is a *directory* on the host but an empty *file* on the guest: I couldn't bind mount a directory over the empty file (nor could I delete the empty file from the guest's procfs).

That's why I thought I might potentially get there using the overlayfs, which really seems to support all kinds of dirty tricks.

And there the trouble is, that the Virtuozzo 3.10. kernel supports the overlayfs functionality via its sys-call interface thanks to Docker running so much quicker with it. But the actual user land tool 'mount' doesn't understand the -t overlayfs parameter: I'd have to go and get one from e.g. a more up-to-date Fedora and statically compile that against a matching c-library etc.

In short words: All kinds of trouble when ideally Nvidia should offer a run-time library option, which doesn't do all these 'convenience' checks.

I've sent a request to Nvidia accordingly and I'm hoping for them to fix the issue at the source.

Of course somewhere within Virtuozzo there must be a table which decides which elements in /proc and /sys are visible to guests and which need translation (e.g. UID or PID mapping).

I should be able to patch that code and build a 'matching' CUDA kernel, just to see if that eventually solves the problem, too.

But I'd invest that effort only, if I could be sure that CUDA enabled Docker workloads also run on both the host and inside OpenVZ containers, because that would make OpenVZ feature complete with regards to the environment I need to build. That requires support for the current docker-engine 1.12.1 on both sides and evidently Docker and OpenVZ don't get along as well as I had hoped any more. Some tests using older Docker variants had looked rather promising early this year, but Nvidia has built a docker-plugin, which requires 1.10 or greater.

Essentially I want to support two major 'client' workloads: CUDA enabled Docker images and CUDA enabled 'IaaS' container.
Ubuntu delivery both, but with--well Ubuntu and LXC both of which require significant relearning and additional risks.

I really don't want to go down that road, but at the moment I have no choice.

Report message to a moderator

[Message index]

		CUDA support inside containers By: abufrejoval on Sat, 12 November 2016 13:33
		Re: CUDA support inside containers By: khorenko on Mon, 14 November 2016 18:04
		Re: CUDA support inside containers By: abufrejoval on Mon, 14 November 2016 22:28
		Re: CUDA support inside containers By: khorenko on Tue, 15 November 2016 07:58
		Re: CUDA support inside containers By: abufrejoval on Tue, 15 November 2016 00:38
		Re: CUDA support inside containers By: khorenko on Tue, 15 November 2016 05:45
		Re: CUDA support inside containers By: abufrejoval on Wed, 16 November 2016 02:37
		Re: CUDA support inside containers By: khorenko on Wed, 16 November 2016 06:02
		Re: CUDA support inside containers By: abufrejoval on Fri, 18 November 2016 03:37
		Re: CUDA support inside containers By: abufrejoval on Fri, 18 November 2016 04:10
		Re: CUDA support inside containers By: khorenko on Fri, 18 November 2016 18:07
		Re: CUDA support inside containers By: abufrejoval on Mon, 21 November 2016 01:11
		Re: CUDA support inside containers By: abufrejoval on Mon, 21 November 2016 03:55
		OverlayFS (was Re: CUDA support inside containers) By: abufrejoval on Mon, 21 November 2016 12:18
		Re: OverlayFS (was Re: CUDA support inside containers) By: khorenko on Mon, 21 November 2016 16:11
		Re: OverlayFS (was Re: CUDA support inside containers) By: abufrejoval on Tue, 22 November 2016 08:42
		Re: OverlayFS (was Re: CUDA support inside containers) By: khorenko on Wed, 23 November 2016 15:57
		Re: OverlayFS (was Re: CUDA support inside containers) By: abufrejoval on Wed, 23 November 2016 19:17
		Re: OverlayFS (was Re: CUDA support inside containers) By: abufrejoval on Mon, 28 November 2016 14:34

Previous Topic:	CVE-2016-7910 CVE-2016-7911
Next Topic:	Can you rebuild vz/private files?

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Tue Jun 16 18:04:16 GMT 2026

Total time taken to generate the page: 0.56956 seconds