Podman is awesome—and totally frustrating

witten@lemmy.world · edited 1 year ago

Geronimo Wenja@agora.nop.chat · 1 year ago

One of the really nice side-effects of it running rootless is that you get all the benefits of it running as an actual Unix user.

For instance, you can set up wireguard with IP route to send all traffic from a given UID through the VPN.

Using that, I set up one user as the single user for running all the stuff I want to have VPN’d for outgoing connections, like *arr services, with absolutely no extra work. I don’t need to configure a specific container, I don’t need to change a docker-compose etc.

In rootful docker, I had to use a specific IP subnet to achieve the same, which was way more clunky.

witten@lemmy.world · 1 year ago

Really good example of one way that Podman’s Unix Philosophy is actually helpful!

1 year ago

Could you explain or show how to do that?

Geronimo Wenja@agora.nop.chat · edited 1 year ago

Yeah sure.

I’m going to assume you’re starting from the point of having a second linux user also set up to use rootless podman. That’s just following the same steps for setting up rootless podman as any other user, so there shouldn’t be too many problems there.
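For anyone starting from scratch, here is a rough sketch of that second-user setup. The account name vpnuser is made up, and UID 1001 is only chosen to line up with the rest of this comment. The root-only steps are printed rather than executed; the one thing the script actually does is check for subordinate ID ranges, which rootless Podman needs.

```shell
#!/usr/bin/env bash
# Sketch: check rootless Podman prerequisites for a second user.
# "vpnuser" is a hypothetical account name, not from the thread.

# Succeeds if USER has a subordinate ID range in FILE (lines look like name:start:count).
has_subid_range() {
    local user="$1" file="$2"
    grep -Eq "^${user}:[0-9]+:[0-9]+$" "$file" 2>/dev/null
}

user="${VPN_USER:-vpnuser}"

for f in /etc/subuid /etc/subgid; do
    if has_subid_range "$user" "$f"; then
        echo "ok: $user has a range in $f"
    else
        echo "missing: $user has no range in $f"
    fi
done

# Root-only steps, printed rather than executed:
echo "sudo useradd --create-home --uid 1001 $user"
echo "sudo loginctl enable-linger $user   # lets user services start at boot"
```

On most current distros useradd fills in /etc/subuid and /etc/subgid by itself, so the check is mainly a safety net for accounts created some other way.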

If you have wireguard set up and running already - i.e. with Mullvad VPN or your own VPN to a VPS - you should be able to run ip link to see a wireguard network interface. Mine is called wg. I don’t use wg-quick, which means I don’t have all my traffic routing through it by default. Instead, I use a systemd unit to bring up the WG interface and set up routing.

I’ll also assume the UID you want to forward is 1001, because that’s what I’m using. I’ll also use enp3s0 as the default network link, because that’s what mine is, but if yours is eth0, you should use that. Finally, I’ll assume that 192.168.0.0/24 is your local network subnet and 192.168.0.1 is your router - it’s useful to avoid routing local traffic through wireguard.

#YOUR_STATIC_EXTERNAL_IP# should be whatever you get by calling curl ifconfig.me if you have a static IP - again, useful to avoid routing local traffic through wireguard. If you don’t have a static IP you can drop this line.

[Unit]
Description=Create wireguard interface
Wants=network-online.target
After=network-online.target

[Service]
Type=oneshot
RemainAfterExit=yes
ExecStart=/usr/bin/bash -c " \
        /usr/sbin/ip link add dev wg type wireguard || true; \
        /usr/bin/wg setconf wg /etc/wireguard/wg.conf || true; \
        /usr/bin/resolvectl dns wg #PREFERRED_DNS#; \
        /usr/sbin/ip -4 address add #WG_IPV4_ADDRESS#/32 dev wg || true; \
        /usr/sbin/ip -6 address add #WG_IPV6_ADDRESS#/128 dev wg || true; \
        /usr/sbin/ip link set mtu 1420 up dev wg || true; \
        /usr/sbin/ip rule add uidrange 1001-1001 table 200 || true; \
        /usr/sbin/ip route add #VPN_ENDPOINT# via #ROUTER_IP# dev enp3s0 table 200 || true; \
        /usr/sbin/ip route add 192.168.0.0/24 via 192.168.0.1 dev enp3s0 table 200 || true; \
        /usr/sbin/ip route add #YOUR_STATIC_EXTERNAL_IP#/32 via #ROUTER_IP# dev enp3s0 table 200 || true; \
        /usr/sbin/ip route add default via #WG_IPV4_ADDRESS# dev wg table 200 || true; \
"

ExecStop=/usr/bin/bash -c " \
        /usr/sbin/ip rule del uidrange 1001-1001 table 200 || true; \
        /usr/sbin/ip route flush table 200 || true; \
        /usr/bin/wg set wg peer '#PEER_PUBLIC_KEY#' remove || true; \
        /usr/sbin/ip link del dev wg || true; \
"

[Install]
WantedBy=multi-user.target

There’s a bit to go through here, so I’ll take you through why it works. Most of it is just setting up WG to receive/send traffic. The bits that are relevant are:

        /usr/sbin/ip rule add uidrange 1001-1001 table 200 || true; \
        /usr/sbin/ip route add #VPN_ENDPOINT# via #ROUTER_IP# dev enp3s0 table 200 || true; \
        /usr/sbin/ip route add 192.168.0.0/24 via 192.168.0.1 dev enp3s0 table 200 || true; \
        /usr/sbin/ip route add #YOUR_STATIC_EXTERNAL_IP#/32 via #ROUTER_IP# dev enp3s0 table 200 || true; \
        /usr/sbin/ip route add default via #WG_IPV4_ADDRESS# dev wg table 200 || true; \

ip rule add uidrange 1001-1001 table 200 adds a new rule where requests from UID 1001 go through table 200. A table is a subset of ip routing rules that are only relevant to certain traffic.

ip route add #VPN_ENDPOINT# ... ensures that traffic to the VPN endpoint itself - i.e. the encrypted wireguard packets - goes out via the router on enp3s0 rather than back into the tunnel. This is relevant for handshakes.

ip route add 192.168.0.0/24 via 192.168.0.1 ... is just excluding local traffic, as is ip route add #YOUR_STATIC_EXTERNAL_IP#/32 ...

Finally, we add ip route add default via #WG_IPV4_ADDRESS# ... which routes all traffic that didn’t match any of the above rules (local traffic, wireguard) to go to the wireguard interface. From there, WG handles all the rest, and passes returning traffic back.

There’s going to be some individual tweaking here, but the long and short of it is, UID 1001 will have all their external traffic routed through WG. Any internal traffic between docker containers in a docker-compose should already be handled by podman pods and never reach the routing rules. Any traffic aimed at other services in the network - i.e. sonarr calling sabnzbd or transmission - will happen with a relevant local IP of the machine it’s hosted on, and so will also be skipped. Localhost is already handled by existing ip route rules, so you shouldn’t have to worry about that either.
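If you want to check that the rule actually landed, something like the following sketch should do. It assumes iproute2 prints the rule in its usual "uidrange A-B lookup T" form, and it only prints the curl comparison instead of running it, since that needs sudo and network access.

```shell
#!/usr/bin/env bash
# Sketch: sanity-check the per-UID routing after the unit has started.
# UID 1001 and table 200 are the values used in this thread.

# Reads `ip rule show` output on stdin; succeeds if a uidrange rule for
# UID points at TABLE. Assumes iproute2's usual "uidrange A-B lookup T" wording.
has_uid_rule() {
    local uid="$1" table="$2"
    grep -Eq "uidrange ${uid}-${uid} lookup ${table}( |\$)"
}

if command -v ip >/dev/null 2>&1; then
    if ip rule show | has_uid_rule 1001 200; then
        echo "rule for UID 1001 -> table 200 is present"
        ip route show table 200 || true
    else
        echo "no uidrange rule for UID 1001 found"
    fi
fi

# Compare exit IPs by hand (printed, not executed):
echo "curl -s ifconfig.me                   # your normal IP"
echo "sudo -u '#1001' curl -s ifconfig.me   # should be the VPN's IP"
```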

Hopefully that helps - sorry if it’s a bit confusing. I learned to set up my own IP routing to avoid wg-quick so that I could have greater control over the traffic flow, so this is quite a lot of my learning that I’m attempting to distill into one place.

1 year ago

Thank you very much!

beerd@lemmy.world · 1 year ago

Thank you so much, I was just looking for a way to do this the other day!

MrWiggles@prime8s.xyz · 1 year ago

Saving this for later, thank you very much for the detailed writeup. I might look into this for my main machine to partition the vpn tasks from the non-vpn tasks

Geronimo Wenja@agora.nop.chat · 1 year ago

You’re welcome. It has some really nice side-effects - i.e. if I want to quickly grab a file without it being from my normal IP, I can just SSH to the right user on my server and it just works - no configuration, no needing to interrupt other traffic.

Jeena@jemmy.jeena.net · edited 1 year ago

I guess this: “you run a “root” container in your completely unprivileged Unix user and everything just works” sounds like chroot. Also managing your container starts with systemd sounds pretty good to me because this is what systemd is designed for, dependencies between services, etc.

burningquestion@lemmy.world · edited 1 year ago

deleted by creator

exu@feditown.com · 1 year ago

Fyi, you can create a pod, create containers in that pod and generate the systemd file for that. Now you have one service to start/stop everything.
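A rough sketch of that workflow, in case it helps. The pod name, container names, and images are placeholders, and every command is only printed unless you set DRY_RUN=0. Note that newer Podman releases mark podman generate systemd as deprecated in favour of Quadlet, so treat this as one option rather than the recommended path.

```shell
#!/usr/bin/env bash
# Sketch: one pod, two containers, one generated unit to start them all.
# Pod, container, and image names are placeholders.
# Commands are only printed unless DRY_RUN=0 is set.

run() {
    echo "+ $*"
    if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

pod="${POD_NAME:-media}"
unit_dir="${XDG_CONFIG_HOME:-$HOME/.config}/systemd/user"

run podman pod create --name "$pod" --publish 8080:80
run podman create --pod "$pod" --name "$pod-web" docker.io/library/nginx:latest
run podman create --pod "$pod" --name "$pod-cache" docker.io/library/redis:latest

# --files writes the unit files into the current directory;
# --name uses pod/container names instead of IDs in the file names.
run mkdir -p "$unit_dir"
run cd "$unit_dir"
run podman generate systemd --files --name "$pod"

run systemctl --user daemon-reload
run systemctl --user enable --now "pod-$pod.service"
```

Starting or stopping the pod unit then takes the container units with it, which is the "one service for everything" effect.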

witten@lemmy.world · 1 year ago

That’s a great point, and honestly not something I’d thought of. Now I just need to give up my lovely Docker Compose YAML and dip my toes into podman run, Quadlet config, or podman kube play…

MrWiggles@prime8s.xyz · 1 year ago

There’s also podman-compose, which I’ve been using. It’s not quite feature complete, but it’s pretty close.

Marxine@lemmy.world · 1 year ago

Didn’t know about Podman yet, I’ll probably give it a spin on some personal projects to get a bit more used to tools other than docker. Thanks for the great intro!

witten@lemmy.world · 1 year ago

If you do try it out, let us know how totally obnoxious it is for you!

Marxine@lemmy.world · 1 year ago

Sure thing, ranting is therapeutic and I wouldn’t miss the chance~

lightree@kbin.social · 1 year ago

Had a knowledge sharing meeting at work recently on container security. The guy was using podman like a docker cli, but it said “this is only an emulation of docker” - are there any downsides to running podman like this? I’m very familiar with docker on the command line.

loren@sh.itjust.works · edited 1 year ago

I think calling it an emulation downplays podman. Docker and podman are both container runtimes. Docker came first and is practically synonymous with containers, whereas podman is newer and attempts to fix docker’s problems.

One outcome of this is podman chose to match docker’s cli very closely so nobody needs to learn a new cli. You can even put podman on the docker socket so “docker [command]” runs with podman.
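For the socket part, a minimal sketch for rootless use could look like this. The fallback to /run/user/<uid> is my assumption for shells where XDG_RUNTIME_DIR is not set, and the systemctl and export lines are printed rather than executed.

```shell
#!/usr/bin/env bash
# Sketch: point Docker-compatible clients at rootless Podman's API socket.

# Prints the socket URL. The fallback to /run/user/<uid> is an assumption for
# shells where XDG_RUNTIME_DIR is not set.
podman_sock_url() {
    echo "unix://${XDG_RUNTIME_DIR:-/run/user/$(id -u)}/podman/podman.sock"
}

# Printed rather than executed:
echo "systemctl --user enable --now podman.socket"
echo "export DOCKER_HOST=$(podman_sock_url)"
echo "docker ps   # now answered by Podman"
```

Tools that insist on the full Docker API may still trip over differences, so this is worth testing per tool.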

witten@lemmy.world · 1 year ago

So as far as I’m understanding your description, he’s actually using Podman under the hood, just via its Docker compatibility CLI. The main downside of that IMO is you lose out on Podman-specific flags and features. Which, honestly are probably not a huge deal to you if you just want a thing that walks and talks like Docker while hopefully being more secure.

The big caveat though is root vs. non-root. If he’s running his commands as root/sudo, it’ll work a whole lot like Docker just without a daemon (and without containers starting on boot). But if he’s running as a non-root user, well… See my original post for some downsides.

amp@kbin.social · 1 year ago

I’ve switched over my own server last week, using ansible to generate the systemd files, and it worked great. It’s just a dozen containers or so.

The only problems I had were with container interdependencies (network-mode=container:x). That didn’t work so well with systemd, restarting and updating, but when I used a pod instead these problems all went away.

So I can’t say I regret my experience so far. Now I’ll be starting to use it at work too, where the user-namespace problem rears its head, but only because we have this very specific, very dumb big LAMP dev container that houses apache, sql, redis, and more under one supervisord. That’s why we have more than one user in it and frankly that’s our own damn fault! When you make proper containers they shouldn’t have more than one user in them, and then userns=keep-id should work just fine.

So far, I fully recommend podman.

witten@lemmy.world · 1 year ago

Using Ansible to spew out systemd service boilerplate seems like a good idea. I’ll have to try that if I can ever give up my Docker Compose security blanket. And I wish you luck with your mega-container Podman conversion. That one sounds like it’ll be… a learning experience.

amp@kbin.social · 1 year ago

I understand very well wanting to stay with the declarative nature of docker-compose. Someone should really build a better podman-compose. (or sooner or later I’ll do it myself >_<)

witten@lemmy.world · 1 year ago

Do it! I think there’s a market there. Although the “Podman Compose” name is taken and you’ll have to think of something else…

Dark Arc@lemmy.world · 1 year ago

I’ve tried to switch in the past, but tripped over the differences in Podman vs Docker networking. IIRC Docker is better for creating an isolated network.

I have noticed that Docker doesn’t do the best job at graceful shutdowns (say for automatic installation of updates). I suspect Podman with systemd integration could do much, much better.

kat@feddit.nl · 1 year ago

At least podman does not circumvent my firewall (ufw) like docker did. Had to use a workaround to get it to work with docker.

witten@lemmy.world · 1 year ago

Podman respects UFW?? That’s awesome to hear.

kat@feddit.nl · 1 year ago

Yep, turns out it doesn’t insert its own iptables rules like docker does, so the special rules from ufw-docker weren’t necessary anymore.

ShittyKopper [they/them]@lemmy.blahaj.zone · 1 year ago

…and of course I learn this after switching to firewalld.

At least firewalld feels relatively painless compared to rest of the redhat-container-verse.

witten@lemmy.world · 1 year ago

Yeah, one of my motivations for kicking Docker to the curb (beyond security) is all the weird little bugs: Preventing shutdown, unresponsive commands, random hangs, broken upgrades, etc. Podman may not end up being any better there, but at least I can pretend for a while.

markstos@lemmy.world · 1 year ago

I see it as a feature that Podman containers are run via systemd. This makes their management consistent with the other systemd-managed services. Also, Docker does its own thing with logs, while with systemd, the logs are managed in a consistent way as well.

Maybe you missed podman generate systemd? Podman will generate the systemd unit files for you.

For me, the two big benefits of podman are being able to run containers via systemd and improved security by being able to run them rootless.

Den Zuko@lemmy.world · 9 months ago

I actually find this a huge problem. Not all distros are built around LSB, XDG, or FreeDesktop.org, nor should they be, since not everyone is running Linux as a workstation/PC replacement. While yes, for the most part podman can be run on the likes of Gentoo, Alpine, Arch, etc., it becomes a pain in the arse to decouple the tooling for podman away from freedesktop.org standards. Even more a pain in the arse for clustering options (e.g. podman-remote expects freedesktop.org norms, kubernetes expects docker containerd or freedesktop.org with podman, and nomad stack is just bulky vaporware).

The really sad part of this is that podman isn’t adding much of anything new over LXC or plain linux namespaces, outside of not needing a daemon, allowing rootless execution (again because it doesn’t need a daemon), and giving ACLs around which OCI repos can be pulled from, unlike docker’s wildcard by default. It shouldn’t be hard to do linux containerization without being tied to anything other than the linux kernel.

witten@lemmy.world · 1 year ago

Thanks for the suggestion, but I actually tried podman generate systemd and it did not work at all with my containers; the resultant services did not start successfully and I wasn’t able to get them working with any amount of tweaking. (I’d tell you what the error messages were, but I’ve only recorded them in now-deleted comments on The Site That Shall Not Be Named.)

markstos@lemmy.world · 1 year ago

Ok. I was already very familiar with systemd when I started using podman and didn’t have any trouble at that step. In my case, I created the systemd unit files by hand.

witten@lemmy.world · 1 year ago

That sounds like it’s probably the way to go…

poVoq@slrpnk.net · 1 year ago

The easiest is actually using Quadlet (integrated in Podman 4.4 and later): https://www.redhat.com/sysadmin/quadlet-podman and write simple .container files.

Works very well and does auto-start, service dependencies and even container auto-updates.
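To give an idea of what such a file looks like, here is a small sketch that prints one. The container name, image, and port mapping are placeholders; the keys themselves (Image, ContainerName, PublishPort, AutoUpdate) are regular Quadlet options.

```shell
#!/usr/bin/env bash
# Sketch: print a minimal Quadlet .container unit.
# The name, image, and port mapping are placeholders.

render_quadlet() {
    local name="$1" image="$2"
    cat <<EOF
[Unit]
Description=$name container

[Container]
ContainerName=$name
Image=$image
PublishPort=8080:80
# Lets podman auto-update pull newer images; needs a fully qualified image name.
AutoUpdate=registry

[Service]
Restart=always

[Install]
WantedBy=default.target
EOF
}

render_quadlet web docker.io/library/nginx:latest
```

For a rootless setup you would save the output as ~/.config/containers/systemd/web.container, run systemctl --user daemon-reload, and start it with systemctl --user start web.service. The auto-update part additionally needs podman-auto-update.timer enabled.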

Also running containers in Pods is a really nice way to handle virtual networking and allows you to manage all the containers in one go similar to docker-compose.

witten@lemmy.world · 1 year ago

The container auto-updates feature does sound pretty slick. I kind of have that hacked together now with a systemd timer and Docker Compose, but it’s not pretty.

HTTP_404_NotFound@lemmyonline.com · 1 year ago

Honestly, I had to use podman at work due to… issues.

It’s close enough to docker that most docker commands will work just fine. You can even alias docker as podman, and things will, for the most part, just work.

HOWEVER, there are some big changes and differences. First, podman creates a systemd service for your containers, for starting them. This is different, and if you forget to tell it to create the service, then your containers won’t start.

My personal opinion after using it for a few years? I strongly prefer docker. It gives me very few issues. I have spent too much time troubleshooting odd things podman does.

witten@lemmy.world · 1 year ago

I’m just glad we now have multiple container engine choices. For a while there, Docker was the only game in town.

HTTP_404_NotFound@lemmyonline.com · 1 year ago

Don’t forget the multiple flavors of kubernetes too.

witten@lemmy.world · 1 year ago

Say more…?

HTTP_404_NotFound@lemmyonline.com · 1 year ago

Kubernetes just runs docker containers, (and lots more).

Imagine a solution to manage containers on multiple hosts, with tons of redundancy. With network and storage automation built in.

That’s Kubernetes in a nutshell.

But it has a steep learning curve.

witten@lemmy.world · 1 year ago

Oh, thanks, but I’m familiar with Kubernetes. :) I was just asking what the multiple flavors of K8s have to do with Podman vs. Docker.

Scribbd@feddit.nl · edited 1 year ago

I work somewhere that doesn’t have licensing with Docker Inc. And I work on a Mac. With Docker desktop out of the picture, I got some experience with the alternatives. I know this post is about the native implementation and not the VM one, but I just wanted to add my 2 cents:

Alternatives run by me: Podman, Rancher Desktop, Finch

Results:

Podman uses a lot more energy on idle than Finch and Rancher. On average 4 more watts on an M1. (Normal idle is about 5 W, so 9 W almost doubles it, cutting greatly into my battery life.)
Podman and Finch are not compatible with some tools that expect a full docker sock. In my case the AWS CDK and SAM CLI have issues. (Which is fun as Finch is also made by AWS)
Finch does not offer a sock at all
Finch requires you to recreate the full VM when updated.
If you really want to have a drop-in replacement for Docker Desktop, use Rancher Desktop. Rancher lacks in UI and the extension feature. But I never had issues with the sock, as I can run it with containerd.
Finch has no UI
Podman’s VM has clock drift if you put your machine in sleep. Only solution I found is to reboot the podman VM.
Podman allows you to log in the VM with a command. I haven’t found a way on the others.

Avoid8822@lemmy.world · 1 year ago

The clock drift issue has been resolved recently: https://github.com/containers/podman/issues/11541

Scribbd@feddit.nl · 1 year ago

That is awesome. I prefer podman, despite what my list might suggest.

Sparking@lemm.ee · 1 year ago

Wow, 4 watts? That’s a lot. Any insight on what is taking up all this extra energy? I thought that podman would be thinner than docker honestly.

Scribbd@feddit.nl · 1 year ago

I did some ~~shallow~~ digging, and my guess is the virtual machine that is started for each.

I see that the podman vm is a whole ass fedora image, at least back in 2021 when this article was written.

Rancher seems to use alpine if I understand the configuration correctly

Finch also uses fedora… I think. Their config is seemingly simple to the point it looks deceptive.

Sparking@lemm.ee · 1 year ago

Oh, are you using Podman on windows? Yeah, it needs a virtual machine because it has to load the linux kernel. I would definitely believe that the windows version (or mac, I guess) of podman is way heavier than the alternatives on those platforms, but on linux it just ends up using the host kernel.

If you are doing this on linux, and still need to load a vm to use podman, that would be interesting. I haven’t run across that, but I haven’t been able to use podman too much.

Scribbd@feddit.nl · 1 year ago

You forgot the third option: A Mac ;)

ShittyKopper [they/them]@lemmy.blahaj.zone · edited 1 year ago

The other big annoying thing about Podman is that because there’s no Big Bad Daemon managing everything, there are certain things you give up. Like containers actually starting on boot. […] until you realize that means Podman wants you to manage your containers entirely with systemd. So… running each container with a systemd service, using those services to stop/start/manage your containers, etc.

Surprisingly, they have a solution for that that doesn’t involve using systemd for everything. They put an --all option to podman start, and a systemd service to run it at boot with the correct --filter (yeah. because unix philosophy). Debian seems to enable it by default AFAICT.

No idea how well it works rootless though.

Edit: Oh and for rootless networking, Podman 4.4.0 seems to ship pasta which seems to be the solution to slirp4netns’s existence. Unfortunately I have no idea if it works at all because I run Debian stable which is still on 4.3.1
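For reference, the podman start --all approach from earlier in this comment can be sketched roughly like this. The container name and image are placeholders, and whether your package ships podman-restart.service as a user unit is an assumption worth checking. Commands are only printed unless DRY_RUN=0 is set.

```shell
#!/usr/bin/env bash
# Sketch: start containers at boot without one unit per container.
# Commands are only printed unless DRY_RUN=0 is set.

run() {
    echo "+ $*"
    if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

# Only containers created with --restart=always match this filter.
restart_filter="restart-policy=always"

run podman create --name web --restart=always docker.io/library/nginx:latest

# Roughly what the packaged service runs at boot:
run podman start --all --filter "$restart_filter"

# Rootless: enable the user unit and keep the user manager alive without a login.
run systemctl --user enable podman-restart.service
run loginctl enable-linger "$(id -un)"
```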

witten@lemmy.world · edited 1 year ago

Surprisingly, they have a solution for that that doesn’t involve using systemd for everything. They put an --all option to podman start, and a systemd service to run it at boot with the correct --filter (yeah. because unix philosophy). Debian seems to enable it by default AFAICT.

TIL! I wonder though why that isn’t working / auto-enabled on my system… Maybe just because I am doing rootless. Thanks for the tip! I’ll have to look into this.

j4k3@lemmy.world · 1 year ago

Lurker. Never self hosted. Nothing useful to add here. Fedora Silverblue has a lot of integration with podman. It is the basis of toolbx, which has its issues, namely that the distro it spins up can’t be upgraded in each toolbx container. The tools are there to mess with containers stuff.

witten@lemmy.world · 1 year ago

Makes sense… Fedora, Red Hat, Podman… All one big ecosystem. Fortunately for other users, Podman runs fine on other distros too.

j4k3@lemmy.world · 1 year ago

IMO it is the integration with toolbx that is interesting to me, but I struggle with these things.

witten@lemmy.world · 1 year ago

Well feel free to make a post about any problems you run into with it… Someone here may be able to help!

spark431@kbin.social · 1 year ago

It’s not too bad, you can define the UID that your podman process runs as, since it can run as its own process and not be reliant on a daemon. It makes you do a little more admin work, but honestly you have to start doing that anyway in a lot of environments.

I have had good experimental experiences with podman, but my biggest barrier to adopting it is how old the packaged version with debian is. I’m hoping to see if I can do some damage with the release of bookworm.

witten@lemmy.world · 1 year ago

Oh yeah, I hear you about that ancient Debian stable version of Podman. I actually ended up upgrading some of my servers to Bookworm for that very reason.

burningquestion@lemmy.world · edited 1 year ago

deleted by creator

misosoup64@lemmy.world · 1 year ago

Docker and k8s aren’t the same thing though? Docker is a container runtime while k8s is an orchestration platform.

stevedave@lemmy.world · 1 year ago

I’ve used podman on an RHEL server at work because it works nicely with selinux. I had a hell of a time with rootless containers and network throughput when using an nginx reverse proxy. Made the site painfully slow. Turned out it was due to the slirp4netns rootless networking and MTU size. Just decided to say screw the rootless thing and went rootful. Next time honestly would just use docker since it’s more common.
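For anyone hitting the same slowdown and wanting to stay rootless, slirp4netns takes an mtu option on the --network flag. The sketch below only prints the command; 1500 is an assumed starting value, and which value helps, if any, depends on your network.

```shell
#!/usr/bin/env bash
# Sketch: build a --network value that overrides the slirp4netns MTU.
# The default of 1500 here is an assumption, not a recommendation from the thread.

slirp_network() {
    echo "slirp4netns:mtu=${1:-1500}"
}

# Printed rather than executed; image and port mapping are placeholders.
echo "podman run --rm --network $(slirp_network 1500) -p 8080:80 docker.io/library/nginx:latest"
```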

witten@lemmy.world · 1 year ago

That is one major caveat of Podman I neglected to mention above… Non-root does mean you give up some performance over running as root. Trade-offs!

float@feddit.de · 1 year ago

Not necessarily. For networking, I wrote a bash script with just a few lines that creates and assigns a private networking namespace to a pod and sets up the default routes. That script is run by a systemd user instance and has the suid flag set. One could argue that it’s not rootless because of that but that’s just the moment when it’s starting. No performance impact and very robust. A lot better than the docker network bridges imho.