docker ps
Method 2: Running Ollama with Docker compose
Ollama exposes an API on http://localhost:11434, allowing other tools to connect and interact with it. That was when I got hooked on the idea of setting up Ollama inside Docker and leveraging GPU acceleration.

docker exec -it ollama <commands>
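As a quick illustration of that API (a hedged sketch: it assumes the container is up and that a model such as llama3 has already been pulled), you can query it directly with curl:

# List the models Ollama currently has available
curl http://localhost:11434/api/tags
# Ask for a one-off completion
curl http://localhost:11434/api/generate -d '{"model": "llama3", "prompt": "Why is the sky blue?", "stream": false}'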
I’m considering testing it with Jellyfin for hardware-accelerated transcoding, which would be a huge boost for my media server setup.

Now, to install the NVIDIA Container Toolkit, follow these steps:
- Using a one-liner docker run command
- With Docker compose
Ollama has been a game-changer for running large language models (LLMs) locally, and I’ve covered quite a few tutorials on setting it up on different devices, including my Raspberry Pi.
We’ll start by creating a docker-compose.yml file to manage the Ollama container:

That said, I’d love to hear about your setup! Are you running Ollama in Docker, or do you prefer a native install? Have you tried any Web UI clients, or are you sticking with the command line?

docker run -d --gpus=all -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
But as I kept experimenting, I realized there was still another fantastic way to run Ollama: inside a Docker container.

We’ve actually covered 12 different tools that provide a Web UI for Ollama.

This is really easy: you can access the Ollama container shell by typing:
- docker run -d: Runs the container in detached mode.
- --name ollama: Names the container “ollama”.
- -p 11434:11434: Maps port 11434 from the container to the host.
- -v ollama:/root/.ollama: Creates a persistent volume for storing models.
- ollama/ollama: Uses the official Ollama Docker image.
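As a side note, not part of the original walkthrough, you can see where Docker keeps that named volume on the host:

# Prints the volume’s mount point and metadata
docker volume inspect ollama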

In this guide, I’ll walk you through two ways to run Ollama in Docker with GPU support:

On another note, diving deeper into NVIDIA Container Toolkit has sparked some interesting ideas. The ability to pass GPU acceleration to Docker containers opens up possibilities beyond just Ollama.
echo 'alias ollama="docker exec -it ollama ollama"' >> $HOME/.bashrc
source $HOME/.bashrc
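With the alias in place, any ollama command typed on the host is transparently forwarded into the container, for example:

# Equivalent to: docker exec -it ollama ollama list
ollama list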
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
sudo apt update
- Install the NVIDIA Container Toolkit by running the following command in a terminal window:
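On apt-based distros, the install step itself typically looks like this (hedged: the package name and the configure step are assumptions based on NVIDIA’s standard packaging), after which the runtime is registered with Docker and the daemon restarted:

sudo apt install -y nvidia-container-toolkit
# Register the NVIDIA runtime with Docker and restart the daemon
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker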
Running Ollama in Docker provides a flexible and efficient way to interact with local AI models, especially when combined with a UI for easy access over a network.

Once the container is running, you can check its status with:
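For instance (the --filter flag is an optional convenience to narrow the output to the Ollama container):

docker ps --filter "name=ollama"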
Accessing Ollama in Docker
docker run -d --name ollama -p 11434:11434 -v ollama:/root/.ollama ollama/ollama
It wasn’t until I was working on an Immich tutorial that I stumbled upon NVIDIA Container Toolkit, which allows you to add GPU support to Docker containers.
1. Using the Docker shell
Before installation, make sure that you have already installed the GPU drivers on your specific distro.

Add this to your .bashrc file:

Other projects, like Stable Diffusion or AI-powered upscaling, could also benefit from proper GPU passthrough.

There are two main ways:

I’m still tweaking my setup to ensure smooth performance across multiple devices, but so far, it’s working well.
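For the Docker shell route, a minimal sketch (llama3 is just an example model name):

# Open an interactive shell inside the running container
docker exec -it ollama /bin/bash
# Or run an Ollama command directly without entering the shell
docker exec -it ollama ollama run llama3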
2. Using Ollama’s API with Web UI Clients
docker-compose up -d
Now, this isn’t exactly breaking news. The first Ollama Docker image was released back in 2023. But until recently, I always used it with a native install.

Drop your thoughts in the comments below.
- Open WebUI – A simple and beautiful frontend for local LLMs.
- LibreChat – A powerful ChatGPT-like interface supporting multiple backends.
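For instance, Open WebUI can be pointed at the Ollama container roughly like this (a sketch based on Open WebUI’s documented quick start; the port, the OLLAMA_BASE_URL value, and the host-gateway mapping may need adjusting for your setup):

docker run -d -p 3000:8080 \
  --add-host=host.docker.internal:host-gateway \
  -e OLLAMA_BASE_URL=http://host.docker.internal:11434 \
  -v open-webui:/app/backend/data \
  --name open-webui \
  ghcr.io/open-webui/open-webui:main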
Now, let’s dive in.
Conclusion
The NVIDIA Container Toolkit includes the NVIDIA Container Runtime and the NVIDIA Container Toolkit plugin for Docker, which enable GPU support inside Docker containers.

echo 'alias ollama="docker exec -it ollama ollama"' >> $HOME/.zshrc
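A quick way to confirm that GPU passthrough is actually working (an extra sanity check, not one of the original steps) is to run nvidia-smi from inside a throwaway container:

# If the toolkit is set up correctly, this prints the same GPU table as on the host
docker run --rm --gpus=all ubuntu nvidia-smi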
Now that we have Ollama running inside a Docker container, how do we interact with it efficiently?

If you’re setting up Ollama with Open WebUI, I would suggest using Docker volumes instead of bind mounts for a less frustrating experience.

ollama ps
ollama pull llama3
ollama run llama3
version: '3.8'
services:
  ollama:
    image: ollama/ollama
    container_name: ollama
    ports:
      - "11434:11434"
    volumes:
      - ollama:/root/.ollama
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    restart: unless-stopped
volumes:
  ollama:
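Once the file is saved (assuming it sits in the directory you run the command from), the usual Compose lifecycle commands apply:

docker-compose up -d            # start Ollama in the background
docker-compose logs -f ollama   # follow the container logs
docker-compose down             # stop and remove the container; the named ollama volume survives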
Some popular tools that work with Ollama include: