Set up Ollama, NGROK, and LangChain

Published: February 13, 2026 at 07:25 AM EST
2 min read
Source: Dev.to

Breno A. V.

This post shows how to quickly set up Ollama with ngrok and use it in LangChain.

Ollama

You can easily download Ollama for Linux, macOS, and Windows from the official site.

Test if it’s running

curl http://localhost:11434; echo   # should output "Ollama is running"

Note: If nothing is returned, start the server with ollama serve.

Pull your preferred models

ollama pull phi4-mini
ollama pull nomic-embed-text:v1.5

Testing the Ollama installation

Pulling models in Ollama

Ngrok

Download ngrok for Linux, macOS, and Windows from the ngrok website.

Note: Remember to add your authtoken!

ngrok config add-authtoken 

Expose Ollama with ngrok (basic auth)

ngrok http 11434 \
  --host-header="localhost:11434" \
  --basic-auth="username:password"

Public URL from ngrok

The command generates a public URL, e.g.:

https://09c6b3946ddd.ngrok-free.app

You can test it in a browser:

Testing the public URL

After successful authentication you’ll see “Ollama is running”:

Ollama running response
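Besides the browser, you can script the same check. A minimal Python sketch, assuming the example URL and the username:password pair from the ngrok command above (replace them with your own values):

```python
import base64
import urllib.request

BASE_URL = "https://09c6b3946ddd.ngrok-free.app"  # the public URL printed by ngrok

def basic_auth_header(username: str, password: str) -> str:
    """Build the value for an HTTP Basic Authorization header."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("utf-8")
    return f"Basic {token}"

headers = {"Authorization": basic_auth_header("username", "password")}
print(headers["Authorization"])

# Uncomment once the tunnel is up; it should answer "Ollama is running":
# req = urllib.request.Request(BASE_URL, headers=headers)
# print(urllib.request.urlopen(req).read().decode("utf-8"))
```

The same header value is what the LangChain section below passes through client_kwargs.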

You can also verify the models via the /api/tags endpoint:

/api/tags response

✨ The two models pulled earlier are now ready to use.
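If you want to check the tag list in code rather than in the browser, a small parsing helper may come in handy. The sample payload below is illustrative and trimmed; real /api/tags responses carry extra fields such as size and digest:

```python
import json

def model_names(tags_json: str) -> list[str]:
    """Extract model names from an Ollama /api/tags response body."""
    return [m["name"] for m in json.loads(tags_json)["models"]]

# Illustrative payload in the /api/tags shape (extra fields trimmed):
sample = '{"models": [{"name": "phi4-mini:latest"}, {"name": "nomic-embed-text:v1.5"}]}'
print(model_names(sample))  # → ['phi4-mini:latest', 'nomic-embed-text:v1.5']
```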

LangChain & Ollama

Install the required Python packages:

pip install langchain langchain-core langchain-ollama

Use Ollama (LLM & Embeddings) from another network

import base64
from langchain_ollama.llms import OllamaLLM
from langchain_ollama.embeddings import OllamaEmbeddings

BASE_URL = "https://09c6b3946ddd.ngrok-free.app"
user_pass = b"username:password"
auth_header = base64.b64encode(user_pass).decode("utf-8")

client_kwargs = {
    "headers": {
        "Authorization": f"Basic {auth_header}"
    }
}

llm = OllamaLLM(
    model="phi4-mini:latest",
    base_url=BASE_URL,
    client_kwargs=client_kwargs,
)

embeddings = OllamaEmbeddings(
    model="nomic-embed-text:v1.5",
    base_url=BASE_URL,
    client_kwargs=client_kwargs,
)

# Test the LLM
print(llm.invoke("Hello, Ollama LLM =)"))

# Test the embeddings
print(embeddings.embed_query("Hello, Ollama Embeddings =)"))

Result example:

LangChain test output
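A typical next step with the embeddings is comparing two texts. This cosine-similarity helper is a plain-Python sketch (no server needed) that you can feed two embed_query results into:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# e.g. cosine_similarity(embeddings.embed_query("cat"), embeddings.embed_query("dog"))
print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))  # identical vectors → 1.0
```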
