From DIET to Deployment: Training Your First Rasa NLU Model
Source: Dev.to
CRF showed us structured entity extraction. DIET showed us joint intent–entity learning. Now it’s time to move from theory to practice.
Understanding models is important, but models are useless without data—that’s where real NLU development begins.
We’ve already discussed how DIET works internally. The practical question is: How do we actually train it?
Rasa training consists of three core steps:
- Create structured training data
- Configure the NLU pipeline
- Train the model
Below we walk through each step.
Generating Training Data
Rasa models learn entirely from annotated examples. Instead of writing rules, you provide examples that illustrate the intents and entities you expect.
The NLU File
Training data lives in a YAML file, typically data/nlu.yml:
```yaml
version: "3.1"

nlu:
- intent: book_flight
  examples: |
    - Book a flight to [Paris](location)
    - I want to fly to [Berlin](location)
    - Get me a ticket to [London](location)

- intent: greet
  examples: |
    - Hello
    - Hi
    - Hey there
```
Notes
- Intents are the top‑level labels.
- Entities are annotated inline as `[text](entity)`.
- No separate entity file is needed; DIET learns both tasks from this single dataset.
How Much Data Do You Need?
There’s no magic number, but a common guideline is:
| Use case | Examples per intent |
|---|---|
| Minimum prototype | 10–15 |
| Production baseline | 50–100 |
Diverse phrasing is far more valuable than repetitive patterns.
Bad example (repetitive):
- Book a flight to Paris
- Book a flight to Berlin
- Book a flight to London
Good example (varied):
- I need to travel to Paris
- Can you find flights to Berlin?
- Get me a ticket heading to London
- Fly me to Rome tomorrow
Variation teaches the model to generalise.
Training the Model
Once the data and pipeline configuration are ready, training is a single command:
```shell
rasa train
```
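The pipeline itself is defined in `config.yml`. A minimal sketch is shown below; the component choices are illustrative, and a production pipeline may differ:

```yaml
# config.yml — a minimal DIET pipeline (illustrative, not a recommendation)
language: en
pipeline:
  - name: WhitespaceTokenizer
  - name: CountVectorsFeaturizer
  - name: CountVectorsFeaturizer
    analyzer: char_wb
    min_ngram: 1
    max_ngram: 4
  - name: DIETClassifier
```

The two `CountVectorsFeaturizer` entries provide word-level and character-level features; `DIETClassifier` then handles both intent classification and entity extraction.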
Behind the scenes Rasa:
- Reads the NLU data and builds a vocabulary.
- Initialises the DIET model and runs multiple training epochs.
- Optimises the joint loss for intent and entity prediction.
- Saves the trained artefacts (e.g., `models/20260215-123456.tar.gz`) containing the NLU and dialogue models.
What Happens During Training?
- Text is tokenised.
- Tokens are vectorised.
- Transformer layers process context.
- Intent and entity losses are computed jointly.
- Gradients update shared weights.
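Conceptually, the joint objective sums the two task losses over shared parameters. A toy sketch in plain Python illustrates the idea; DIET's actual implementation uses similarity-based losses and additional terms, so this is only the conceptual shape:

```python
import math

def cross_entropy(probs, target_idx):
    """Negative log-likelihood of the correct class."""
    return -math.log(probs[target_idx])

def joint_loss(intent_probs, intent_target, entity_probs_per_token, entity_targets):
    """Joint objective: intent loss plus summed per-token entity loss.
    Both terms backpropagate into the same shared transformer weights."""
    l_intent = cross_entropy(intent_probs, intent_target)
    l_entity = sum(
        cross_entropy(p, t)
        for p, t in zip(entity_probs_per_token, entity_targets)
    )
    return l_intent + l_entity

# One utterance: a distribution over 2 intents, plus entity-tag
# distributions for each of its 2 tokens.
loss = joint_loss(
    intent_probs=[0.9, 0.1], intent_target=0,
    entity_probs_per_token=[[0.2, 0.8], [0.7, 0.3]],
    entity_targets=[1, 0],
)
```

Because both losses flow into the same shared layers, improving entity extraction can also sharpen intent classification, and vice versa.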
Typical hyper‑parameters you may tune:
- `epochs`
- `learning_rate` (advanced)
- `embedding_dimension`
- `batch_size`
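These are set directly on the `DIETClassifier` entry in `config.yml`. The values below are illustrative defaults to experiment with, not recommendations:

```yaml
pipeline:
  - name: DIETClassifier
    epochs: 200
    learning_rate: 0.002
    embedding_dimension: 30
    batch_size: [64, 256]
```

Start by tuning `epochs` alone; touch the others only once you have an evaluation set to measure against.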
Testing the Model
After training, launch an interactive NLU testing shell:

```shell
rasa shell nlu
```
Type an example, e.g., “Book a flight to Madrid tomorrow”, and you’ll receive a response like:
```json
{
  "intent": {
    "name": "book_flight",
    "confidence": 0.94
  },
  "entities": [
    {
      "entity": "location",
      "value": "Madrid"
    }
  ]
}
```
This is DIET in action, trained on your data.
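The same result shape comes back as JSON when you query a running assistant programmatically (e.g. via Rasa's HTTP API). A small helper for pulling out the top intent and entity values from a payload like the one above:

```python
import json

# Sample payload in the shape shown above.
RAW = """
{
  "intent": {"name": "book_flight", "confidence": 0.94},
  "entities": [{"entity": "location", "value": "Madrid"}]
}
"""

def top_intent(payload: dict) -> tuple[str, float]:
    """Return (intent name, confidence) from a parse result."""
    intent = payload["intent"]
    return intent["name"], intent["confidence"]

def entity_values(payload: dict, entity_type: str) -> list[str]:
    """Collect all extracted values of a given entity type."""
    return [e["value"] for e in payload.get("entities", [])
            if e["entity"] == entity_type]

result = json.loads(RAW)
name, confidence = top_intent(result)
locations = entity_values(result, "location")
```

A downstream action would typically gate on the confidence (e.g. ask a clarifying question below some threshold) before acting on the intent.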
Common Beginner Mistakes
| Pitfall | Why it hurts |
|---|---|
| Too few examples | Model cannot learn variability. |
| Overlapping intents | Causes ambiguity and low confidence. |
| Copy‑paste variations | Leads to overfitting on narrow phrasing. |
| Mixing business logic into NLU | Blurs the separation of concerns. |
| Ignoring real user phrasing | Model fails in production. |
Focus on data quality: diverse phrasing, balanced intents, clear entity boundaries, and minimal overlap. Remember the iterative loop:
Train → Test → Improve → Retrain.
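One cheap guard against the "overlapping intents" pitfall is to flag identical example texts that appear under more than one intent. A hypothetical helper script (not part of the Rasa CLI) sketches the check:

```python
from collections import defaultdict

def find_overlaps(nlu_data: dict[str, list[str]]) -> dict[str, list[str]]:
    """Map each duplicated example text to the intents it appears under."""
    seen = defaultdict(set)
    for intent, examples in nlu_data.items():
        for text in examples:
            seen[text.strip().lower()].add(intent)
    # Keep only texts claimed by two or more intents.
    return {text: sorted(intents)
            for text, intents in seen.items() if len(intents) > 1}

data = {
    "book_flight": ["Book a flight to Paris", "Get me a ticket"],
    "book_train": ["Get me a ticket", "Book a train to Lyon"],
}
overlaps = find_overlaps(data)
```

Running a check like this before each retrain catches ambiguous examples early, when they are still cheap to relabel.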
Where We Go Next
Now you know how to:
- Generate training data
- Configure DIET
- Train a Rasa model
The next step is to connect NLU to dialogue management:
- Domain files
- Stories & rules
- Slot filling
Predicting intent is only step one; building behaviour is step two. Stay tuned for the end‑to‑end assistant tutorial.