Add tiny transformer LLM notebook by cetagostini · Pull Request #2163 · pymc-devs/pytensor

cetagostini · 2026-05-22T07:44:43Z

Summary

Add a tiny decoder-only transformer LLM gallery notebook using PyTensor and xtensor named dimensions.
Lower matmul-shaped xtensor dot contractions to matmul and keep outer products on the existing einsum fallback.
Prefer static shape constants in tensordot reshape shapes and add focused regression coverage.
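
To make the distinction in the summary concrete, here is a minimal NumPy sketch (not the PyTensor rewrite itself) of the two cases. The array names, sizes, and the `bhqd`/`bhkd` subscripts are assumptions chosen for illustration. A contraction over one shared dimension can be expressed as a batched matmul, while an outer product contracts nothing and is written here with einsum.

```python
import numpy as np

rng = np.random.default_rng(0)

# Matmul-shaped contraction: the shared "hd" axis is summed, batch axes are kept.
q = rng.normal(size=(2, 3, 4, 5))  # (batch, head, time_q, hd)
k = rng.normal(size=(2, 3, 6, 5))  # (batch, head, time_k, hd)

via_einsum = np.einsum("bhqd,bhkd->bhqk", q, k)
via_matmul = q @ np.swapaxes(k, -1, -2)  # same values through a batched matmul
matmul_matches = np.allclose(via_einsum, via_matmul)

# Outer product: no axis is contracted, so the einsum form is kept as-is.
a = rng.normal(size=(4,))
b = rng.normal(size=(6,))
outer = np.einsum("i,j->ij", a, b)
outer_matches = np.allclose(outer, np.outer(a, b))
```

Both paths give the same numbers; the difference the summary refers to is only in which primitive the graph ends up using.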

Test plan

conda run -n pytensor-dev python -m ruff check pytensor/tensor/math.py pytensor/xtensor/rewriting/math.py tests/tensor/test_math.py tests/xtensor/test_math.py
`conda run -n pytensor-dev python -m pytest tests/xtensor/test_math.py::test_dot
tests/xtensor/test_math.py::test_dot_lowers_to_matmul
tests/xtensor/test_math.py::test_dot_outer_product_falls_back_to_einsum
tests/xtensor/test_math.py::test_dot_errors tests/xtensor/test_math.py::test_dot_vectorize
tests/tensor/test_math.py::TestTensordot -q`

Add a new gallery notebook demonstrating a tiny decoder-only transformer LLM implemented with pytensor/xtensor (doc/gallery/transformers/tiny_transformer_llm.ipynb). Update .gitignore to exclude AI tool artifacts, gallery downloaded data, and JupyterLab session files. Also apply related updates to math implementation and rewrites (pytensor/tensor/math.py, pytensor/xtensor/rewriting/math.py) and adjust tests (tests/tensor/test_math.py, tests/xtensor/test_math.py) to match the changes.

review-notebook-app · 2026-05-22T07:44:48Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

cetagostini · 2026-05-22T10:51:57Z

While building the notebook I found a few things that were adding overhead in xtensor. After adjusting them, xtensor now ends up even faster than plain tensor.

ricardoV94 · 2026-05-22T11:14:27Z

+    constant when possible, instead of a chain of ``Mul`` nodes over individual
+    ``ScalarConstant``s.


Why? PyTensor will rewrite away the Mul constants, so is this just being eager? We don't want to eagerly use static shapes in actual inputs.

ricardoV94 · 2026-05-22T11:24:39Z

@@ -0,0 +1,1246 @@
+{


Line #1. from pytensor.xtensor.shape import stack as xstack
Use import pytensor.xtensor as ptx instead, and then use ptx.stack and the like.

ricardoV94 · 2026-05-22T11:24:39Z

@@ -0,0 +1,1246 @@
+{


Line #32. scores = px.dot(q, k, dim="hd") / scale # (batch, head, time_q, time_k)
You can write assert scores.dims == ("batch", "head", "time_q", "time_k") to self-document the dims instead of using a comment. It also teaches that the dims are always around for introspection.

ricardoV94 · 2026-05-22T11:24:39Z

@@ -0,0 +1,1246 @@
+{


Line #4. def gen_step(context, rng):
You can still work with xtensor variables: convert to tensor before going into the scan, convert to xtensor inside the scan, convert to tensor before returning from the scan, and convert the scan outputs to xtensor outside as soon as you get them. Basically, handle the conversion at the boundary.

You could also make a while scan that runs until the termination token is emitted.
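
The stopping rule suggested above can be sketched in plain NumPy as follows. This is a Python loop, not a PyTensor scan, and `toy_next_token_logits`, `EOS_TOKEN`, `VOCAB_SIZE`, and `MAX_NEW_TOKENS` are hypothetical stand-ins rather than names from the notebook.

```python
import numpy as np

VOCAB_SIZE = 5
EOS_TOKEN = 0        # hypothetical termination token id
MAX_NEW_TOKENS = 20  # hard cap so the loop always ends


def toy_next_token_logits(context):
    """Hypothetical stand-in for the model: favours EOS as the context grows."""
    logits = np.zeros(VOCAB_SIZE)
    logits[EOS_TOKEN] = len(context) - 4.0
    return logits


def generate(context, rng):
    """Sample tokens one at a time, stopping early once EOS is emitted."""
    tokens = list(context)
    for _ in range(MAX_NEW_TOKENS):
        logits = toy_next_token_logits(tokens)
        probs = np.exp(logits - logits.max())  # numerically stable softmax
        probs /= probs.sum()
        next_token = int(rng.choice(VOCAB_SIZE, p=probs))
        tokens.append(next_token)
        if next_token == EOS_TOKEN:
            break
    return tokens


generated = generate([1, 2], np.random.default_rng(0))
```

A while-style scan would express the same idea symbolically: a per-step function plus a termination condition, with a maximum number of steps as a safety bound.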

ricardoV94 · 2026-05-22T11:30:56Z

This is nice. I don't want the random xtensor changes, though: we need to investigate why it was not simplifying in your case. It may be another symptom of #2056 or something else, but it shouldn't be done in a docs PR.

cetagostini · 2026-05-22T15:42:38Z

@ricardoV94 I followed some of your comments and came up with this: #2164

twiecki · 2026-05-26T14:33:25Z

@@ -0,0 +1,1246 @@
+{


That's cool!

cetagostini marked this pull request as draft May 22, 2026 07:50

small changes

298d02a

cetagostini self-assigned this May 22, 2026

ricardoV94 reviewed May 22, 2026

ricardoV94 added docs xtensor labels May 22, 2026

cetagostini mentioned this pull request May 22, 2026

xtensor: inline Einsum OFG in lower_dot to avoid ShapeFeature compile blow-up #2164

Open

5 tasks

twiecki reviewed May 26, 2026

cetagostini wants to merge 2 commits into pymc-devs:main from cetagostini:cetagostini/llm_example_branch


3 participants
