TensorNetwork.org and tensors.net are great places to see introductory examples, the math3ma blog has a wonderful little intuitive introduction, and Hand-waving and interpretive dance: an introductory course on tensor networks is a nice reference for understanding this notation more systematically. Additionally, I have written my own introduction to the notation, heavily inspired by the other references. It's aimed at a machine learning / interpretability context, but should be mostly understandable to everyone.
Here are some excerpts:
In graphical notation, tensors are represented as shapes with "legs" sticking out of them. A vector can be represented as a shape with one leg, a matrix can be represented as a shape with two legs, and so on:

Each leg corresponds to an index of the tensor, so specifying an integer value for each leg addresses a single number inside it:

where 0.157 happens to be the number in the (i=0, j=2, k=0) position of the tensor A. In Python, this would be A[0,2,0]. The amount of memory required to store a tensor grows exponentially with the number of legs, so tensors with many legs are usually represented only implicitly: decomposed into operations between many smaller tensors.
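As a minimal sketch of this indexing in NumPy (the shape and values of A here are made up for illustration, so the entry won't literally be 0.157):

```python
import numpy as np

# A hypothetical three-leg tensor with illustrative dimensions (2, 3, 2).
A = np.random.rand(2, 3, 2)

# Specifying an integer for each leg addresses one number inside the tensor:
# this is the (i=0, j=2, k=0) entry.
print(A[0, 2, 0])

# Memory grows exponentially with the number of legs: a tensor with 30 legs
# of dimension 2 already holds 2**30 (about a billion) entries.
```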
Connecting the legs of two tensors indicates a tensor contraction (also known as an Einstein summation). Here are the most common kinds of contractions between vectors and matrices:

In every case, you can tell how many legs the resulting tensor will have by counting the uncontracted "free" legs on the left.
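Which specific contractions the figure shows isn't reproduced here, but a few of the usual vector/matrix contractions can be sketched with np.einsum, where the index string plays the role of the diagram (repeated letters are contracted legs, letters after the arrow are free legs):

```python
import numpy as np

v = np.random.rand(4)
w = np.random.rand(4)
M = np.random.rand(3, 4)
N = np.random.rand(4, 5)

# Inner product: both legs contracted, no free legs -> a number.
np.einsum('i,i->', v, w)        # same as v @ w

# Matrix-vector product: one free leg remains -> a vector.
np.einsum('ij,j->i', M, v)      # same as M @ v

# Matrix-matrix product: two free legs remain -> a matrix.
np.einsum('ij,jk->ik', M, N)    # same as M @ N

# Outer product: nothing contracted, two free legs -> a matrix.
np.einsum('i,j->ij', v, w)      # same as np.outer(v, w)
```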
But graphical notation is most useful for representing unfamiliar operations between many tensors. One example in this direction is $\sum_{\alpha \beta} A_{i \alpha \beta}v_{\beta}B_{\alpha \beta j}=M_{ij}$, which can be represented in graphical notation as

The middle part of the graphical notation here shows that the number in each i, j position of the final matrix can be calculated with a sum over every possible indexing of the internal legs α and β, where each term in the sum consists of three numbers being multiplied (though in practice the contraction should be calculated in a much more efficient way).
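For concreteness, here is a sketch of this particular contraction in NumPy (the dimensions are arbitrary; only the pattern of shared indices matters), both as a single einsum call and as the explicit sum over α and β that the middle part of the diagram depicts:

```python
import numpy as np

# Illustrative dimensions for i, j and the internal legs alpha, beta.
d_i, d_j, d_a, d_b = 2, 3, 4, 5
A = np.random.rand(d_i, d_a, d_b)
v = np.random.rand(d_b)
B = np.random.rand(d_a, d_b, d_j)

# M_ij = sum_{alpha, beta} A_{i alpha beta} v_beta B_{alpha beta j}
M = np.einsum('iab,b,abj->ij', A, v, B)

# The same sum written out explicitly (much slower, but mirrors the diagram:
# each entry of M is a sum over every indexing of the internal legs,
# with three numbers multiplied in each term).
M_explicit = np.zeros((d_i, d_j))
for i in range(d_i):
    for j in range(d_j):
        for a in range(d_a):
            for b in range(d_b):
                M_explicit[i, j] += A[i, a, b] * v[b] * B[a, b, j]

assert np.allclose(M, M_explicit)
```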
Graphical notation really comes into its own when dealing with larger networks of tensors. For example, consider the contraction
$\sum_{i\,j\,k\,l\,m\,n\,o\,p\,q\,r\,s\,t\,u} A_{ij} V_{ir} B_{jkl} W_{rks} C_{lmn} X_{smt} D_{nop} Y_{tou} E_{pq} Z_{uq},$
which is tedious to parse: indices must be matched up across tensors, and it is not immediately clear what kind of tensor (e.g. number, vector, matrix, ...) the result will be. But in graphical notation this is
and we can immediately see which tensors are to be contracted, and that the result will be a single number. Contractions like this can be performed in any order. Some contraction orders are much more efficient than others, but they all get the same answer eventually.
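As a sketch of this larger contraction in NumPy (all bond dimensions are set to 2 purely for illustration), the whole network can be handed to a single einsum call, with optimize=True letting NumPy choose an efficient contraction order:

```python
import numpy as np

d = 2  # illustrative dimension for every leg
A, E = np.random.rand(d, d), np.random.rand(d, d)
V, Z = np.random.rand(d, d), np.random.rand(d, d)
B, C, D = (np.random.rand(d, d, d) for _ in range(3))
W, X, Y = (np.random.rand(d, d, d) for _ in range(3))

# Every index appears exactly twice, so all legs are contracted and the
# result has no free legs: a single number.
result = np.einsum('ij,ir,jkl,rks,lmn,smt,nop,tou,pq,uq->',
                   A, V, B, W, C, X, D, Y, E, Z,
                   optimize=True)
print(result)
```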