6

Has anyone seen this model's implementation using Keras?

inb4: tensorflow, pytorch

Stephen Rauch
  • 1,831
  • 11
  • 23
  • 34
Anton
  • 243
  • 2
  • 10

3 Answers3

2

Update for anyone googling this in 2021: Keras has implemented a MultiHead attention layer. If key, query, and value are the same, this is self-attention.

2

Here is an implementation from PyPI.

Stephen Rauch
  • 1,831
  • 11
  • 23
  • 34
eugen
  • 146
  • 4
0

One example from Kaggle is available.

Stephen Rauch
  • 1,831
  • 11
  • 23
  • 34
silverstone
  • 126
  • 5