6

Has anyone seen this model's implementation using Keras?

inb4: tensorflow, pytorch

Stephen Rauch
Anton
    You can find a version here: https://github.com/Lsdefine/attention-is-all-you-need-keras. It seems reasonable, but I only looked at the code briefly, so I can't guarantee it matches the paper exactly. – user1587 Jun 08 '18 at 17:41

3 Answers

2

Update for anyone googling this in 2021: Keras now has a built-in MultiHeadAttention layer. If the key, query, and value are the same tensor, it performs self-attention.
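
For illustration, here is a minimal sketch of self-attention with that layer. It assumes TensorFlow 2.4 or later (where `tf.keras.layers.MultiHeadAttention` is available). The tensor sizes, `num_heads`, and `key_dim` are arbitrary values picked for the example, not anything from the paper. It shows only the attention layer, not a full Transformer.

```python
import numpy as np
import tensorflow as tf

# Hypothetical sizes, chosen only for illustration.
batch_size, seq_len, embed_dim = 2, 5, 16

# Random input standing in for a batch of embedded token sequences.
x = np.random.rand(batch_size, seq_len, embed_dim).astype("float32")

# Built-in Keras layer; num_heads and key_dim are arbitrary here.
mha = tf.keras.layers.MultiHeadAttention(num_heads=4, key_dim=8)

# Self-attention: the same tensor is passed as query, value, and key.
output, scores = mha(query=x, value=x, key=x, return_attention_scores=True)

output_shape = tuple(output.shape)  # (batch, seq_len, embed_dim)
scores_shape = tuple(scores.shape)  # (batch, num_heads, seq_len, seq_len)
print(output_shape, scores_shape)
```

By default the output keeps the last dimension of the query, so the layer can be dropped into a residual block without an extra projection. Passing a different tensor as `value` (and `key`) would turn this into cross-attention instead.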

2

Here is an implementation from PyPI.

eugen
0

One example from Kaggle is available.

silverstone