Context:
I am trying to understand the differences between the GRU/LSTM cells in TensorFlow and PyTorch (for research reproducibility). I noticed that TensorFlow distinguishes between the kernel_initializer and the recurrent_initializer (see the documentation for GRU/LSTM), while PyTorch does not expose any such built-in initializer arguments (though you can overwrite the parameters manually for custom initialization; see the torch documentation for GRU/LSTM).
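For concreteness, this is the Keras API I mean; the initializer values shown are the documented defaults:

```python
import tensorflow as tf

# Keras exposes two separate initializer arguments
# (the values below are the documented defaults):
lstm = tf.keras.layers.LSTM(
    units=16,
    kernel_initializer="glorot_uniform",   # input -> hidden weights
    recurrent_initializer="orthogonal",    # hidden -> hidden weights
)
```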
Question:
What is the difference between the kernel weights and the recurrent kernel weights in LSTMs and GRUs? Why is the initialization of the two different (in TensorFlow), and how would I replicate such an initialization in PyTorch?
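To illustrate what I mean by "replicate", here is a sketch of what I imagine it might look like, assuming TensorFlow's documented defaults (glorot_uniform for the kernel, orthogonal for the recurrent kernel, zeros for biases) and PyTorch's weight_ih_*/weight_hh_* parameter naming:

```python
import torch
import torch.nn as nn

def tf_style_init_(rnn: nn.Module) -> None:
    """Re-initialize a PyTorch RNN module to mimic TF/Keras defaults (sketch)."""
    for name, param in rnn.named_parameters():
        if "weight_ih" in name:
            # kernel (input -> hidden): TF default is glorot_uniform
            nn.init.xavier_uniform_(param)
        elif "weight_hh" in name:
            # recurrent kernel (hidden -> hidden): TF default is orthogonal
            nn.init.orthogonal_(param)
        elif "bias" in name:
            # TF default is zeros (ignoring Keras's unit_forget_bias for LSTM)
            nn.init.zeros_(param)

lstm = nn.LSTM(input_size=8, hidden_size=16)
tf_style_init_(lstm)
```

I am not sure whether applying orthogonal_ to PyTorch's stacked (4*hidden_size, hidden_size) weight_hh matrix is exactly equivalent to Keras initializing its recurrent_kernel as a whole, or whether the different gate ordering between the frameworks matters here.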