-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
@BriansIDP Hi~ Firstly, appreciate your kindly sharing. I have some questions about this implementation, and hope for your replies.
- I notice that the default "layerlist" is "0", and I wonder if numbers of rnn layers effect a lot? Have you tried other settings, such as add rnn before each attention block.
- RNN layers are builded by "nn.LSTM(d_model, rnndim, 1)". I guess if we should additionally set the "batch_first" options to "True". For the first dim of tensor is "batch" in Transformer, but I have not found Reshape operations before rnn_forward.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels