Error in AdditiveAttentionForSeq call function due to incompatible dimensions #1

rozhix · 2024-07-22T12:54:43Z

Hello,

I'm using the attention-GRU-piecewise-linear RUL Prediction.ipynb notebook. There is an error in the AdditiveAttentionForSeq class, specifically in the call function at the line concat = tf.concat((state_rep, encoder_outputs), axis = -1). It tries to concatenate a 4D array with a 3D array, which is not possible.

I modified the call function as follows:

def call(self, state, encoder_outputs):
seq_len = encoder_outputs.shape[1]
averaged_state = tf.reduce_mean(tf.stack(state, axis = 1), axis = 1)
state_rep = tf.repeat(tf.expand_dims(averaged_state, axis = 1), repeats = seq_len, axis = 1)
shape = tf.shape(state_rep)
seq_len = shape[1]
batch_size = shape[2]
hidden_dims2 = shape[3]
state_rep = tf.reshape(state_rep, (batch_size, seq_len, hidden_dims2))
concat = tf.concat((state_rep, encoder_outputs), axis = -1)
scores = tf.nn.tanh(self.attention(concat))
attention_weights = tf.nn.softmax(tf.reduce_sum(scores, axis = -1), axis = -1)
return tf.matmul(tf.expand_dims(attention_weights, axis = 1), encoder_outputs)

However, I'm not entirely sure if this is the correct approach to fix the error. Could you please help verify this fix or suggest the correct way to handle this issue?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error in AdditiveAttentionForSeq call function due to incompatible dimensions #1

Error in AdditiveAttentionForSeq call function due to incompatible dimensions #1

rozhix commented Jul 22, 2024

Error in AdditiveAttentionForSeq call function due to incompatible dimensions #1

Error in AdditiveAttentionForSeq call function due to incompatible dimensions #1

Comments

rozhix commented Jul 22, 2024