Update rnn.py, fix `torch.nn.RNN` document error #153620

AIboy996 · 2025-05-15T13:35:55Z

I found the same issue as #147490 (@jibril-b-coulibaly).

There's an equivalent in the doc-string of torch.nn.RNN:

# Efficient implementation equivalent to the following with bidirectional=False
def forward(x, hx=None):
    if batch_first:
        x = x.transpose(0, 1)
    seq_len, batch_size, _ = x.size()
    if hx is None:
        hx = torch.zeros(num_layers, batch_size, hidden_size)
    h_t_minus_1 = hx
    h_t = hx
    output = []
    for t in range(seq_len):
        for layer in range(num_layers):
            h_t[layer] = torch.tanh(
                x[t] @ weight_ih[layer].T
                + bias_ih[layer]
                + h_t_minus_1[layer] @ weight_hh[layer].T
                + bias_hh[layer]
            )
        output.append(h_t[-1])
        h_t_minus_1 = h_t
    output = torch.stack(output)
    if batch_first:
        output = output.transpose(0, 1)
    return output, h_t

However there's something wrong.

Like mentioned in Documentation: fix RNN example for multiple layers #147490, line 499 is wrong

pytorch/torch/nn/modules/rnn.py

Line 499 in fb55bac

x[t] @ weight_ih[layer].T

The input for RNNCell should be different for different layers.

The code contains several hidden reference-related issues that may result in unintended modifications to tensors. For example in line 504, this causes all elements in the final output list to point to the same tensor.

pytorch/torch/nn/modules/rnn.py

Line 504 in fb55bac

output.append(h_t[-1])

Some variable is not defined. Despite being a relatively minor issue in annotation, it can lead to significant confusion for those who are new to the concept. For example weight_ih in line 499

pytorch/torch/nn/modules/rnn.py

Line 499 in fb55bac

x[t] @ weight_ih[layer].T

So, i write a runnable version to make it more clear:

# Efficient implementation equivalent to the following with bidirectional=False
rnn = nn.RNN(input_size, hidden_size, num_layers)
params = dict(rnn.named_parameters())
def forward(x, hx=None, batch_first=False):
    if batch_first:
        x = x.transpose(0, 1)
    seq_len, batch_size, _ = x.size()
    if hx is None:
        hx = torch.zeros(rnn.num_layers, batch_size, rnn.hidden_size)
    h_t_minus_1 = hx.clone()
    h_t = hx.clone()
    output = []
    for t in range(seq_len):
        for layer in range(rnn.num_layers):
            input_t = x[t] if layer == 0 else h_t[layer - 1]
            h_t[layer] = torch.tanh(
                input_t @ params[f"weight_ih_l{layer}"].T
                + h_t_minus_1[layer] @ params[f"weight_hh_l{layer}"].T
                + params[f"bias_hh_l{layer}"]
                + params[f"bias_ih_l{layer}"]
            )
        output.append(h_t[-1].clone())
        h_t_minus_1 = h_t.clone()
    output = torch.stack(output)
    if batch_first:
        output = output.transpose(0, 1)
    return output, h_t

This code can reproduce the computation of torch.nn.RNN.

For example:

import torch
import torch.nn as nn

torch.manual_seed(0)
input_size, hidden_size, num_layers = 3, 5, 2
rnn = nn.RNN(input_size, hidden_size, num_layers)
params = dict(rnn.named_parameters())
x = torch.randn(10, 4, 3)


official_imp = rnn(x)
my_imp = forward(x)

assert torch.allclose(official_imp[0], my_imp[0])
assert torch.allclose(official_imp[1], my_imp[1])

cc @svekars @sekyondaMeta @AlannaBurke @albanD @mruberry @jbschlosser @walterddr @mikaylagawarecki

pytorch-bot · 2025-05-15T13:35:59Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153620

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit eadd6c0 with merge base 7482eb2 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2025-05-15T13:36:02Z

✅login: AIboy996 / (02ec2eb)
✅login: AIboy996 / (02ec2eb, eadd6c0)

The committers listed above are authorized under a signed CLA.

AIboy996 · 2025-05-15T13:39:43Z

@pytorchbot label "topic: not user facing"

AIboy996 · 2025-05-15T13:40:40Z

@pytorchbot label "documentation"

pytorch-bot · 2025-05-15T13:40:49Z

Didn't find following labels among repository labels: documentation

AIboy996 · 2025-05-15T13:41:26Z

@pytorchbot label "module: docs"

AIboy996 · 2025-05-15T13:43:15Z

@pytorchbot label "module: nn"

AIboy996 added 2 commits May 15, 2025 20:56

Update rnn.py, fix RNN document example error

02ec2eb

Update rnn.py, fix RNN document example error

eadd6c0

AIboy996 requested review from albanD, jbschlosser and mikaylagawarecki as code owners May 15, 2025 13:35

pytorch-bot bot added the topic: not user facing topic category label May 15, 2025

pytorch-bot bot added the module: docs Related to our documentation, both in docs/ and docblocks label May 15, 2025

pytorch-bot bot added the module: nn Related to torch.nn label May 15, 2025

pytorchbot added the open source label May 15, 2025

albanD removed their request for review May 15, 2025 14:33

mikaylagawarecki approved these changes May 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update rnn.py, fix `torch.nn.RNN` document error #153620

Update rnn.py, fix `torch.nn.RNN` document error #153620

Update rnn.py, fix torch.nn.RNN document error #153620

Are you sure you want to change the base?

Update rnn.py, fix torch.nn.RNN document error #153620

Conversation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153620

✅ No Failures

Update rnn.py, fix `torch.nn.RNN` document error #153620

Update rnn.py, fix `torch.nn.RNN` document error #153620