You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on May 16, 2025. It is now read-only.
The following error occurred while running cell 10 in 6. Tune language model using PPO with our preference model.
After adding __init__.py to /content/trlx/examples/summarize_rlhf/reward_model/, I still get the same error.
How can I fix it?
10 import torch
11 from datasets import load_dataset
---> 12 from reward_model.reward_model import GPTRewardModel
13 from tqdm import tqdm
14 from transformers import AutoTokenizer
ModuleNotFoundError: No module named 'reward_model.reward_model'; 'reward_model' is not a package