```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-160m")
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-160m")

# Tokenize the prompt; shape: (1, seq_len)
input_ids = tokenizer.encode("Hello, my dog is cute", return_tensors="pt")

model.eval()
with torch.no_grad():
    # logits shape: (1, seq_len, vocab_size)
    logits = model(input_ids).logits

print(logits)
# Top 5 logits over the vocabulary dimension at each position
print(torch.topk(logits, k=5))
```
This is my code, and the output is:

For no other Pythia model do the logits get this large: the 410m model, for example, has maximum values of only ~10. Is there a bug in the way the logits are computed?
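For reference, here is a minimal sketch of how one might compare the maximum logit magnitude across checkpoints (the two model names and the prompt are just the ones from above; swap in whatever sizes you want to check):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

# Compare the largest logit magnitude across a few Pythia sizes
for name in ["EleutherAI/pythia-160m", "EleutherAI/pythia-410m"]:
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name)
    model.eval()
    input_ids = tokenizer.encode("Hello, my dog is cute", return_tensors="pt")
    with torch.no_grad():
        logits = model(input_ids).logits
    print(f"{name}: max |logit| = {logits.abs().max().item():.2f}")
```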