Skip to content
This repository was archived by the owner on Sep 9, 2024. It is now read-only.
This repository was archived by the owner on Sep 9, 2024. It is now read-only.

Why the lose value is Nan at the beginning of training sometimes? #46

Description

@Lzyin

Describe the bug
A clear and concise description of what the bug is.

To Reproduce
Steps to reproduce the behavior (e.g. the command that you used).

Expected behavior
A clear and concise description of what you expected to happen.

Desktop (please complete the following information):

  • OS: [e.g. Linux/Windows]
  • CUDA : yes/no

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions