Skip to content

Should apply model.norm layer to hidden_states[early_exit_layer] ? #9

@githubhyz

Description

@githubhyz

logits = self.lm_head(outputs.hidden_states[early_exit_layer])

I think you guys should apply model.norm layer to hidden_states[early_exit_layer] . Because only the last hidden_state applied model.norm layer. See

hidden_states = self.norm(hidden_states)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions