
feat: replace action tokenizer with windowed attention #16

Open
imitation-alpha wants to merge 1 commit into AlmondGod:main from imitation-alpha:feature/action-tokenizer-window-attention

Conversation

@imitation-alpha

Summary

This PR replaces the "mean pool + concat" mechanism in the LatentActionsEncoder with a "length-2 windowed attention + mean" mechanism. This change aims to better capture temporal dependencies between adjacent frames during action tokenization.

Changes

  • Modified models/latent_actions.py:
    • Imported SpatialAttention from models.st_transformer.
    • Updated LatentActionsEncoder to use SpatialAttention on concatenated windows of current and next frames.
    • Removed the old mean pooling and concatenation logic.
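The "length-2 windowed attention + mean" idea described above can be sketched roughly as follows. This is an illustrative stand-in, not the actual `models/latent_actions.py` code: the class name `WindowedActionPool` is hypothetical, and it uses plain `nn.MultiheadAttention` in place of the repo's `SpatialAttention`.

```python
import torch
import torch.nn as nn

class WindowedActionPool(nn.Module):
    """Illustrative sketch: attend over a concatenated (frame_t, frame_t+1)
    token window, then mean-pool to get one latent action per transition."""

    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        # Stand-in for the repo's SpatialAttention from models.st_transformer.
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        # frames: (B, T, N, D) -- batch, time, spatial tokens, embed dim
        B, T, N, D = frames.shape
        cur, nxt = frames[:, :-1], frames[:, 1:]      # adjacent frame pairs
        window = torch.cat([cur, nxt], dim=2)          # (B, T-1, 2N, D)
        window = window.reshape(B * (T - 1), 2 * N, D)
        out, _ = self.attn(window, window, window)     # attention over the length-2 window
        out = out.mean(dim=1)                          # mean over window tokens
        return out.reshape(B, T - 1, D)                # one latent action per transition
```

Compared with "mean pool + concat", attending over the joint window lets tokens from the current and next frame interact before pooling, which is the temporal-dependency benefit the summary claims.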

Verification

  • Verified the implementation with a synthetic test script (scripts/verify_latent_actions.py - deleted after verification).
  • Confirmed that the model processes input frames and produces output actions with the correct dimensions.
  • Loss calculation works as expected.
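The kind of synthetic check the (now-deleted) `scripts/verify_latent_actions.py` performed might look like the sketch below. The encoder here is a hypothetical `StandInEncoder` built from `nn.Linear`, not the repo's `LatentActionsEncoder`; the dimensions and the dummy loss are illustrative.

```python
import torch
import torch.nn as nn

def verify_encoder(encoder: nn.Module, frame_dim: int, action_dim: int) -> None:
    frames = torch.randn(2, 6, frame_dim)       # (batch, time, features)
    actions = encoder(frames)                   # expect one action per transition
    assert actions.shape == (2, 5, action_dim), actions.shape
    loss = actions.pow(2).mean()                # dummy loss to exercise backward()
    loss.backward()                             # confirm gradients flow
    assert all(p.grad is not None for p in encoder.parameters())

class StandInEncoder(nn.Module):
    """Hypothetical minimal encoder: concat adjacent frames, project."""
    def __init__(self, frame_dim: int, action_dim: int):
        super().__init__()
        self.proj = nn.Linear(2 * frame_dim, action_dim)

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        pairs = torch.cat([frames[:, :-1], frames[:, 1:]], dim=-1)
        return self.proj(pairs)

verify_encoder(StandInEncoder(16, 8), frame_dim=16, action_dim=8)
```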

Notes

  • This is a breaking change for LatentActionsEncoder checkpoints.

@imitation-alpha imitation-alpha force-pushed the feature/action-tokenizer-window-attention branch from 93ed906 to 05765b0 Compare November 29, 2025 06:09
@AlmondGod
Owner

this looks great! can you train a working world model to confirm the impact of the change?

@NewJerseyStyle

Sorry to interrupt. I am not an expert, but I am curious whether there are "KPIs" to monitor to evaluate the impact of a change?
For example:

  • How to confirm it does not get worse: monitor the number of steps needed to converge?
  • How to confirm it gets better: monitor the model's loss?

@AlmondGod
Owner

> Sorry to interrupt. I am not an expert, but I am curious whether there are "KPIs" to monitor to evaluate the impact of a change? For example:
>
>   • How to confirm it does not get worse: monitor the number of steps needed to converge?
>   • How to confirm it gets better: monitor the model's loss?

yes, I'll add a readme PR section specifying the necessary criteria
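The two KPIs discussed above (steps to converge, and final loss) could be computed from a training loss history with something like this sketch; the function name and threshold are illustrative, not from the repo:

```python
def convergence_kpis(loss_history, threshold=0.1):
    """Hypothetical helper: summarize a run's loss curve into the two KPIs
    discussed above -- steps to converge (does the change slow training?)
    and final loss (does it improve quality?)."""
    steps_to_converge = next(
        (i for i, loss in enumerate(loss_history) if loss < threshold), None
    )
    return {"steps_to_converge": steps_to_converge, "final_loss": loss_history[-1]}
```

Comparing these numbers between a baseline run and a run with the windowed-attention encoder would give a concrete before/after signal.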
