Validate expected nodes by thomasmarshall · Pull Request #3843 · ruby/prism

thomasmarshall · 2026-01-11T14:59:24Z

This PR aims to address #3828 by validating expected node types and wrapping unexpected node types in an ErrorRecoveryNode. This should make it easier for consumer programs to handle invalid Ruby code. For example:

module def foo end

This is parsed as a ModuleNode where constant_path is a DefNode. After this change, constant_path will be an ErrorRecoveryNode with its child set to the DefNode. This way clients can just check PM_NODE_TYPE(PM_ERROR_RECOVERY_NODE) to understand if the expected node is either missing (formerly MissingNode, now ErrorRecoveryNode with empty child) or an unexpected node.

As per the suggestion from @kddnewton, I folded the missing nodes and unexpected nodes into a single ErrorRecoveryNode type. I also added tests for currently known error recovery scenarios and switched to validating expected node types at the callsites rather than in the constructors.

From an implementation perspective, I think it could be nicer to have comprehensive validations in the constructors because it doesn't require understanding any of the parsing logic (I think my changes are correct, but likely harder to review than validations in constructors). They could also potentially be codegen'd from config.yml somehow but I suppose we'd be doing unnecessary work in cases where unexpected nodes are not possible.

I split the changes into a series of small commits that should make it more straightforward to understand what is changing for each node type:

The first commit captures the existing error recovery scenarios in a Ruby test.
Then we rename MissingNode to ErrorRecoveryNode.
Then we setup the function validation macro.
Then there is a separate commit for each node type with an on error case in one of its fields in config.yml.
Finally, the last commit removes on error from config.yml as per this suggestion:

I would go ahead and get rid of the on error stuff in config.yml

However, I'm not 100% sure this is what was meant, I might have misunderstood.

This is a somewhat substantial change, so I understand if it's a little more difficult to review or feedback on. I'm very happy to make any changes to the approach and implementation, please just let me know!

eregon · 2026-01-15T19:56:20Z

I also added tests for currently known error recovery scenarios and switched to validating expected node types at the callsites rather than in the constructors.

FYI the Ruby code already has logic to check field kinds in the Ruby nodes constructors, search for check_field_kind in the codebase (context: #3022).
Are the added checks here redundant perhaps, or do they cover more somehow?

eregon · 2026-01-15T19:59:32Z

I should add: this PR LGTM from a quick look at the diff

thomasmarshall · 2026-01-16T12:36:54Z

Are the added checks here redundant perhaps, or do they cover more somehow?

The validations I added are on the C side. They check if the node is an expected type and then wrap it in an ErrorRecoveryNode if not. I believe that's different to check_field_kind which is only on the Ruby side, so in that sense they are not redundant.

thomasmarshall added 15 commits January 11, 2026 14:32

Add tests for error recovery scenarios

8c32f51

Rename MissingNode to ErrorRecoveryNode

bf17817

Add pm_unexpected_node_create function

7e1dc8f

Add PM_VALIDATE_NODE_TYPE macro

e101376

Validate AliasGlobalVariableNode

968ee09

Validate AliasMethodNode

9f8334e

Validate ClassNode

8a14cdc

Validate ForNode

ff4eefd

Validate InterpolatedStringNode

bc5f85d

Validate ModuleNode

7b23e44

Validate MultiTargetNode and MultiWriteNode

ec662f5

Validate ParametersNode

a3a218b

Validate PinnedVariableNode

a78c521

Validate RescueNode

aac272e

Add test for def node in module constant path

d1cff2f

thomasmarshall marked this pull request as draft January 11, 2026 15:02

thomasmarshall mentioned this pull request Jan 11, 2026

Identify unexpected nodes in error recovery scenarios #3828

Open

thomasmarshall and others added 2 commits January 13, 2026 15:59

Remove "on error" types from config.yml

c077b32

Sort ErrorRecoveryNode alphabetically

7f060ba

thomasmarshall force-pushed the error-recovery-nodes branch from 1033105 to 7f060ba Compare January 13, 2026 16:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validate expected nodes#3843

Validate expected nodes#3843
thomasmarshall wants to merge 17 commits intoruby:mainfrom
thomasmarshall:error-recovery-nodes

thomasmarshall commented Jan 11, 2026

Uh oh!

eregon commented Jan 15, 2026 •

edited

Loading

Uh oh!

eregon commented Jan 15, 2026

Uh oh!

thomasmarshall commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

thomasmarshall commented Jan 11, 2026

Uh oh!

eregon commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eregon commented Jan 15, 2026

Uh oh!

thomasmarshall commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

eregon commented Jan 15, 2026 •

edited

Loading