Nesting analysis using Prism #1092

tompng · 2025-04-11T10:52:27Z

#1024
Migrate nesting analysis from Ripper to Prism

Original nesting calculation with Ripper

Tokenize with ripper
Parse the tokens mainly focusing on open/close tokens
Collect open tokens per line

New nesting calculation with Prism

Traverse syntax tree
Check open_loc and closing_loc for a node that makes nesting, create a token-like object for it
Collect open token-like objects per line

Gemfile

Exclude prism-1.8.0 because test fails. (ruby/prism#3851)

st0012

Can we nudge Prism maintainers to cut v1.8.1 for the fix, and add prism as a dependency and require 1.3.0+?

st0012 · 2026-01-16T18:03:41Z

lib/irb/nesting_parser.rb

-        # scan_opens without block will return a list of open tokens at last token position
-        scan_opens(tokens)
+      # Return a list of open nestings at last token position
+      def open_nestings(parse_lex_result)


Can we make this or parse_by_line take (code, local_variables: []) and call Prism.parse_lex here?
I feel Prism should be hidden from the callers of this class. Then we only need to require it here instead of all the places that'd use a nesting parser.

In a followup pull request #1160, parse_lex_result will be also used in RubyLex#should_continue?.
To avoid parsing the same code two times in dynamic_prompt calculation, argument needs to be parse_lex_result.

@context.io.dynamic_prompt do |lines| parse_lex_result = Prism.parse_lex(code, scopes: [@context.local_variables]) line_results = IRB::NestingParser.parse_by_line(parse_lex_result) ... line_results.map.with_index do |...| # This part requires lex result continue = @scanner.should_continue?(tokens_until_line, line, line_num_offset + 1) ... end end

Maybe we can find a better way to separate after migrating to Prism, I think.

st0012 · 2026-01-16T18:13:22Z

Oh you did this in #1091 already. I guess we just need to wait for the release then

st0012 · 2026-01-18T13:26:50Z

lib/irb/nesting_parser.rb

-            first_token_on_line = true
-          elsif t.event != :on_sp
-            first_token_on_line = false
+          @heredocs[line_index]&.sort_by { |_node, (_line, col)| col }&.reverse_each do |elem|


Should this be:

Suggested change

@heredocs[line_index]&.sort_by { |_node, (_line, col)| col }&.reverse_each do |elem|

@heredocs[line_index]&.sort_by { |elem| elem.pos[1] }&.reverse_each do |elem|

If so, let's add a test case that catches the failure? Something like:

def test_multiple_heredocs_same_line_ordering code = <<~'RUBY' x = <<B + <<A B A RUBY line_results = parse_by_line(code) _prev, opens_line1, _min = line_results[0] assert_equal(['<<B', '<<A'], opens_line1.map(&:tok)) end

Nice catch! It was sorting with nil, no error was raised but not correctly sorted.
I'll add a test case that needs sort <<~A if <<B (<<B appears before <<A in the syntax tree)

lib/irb/nesting_parser.rb

st0012 · 2026-01-18T13:43:39Z

lib/irb/ruby-lex.rb

          preserve_indent
        end
      elsif prev_open_token&.event == :on_embdoc_beg || next_open_token&.event == :on_embdoc_beg
        if prev_open_token&.event == next_open_token&.event


If I understand the code correctly, these "tokens" are actually nesting elements? Can we update these variable names to reflect that?

Yes, it's nesting elements. Updated 👍

It was a token, but changed to NestingParser::NestingElement

st0012 · 2026-01-18T15:36:31Z

test/irb/test_nesting_parser.rb

    end

+    def test_heredoc_sorting
+      # Heredocs appears in the ordef B,A,D,C in syntax tree, but should be processed in A,B,C,D order.


Suggested change

# Heredocs appears in the ordef B,A,D,C in syntax tree, but should be processed in A,B,C,D order.

# Heredocs appears in the order of B,A,D,C in syntax tree, but should be processed in A,B,C,D order.

🙈 (thanks)

tompng mentioned this pull request Jan 15, 2026

Completely migrate to prism #1160

Open

tompng force-pushed the ripper_to_prism_nesting branch 3 times, most recently from be7dfe6 to c3a31b8 Compare January 16, 2026 16:52

tompng marked this pull request as ready for review January 16, 2026 17:28

st0012 reviewed Jan 16, 2026

View reviewed changes

tompng added 2 commits January 17, 2026 15:41

Nesting calculation by Prism

8c9681b

Avoid prism-1.8.0 used in test, use latest version on github

c4882f3

tompng force-pushed the ripper_to_prism_nesting branch from c3a31b8 to c4882f3 Compare January 17, 2026 06:42

st0012 reviewed Jan 18, 2026

View reviewed changes

tompng added 2 commits January 18, 2026 23:51

Fix open heredoc sorting

261649e

Rename old variable open_token to open_elem

a10f35b

It was a token, but changed to NestingParser::NestingElement

st0012 approved these changes Jan 18, 2026

View reviewed changes

Add detailed comment about NestingParse internal

eec5fa8

tompng force-pushed the ripper_to_prism_nesting branch from 0acd285 to eec5fa8 Compare January 18, 2026 16:29

tompng merged commit a0e7fba into ruby:master Jan 18, 2026
38 of 40 checks passed

tompng deleted the ripper_to_prism_nesting branch January 18, 2026 16:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Nesting analysis using Prism #1092

Nesting analysis using Prism #1092

tompng commented Apr 11, 2025 •

edited

Loading

Uh oh!

st0012 left a comment

Uh oh!

st0012 Jan 16, 2026

Uh oh!

tompng Jan 16, 2026

Uh oh!

st0012 commented Jan 16, 2026

Uh oh!

st0012 Jan 18, 2026

Uh oh!

tompng Jan 18, 2026

Uh oh!

Uh oh!

Uh oh!

st0012 Jan 18, 2026

Uh oh!

tompng Jan 18, 2026

Uh oh!

st0012 Jan 18, 2026

Uh oh!

tompng Jan 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	@heredocs[line_index]&.sort_by { \|_node, (_line, col)\| col }&.reverse_each do \|elem\|
	@heredocs[line_index]&.sort_by { \|elem\| elem.pos[1] }&.reverse_each do \|elem\|

	# Heredocs appears in the ordef B,A,D,C in syntax tree, but should be processed in A,B,C,D order.
	# Heredocs appears in the order of B,A,D,C in syntax tree, but should be processed in A,B,C,D order.

Nesting analysis using Prism #1092

Nesting analysis using Prism #1092

Conversation

tompng commented Apr 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Original nesting calculation with Ripper

New nesting calculation with Prism

Gemfile

Uh oh!

st0012 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

st0012 commented Jan 16, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tompng commented Apr 11, 2025 •

edited

Loading