Fix grammar for block comments#2191
Merged
traviscross merged 2 commits intorust-lang:masterfrom Mar 2, 2026
Merged
Conversation
traviscross
reviewed
Mar 1, 2026
src/comments.md
Outdated
Comment on lines
+20
to
+23
| `/**/` | ||
| | `/***/` | ||
| | `/*` | ||
| ^ | ||
| ( ~[`*` `!`] | `**` | BLOCK_COMMENT_OR_DOC ) | ||
| ( BLOCK_COMMENT_OR_DOC | ~`*/` )* | ||
| ^ ( BLOCK_COMMENT_OR_DOC | ~`*/` )* |
Contributor
There was a problem hiding this comment.
With the changes, the first two branches aren't needed, right?
Separately, we might as well avoid using ~ here on this multiple character lookahead.
Contributor
Author
There was a problem hiding this comment.
Indeed, it is no longer needed, I have removed it and the use of ~.
This fixes an issue with the block comment grammar with nested block comments. For example, this would fail to match `/*/* test */*/` because of the following: The open slash of the inner block comment matches `~[`*` `!`]` which then misinterprets the rest of the inner block comment as being normal characters, and prevents the nesting from working correctly. The original intent with this formulation was to prevent it from matching an inner or outer block doc comment. This is no longer needed because we are now defining the order with the COMMENT production, and the doc comments come before block comments. This also changes the order of BLOCK_COMMENT_OR_DOC, but that is not strictly necessary because once inside a block comment, everything is comment text. rustc does not expose nested comments as being distinct. I mainly ordered them to be consistent with COMMENT (and in case I forgot anything). The implementation for this is at [`Cursor::block_comment`](https://github.com/rust-lang/rust/blob/859951e3c7c9d0322c39bad49221937455bdffcd/compiler/rustc_lexer/src/lib.rs#L782-L817).
92d669c to
37ed14b
Compare
In the grammar rule for `BLOCK_COMMENT`, we indent the rule expression for the content of the block comment. We had put the cut ahead of this rule, but the resulting indentation makes it seem as though the cut is scoped to this subpart when it in fact applies to all remaining parts of the rule. The potential confusion here is particularly subtle because the rule for the content of a block comment is followed by a Kleene star -- if the cut only applied to this subpart, it would be superfluous. Let's move the cut to after the opening `/*` to avoid this.
traviscross
approved these changes
Mar 2, 2026
Contributor
|
Looks good; thanks. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This fixes an issue with the block comment grammar with nested block comments. For example, this would fail to match
/*/* test */*/because of the following:The open slash of the inner block comment matches
~[*!]which then misinterprets the rest of the inner block comment as being normal characters, and prevents the nesting from working correctly.The original intent with this formulation was to prevent it from matching an inner or outer block doc comment. This is no longer needed because we are now defining the order with the COMMENT production, and the doc comments come before block comments.
This also changes the order of BLOCK_COMMENT_OR_DOC, but that is not strictly necessary because once inside a block comment, everything is comment text. rustc does not expose nested comments as being distinct. I mainly ordered them to be consistent with COMMENT (and in case I forgot anything).
The implementation for this is at
Cursor::block_comment.