Simplify Comment Parsing Code #3399

InsertCreativityHere · 2025-01-21T20:38:54Z

This PR simplifies the implementation of parseDocComment, which is the part of libSlice responsible for converting a raw string comment into a structured DocComment.

It's not meant to be a perfect cleanup, just a way to simplify some things before I add the last validation logic and fix the issues related to how @links are formatted.

I used my compiler-comparison script to check, and the generated code is identical before and after my changes here.

InsertCreativityHere · 2025-01-21T20:39:34Z

cpp/src/Slice/Parser.cpp

@@ -648,48 +648,44 @@ namespace
        if (stripMarkup)
        {
            // Strip HTML markup.


Switching to a while loop lets us avoid a redundant check for pos != string::npos.
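
For illustration, here is a minimal sketch of the loop shape described above. It is not the actual code in Parser.cpp: the function name `stripAngleSpans` is hypothetical, and only the `erase` call is taken from the diff. The point is that `find` and the `npos` test live together in the loop condition, so no second check is needed inside the body or at the bottom of a `do`/`while`.

```cpp
#include <string>

// Hypothetical helper for illustration only (not the libSlice implementation).
// Removes every "<...>" span from the comment.
std::string stripAngleSpans(std::string comment)
{
    std::string::size_type pos = 0;
    // The loop condition is the only place where pos is compared to npos.
    while ((pos = comment.find('<', pos)) != std::string::npos)
    {
        const std::string::size_type endpos = comment.find('>', pos);
        if (endpos == std::string::npos)
        {
            break; // Unterminated '<': leave the rest of the comment untouched.
        }
        comment.erase(pos, endpos - pos + 1);
    }
    return comment;
}
```

With this sketch, `stripAngleSpans("a <b>bold</b> word")` yields `a bold word`.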


InsertCreativityHere · 2025-01-21T20:39:43Z

cpp/src/Slice/Parser.cpp

-            } while (pos != string::npos);
+                comment.erase(pos, endpos - pos + 1);
+            }
+        }



The XML escaping logic was nested inside if (stripMarkup), which doesn't seem right to me.
XML escaping should be unrelated to markup stripping.

InsertCreativityHere · 2025-01-21T20:40:13Z

cpp/src/Slice/Parser.cpp

@@ -742,35 +738,27 @@ namespace
        return result;
    }



This function took a bool namedTag parameter which was always hard-coded.
And even worse, it took a const string& name which was only used some of the time based on the bool.
So I split it into 2 functions: parseNamedCommentLine and parseCommentLine. No more bool.
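
A rough sketch of what such a split can look like. The signatures and bodies below are assumptions for illustration (hence the `Sketch` suffix), not the real functions in Parser.cpp. The named variant simply builds on the plain one, so neither needs a flag parameter.

```cpp
#include <string>

// Hypothetical signatures; the real parseCommentLine may differ.
// Matches a line such as "@return The result." and extracts the text after the tag.
bool parseCommentLineSketch(const std::string& line, const std::string& tag, std::string& doc)
{
    if (line.compare(0, tag.size(), tag) != 0)
    {
        return false; // The line does not start with this tag.
    }
    if (line.size() > tag.size() && line[tag.size()] != ' ' && line[tag.size()] != '\t')
    {
        return false; // Longer tag with the same prefix, e.g. "@returns".
    }
    const std::string::size_type start = line.find_first_not_of(" \t", tag.size());
    doc = (start == std::string::npos) ? std::string() : line.substr(start);
    return true;
}

// Matches a line such as "@param count The number of items." and extracts
// both the name that follows the tag and the remaining text.
bool parseNamedCommentLineSketch(
    const std::string& line,
    const std::string& tag,
    std::string& name,
    std::string& doc)
{
    std::string rest;
    if (!parseCommentLineSketch(line, tag, rest) || rest.empty())
    {
        return false; // Not this tag, or the name is missing.
    }
    const std::string::size_type nameEnd = rest.find_first_of(" \t");
    name = rest.substr(0, nameEnd);
    doc.clear();
    if (nameEnd != std::string::npos)
    {
        const std::string::size_type docStart = rest.find_first_not_of(" \t", nameEnd);
        if (docStart != std::string::npos)
        {
            doc = rest.substr(docStart);
        }
    }
    return true;
}
```

Callers that handle unnamed tags never have to pass a dummy name, and callers that handle named tags always get one back.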

InsertCreativityHere · 2025-01-21T20:40:31Z

cpp/src/Slice/Parser.cpp

@@ -803,94 +807,69 @@ Slice::Contained::parseDocComment(function<string(string, string)> linkFormatter

    DocCommentPtr comment = make_shared<DocComment>();


All the remaining code below this is where we parse the actual tags (@param etc) into their respective fields on DocComment. Before, we had an entire state machine, but this was overkill.

I removed all this logic, and replaced it with a StringList* currentSection,
which points to whichever part of the doc-comment we're currently parsing.
At the beginning it points to the overview, but as we encounter tags, we re-point it appropriately.
I find this approach simpler than using a state-machine, and switching on the state to determine where we're writing to.
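
To make the idea concrete, here is a simplified sketch of the pointer-based approach. `DocCommentSketch` is a cut-down stand-in for DocComment with only three fields, the tag matching is deliberately naive, and `StringList` is assumed to be a list of strings. None of this is the actual code in the PR.

```cpp
#include <list>
#include <map>
#include <string>

using StringList = std::list<std::string>;

// Simplified stand-in for DocComment; the real class has more fields.
struct DocCommentSketch
{
    StringList overview;
    std::map<std::string, StringList> parameters;
    StringList returns;
};

DocCommentSketch parseLinesSketch(const StringList& lines)
{
    DocCommentSketch comment;
    // Untagged lines go to whichever section we saw last; initially the overview.
    StringList* currentSection = &comment.overview;

    for (const std::string& line : lines)
    {
        if (line.rfind("@param ", 0) == 0)
        {
            const std::string::size_type nameEnd = line.find(' ', 7);
            const std::string name =
                line.substr(7, nameEnd == std::string::npos ? std::string::npos : nameEnd - 7);
            // Pointers to std::map values stay valid when other entries are inserted.
            currentSection = &comment.parameters[name];
            if (nameEnd != std::string::npos)
            {
                currentSection->push_back(line.substr(nameEnd + 1));
            }
        }
        else if (line.rfind("@return ", 0) == 0)
        {
            currentSection = &comment.returns;
            currentSection->push_back(line.substr(8));
        }
        else
        {
            // Continuation line: no switch on a state, just append.
            currentSection->push_back(line);
        }
    }
    return comment;
}
```

The continuation branch is where the simplification shows: it never needs to know which tag is active, because the pointer already encodes that.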

pepone · 2025-01-22T10:29:34Z

cpp/src/Slice/Parser.cpp

@@ -648,48 +648,44 @@ namespace
        if (stripMarkup)
        {
            // Strip HTML markup.


The code assumes anything between < and > is an html tag.

What about something like:

a < b or x > y, won't this strip the comment to something like a y?

It is not a new problem, but would be nice to fix now that we are looking at this code.

We can fix in a separate PR.

InsertCreativityHere · 2025-01-22T15:28:32Z

I agree, it's also a problem I noticed, but didn't want to fix in this PR.
At first I thought about making the check fail if there was whitespace within the < ... >...
but I wasn't sure enough that that was a safe assumption.
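
For reference, a small sketch that contrasts the current behavior with the whitespace idea. Both function names are hypothetical and neither is the code in Parser.cpp. The second function skips any `<...>` span that contains whitespace, which protects `a < b or x > y` but also leaves tags with attributes in place, since those contain whitespace too.

```cpp
#include <string>

// Hypothetical: strips every "<...>" span, as the existing logic is described.
std::string stripAllAngleSpans(std::string comment)
{
    std::string::size_type pos = 0;
    while ((pos = comment.find('<', pos)) != std::string::npos)
    {
        const std::string::size_type endpos = comment.find('>', pos);
        if (endpos == std::string::npos)
        {
            break;
        }
        comment.erase(pos, endpos - pos + 1);
    }
    return comment;
}

// Hypothetical: only strips spans that contain no whitespace.
std::string stripSpansWithoutWhitespace(std::string comment)
{
    std::string::size_type pos = 0;
    while ((pos = comment.find('<', pos)) != std::string::npos)
    {
        const std::string::size_type endpos = comment.find('>', pos);
        if (endpos == std::string::npos)
        {
            break;
        }
        // npos compares greater than any valid index, so "not found" is false here.
        const bool hasWhitespace = comment.find_first_of(" \t\r\n", pos) < endpos;
        if (hasWhitespace)
        {
            ++pos; // Treat this '<' as literal text and keep scanning after it.
        }
        else
        {
            comment.erase(pos, endpos - pos + 1);
        }
    }
    return comment;
}
```

In this sketch, `stripAllAngleSpans("a < b or x > y")` returns `a  y` (with two spaces), while `stripSpansWithoutWhitespace` returns the input unchanged. On the other hand, `stripSpansWithoutWhitespace("<a href=\"x\">link</a>")` returns `<a href="x">link`, so the heuristic would need more work before it could replace the current check.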

InsertCreativityHere added 2 commits January 21, 2025 14:09

Simplify the comment parsing code.

abbdff6

We only need 'push_back' once at the end.

a3fa9c7

InsertCreativityHere requested review from pepone and bernardnormier January 21, 2025 20:42

pepone approved these changes Jan 22, 2025

bernardnormier requested review from externl and removed request for bernardnormier January 22, 2025 15:51

InsertCreativityHere merged commit 1dd0241 into zeroc-ice:main Jan 22, 2025
24 checks passed
