
Purpose of this repository #81

Closed · pacak opened this issue Jul 14, 2023 · 9 comments

pacak (Contributor) commented Jul 14, 2023

What is the purpose of this repository?

I guess it's a continuation of #80 and #77.

epage (Contributor) commented Jul 18, 2023

I would classify this as a broadening of those: it doesn't address those issues directly but a more fundamental aspect of the conversation.

The name "rosetta" was chosen to convey a comparative analysis. In particular, my focus for the rosetta repos I maintain is on example code and metrics. This is in part because it would be impractical for me to keep up with what is going on with each library across the different repos to do a more thorough analysis. Anything past the most basic steps (data-driven OsStr handling, the parser design document) is more born out of frustration with people overlooking fundamental issues.

pacak (Contributor, Author) commented Jul 18, 2023

So, comparative analysis. There are several levels at which we can compare parsers as far as parsing is concerned:

  • the user passes all the correct flags and values - this is what is currently tested, plus the non-UTF-8 stuff
  • the user passes correct flags but invalid values - this is not tested, since the xflags example was broken
  • the user passes incorrect flags or incorrect combinations of flags - I don't think this is tested either

Parsing only the happy path is easy. At my $dayjob we have a test task where an applicant needs to write a simple console app that is passed a file name and an optional -r flag. What I usually see is the equivalent of collecting everything into a vector, checking whether -r is present, and using whatever is left as the file name.
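
Roughly this sort of thing - a sketch of that naive approach, reconstructed here for illustration (not an actual submission):

// Collect everything into a vector, fish out `-r`, and treat whatever is left
// as the file name. Happy path only: a typo like `-R` or any stray flag
// silently becomes the "file name".
fn main() {
    let mut args: Vec<String> = std::env::args().skip(1).collect();
    let recursive = match args.iter().position(|a| a == "-r") {
        Some(pos) => {
            args.remove(pos);
            true
        }
        None => false,
    };
    let file = args.pop().expect("expected a file name");
    println!("file: {file}, recursive: {recursive}");
}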

Making sure that an app can only be used correctly comes at a complexity cost which needs to live somewhere - in the parser or in the user code. Should the samples show what this complexity looks like? Invalid or mutually exclusive options, or groups of them, for example.

I've seen code like the following many times and have fixed bugs caused by it:

// Without extra annotations the parser would happily take `--do-this` and
// `--do-that` at the same time. This happens when the command line is generated
// from chunks in some script; what actually gets executed depends on how the
// options are consumed. They can even be consumed in a different order
// depending on the code path taken - seen that too.
struct Options {
    do_this: bool,
    do_that: bool,
    do_something_else: bool,
}

fn main() {
    ...
    // and this explodes as soon as new options are added - seen that too :)
    if opts.do_this {
        do_this()
    } else if opts.do_that {
        do_that()
    } else if opts.do_something_else {
        do_something_else()
    } else {
        // optionally present; often the else branch is missing entirely
        unreachable!()
    }
}
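
One way to pay that complexity cost in user code is to make the invalid states unrepresentable - a sketch of the general technique (my illustration, not code from any of the compared libraries):

// Encode the mutually exclusive modes as an enum: `--do-this --do-that`
// simply cannot be represented, and `match` forces every mode to be handled,
// so adding a new option cannot silently fall through.
enum Mode {
    DoThis,
    DoThat,
    DoSomethingElse,
}

// stubs, standing in for the real actions above
fn do_this() {}
fn do_that() {}
fn do_something_else() {}

fn run(mode: Mode) {
    match mode {
        Mode::DoThis => do_this(),
        Mode::DoThat => do_that(),
        Mode::DoSomethingElse => do_something_else(),
    }
}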

Then there are design quirks. Say we add a bool switch --foo: should the parser accept or reject --number --foo 100?

epage (Contributor) commented Jul 18, 2023

I think I'm missing what you meant to get at with the code samples and the discussion of issues at your $dayjob.

pacak (Contributor, Author) commented Jul 18, 2023

It is very easy to make a parser if you only consider situations where it will be given correct data, and a lot of applicants do exactly that to save time. It is harder to make a parser that will accept correct usage and reject incorrect usage. From the user's point of view, rejecting incorrect usage is just as important as accepting correct usage.

epage (Contributor) commented Jul 18, 2023

How is that tied into the purpose of this repository?

pacak (Contributor, Author) commented Jul 18, 2023

Is this repo comparing just the happy path, or is it comparing parsers in a state you would actually want to use in your app?

epage (Contributor) commented Jul 18, 2023

It is doing an automatically generated metric-based comparison.

There are a lot of different design trade-offs, and we leave it to people to dig in and decide between them. In a lot of cases lexopt is fully valid to use despite having almost nothing to help you with correctness. In fact, I'm going to propose using something like lexopt within libtest and as a basis in general for the test ecosystem in Rust.
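
For context, lexopt only tokenizes the command line; rejecting unexpected flags and checking that required values are present is entirely the caller's job. A minimal sketch of the file-name/-r example from above, using lexopt's documented API (my illustration, not code from this repo):

use lexopt::prelude::*;

fn main() -> Result<(), lexopt::Error> {
    let mut recursive = false;
    let mut file = None;
    let mut parser = lexopt::Parser::from_env();
    while let Some(arg) = parser.next()? {
        match arg {
            Short('r') => recursive = true,
            // the first free-standing argument becomes the file name
            Value(v) if file.is_none() => file = Some(v),
            // everything else is rejected explicitly - by the caller, not the library
            _ => return Err(arg.unexpected()),
        }
    }
    let file = file.ok_or("expected a file name")?;
    println!("file: {file:?}, recursive: {recursive}");
    Ok(())
}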

pacak (Contributor, Author) commented Jul 18, 2023

Well, it's not comparing apples to apples then. Parsers have different behavior.

I'm not saying lexopt is a bad library because it does not help with correctness; I'm trying to say that users will have to take care of that part themselves - in libtest, for example.

Without making sure the parsers behave identically, the figures on the project's main page are a bit misleading.

Checking whether sample inputs are handled correctly (or incorrectly) could be done fully automatically too.
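
For example, a hypothetical integration test (the binary name and flags here are made up) that runs a sample with a known-bad invocation and asserts that it fails:

use std::process::Command;

#[test]
fn rejects_conflicting_flags() {
    // CARGO_BIN_EXE_<name> is set by Cargo when building integration tests;
    // `sample`, `--do-this` and `--do-that` are hypothetical.
    let status = Command::new(env!("CARGO_BIN_EXE_sample"))
        .args(["--do-this", "--do-that"])
        .status()
        .expect("failed to run the sample binary");
    assert!(!status.success(), "conflicting flags should be rejected");
}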

Anyway, feel free to close the ticket if you are okay with current state.

epage (Contributor) commented Jul 18, 2023

Trust me, as the maintainer of clap, I fully understand that the numbers don't stand on their own - like comparing lexopt's build time with clap's. There is a world of difference in features between them.

epage closed this as not planned on Jul 18, 2023.