-
-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Notation for inline auxiliary for any kind of character set #177
Comments
I'm torn on this one. In a way the base vs auxiliary is a very binary distinction, and as you mention, it could be extended to more than just the characters of base. However, adding more implicit notation seems like it will be less clear and less simple to author. That said, this would be pretty neat. What if any auxiliary chars would be in parenthesis (I think that's not interfering with yaml parsing, but would need custom parsing all yaml strings)?:
Food for thought. Also, I wonder if "extended" makes more sense as a term in the docs and CLI parameters. Like checking for basic language support vs checking for extended language support. |
Also, for now we're not set on what kind of requirement the currency/numerals/punctuation) for — we talked about them either as opt-in or auxiliary level requirements, but having this more nuanced notation might open the door to also having some core currency/numerals/punctuation as base level required. |
So far, I have actually managed without it. See #155 We could use this notation to distinguish the Standard and Alternative notions as described here: https://en.wikipedia.org/wiki/Quotation_mark |
Why is putting them in auxiliary not an option? |
We would not be able to say whether it is punctuation, currency, numeral or character.
d
…On Fri, Aug 30, 2024 at 14:31, Denis Moyogo Jacquerye ***@***.***(mailto:On Fri, Aug 30, 2024 at 14:31, Denis Moyogo Jacquerye <<a href=)> wrote:
Why is putting them in auxiliary not an option?
—
Reply to this email directly, [view it on GitHub](#177 (comment)), or [unsubscribe](https://github.com/notifications/unsubscribe-auth/AADWQY7PBZVHSV2UYXF6JXDZUBQ2HAVCNFSM6AAAAABNF3F4QSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMRRGA4TGNZRGI).
You are receiving this because you were assigned.Message ID: ***@***.***>
|
Unicode character categories would help but there may be a few exceptions where a character category doesn't match its use in a language orthography system I guess. |
I suppose this very much has pros and cons, regardless of what such an implementation would look like. Either is conceptually nice, having and not having an
Yes, I was thinking about this as well, and had the same reservation. Firstly, it's just less distinct, but secondly, I too saw cases where e.g. modifier characters or e.g. apostrophe-like symbols may be auxiliary, and it is unclear if they are punctuation or character. What do we think of the above proposed |
There are situations when we want to list auxiliary characters for other kinds of things (e.g. punctuation, numerals), perhaps it would be better to have an inline notation for optional/auxiliary characters that could be used in any list of characters.
Instead of:
we could have the following (or use different escape character):
The text was updated successfully, but these errors were encountered: