luxem 0.0.2

Looks like

{
	name: luxem,
	*TODO complete color selection*
	background: [0.21568, 0.21568, 0.21568],
	groups: {
	    value: [
	        luxem-object,
	        luxem-array,
	        luxem-primitive,
	    ],
	},
	root: value,
	types: [
		{
		    id: luxem-object,
			name: "Luxem Object",
			back: [(data-record) data],
			middle: [
			    (record) {
			        key: data,
			        tag: value,
			    }
            ],
			front: [
				(mark) "{",
				(record) {
					key: data,
					infix: [ (mark) ":" ],
					separator: [ (mark) "," ],
				},
				(mark) "}",
			],
		},
		{
		    id: luxem-array,
			name: "Luxem Array",
			back: [(data-array) data],
			middle: [
			    (array) {
			        key: data,
			        tag: value,
			    }
            ],
			front: [
				(mark) "[",
				(array) {
					key: data,
					separator: [ (mark) "," ],
				},
				(mark) "]",
			],
		},
		{
		    id: luxem-primitive,
			name: "Luxem Primitive",
			back: [(data-primitive) data],
			middle: [(primitive) data],
			front: [
				(primitive) data,
			],
		}
	]
}

What is luxem?

luxem is a specification for serializing structured data.

luxem is similar to JSON. The main differences are:

You can specify a type using (typename) before any value. Ex: (direction) up.
You can have a , after the final element in an object or array.
Quotes are optional for simple strings (strings containing no spaces and no ambiguous symbols).
The document is an array with implicit (excluded) [] delimiters.
Comments (written as *comment text*) can be placed anywhere whitespace is.

All documents should be UTF-8 with 0x0A line endings (linux-style).

No basic types are defined in the parsing specification, but the following should be used as a guideline for minimum data type support:

bool true|false
int -?[0-9]+
dec -?[0-9]+(\.[0-9]+)?
string
ascii16 ([a-p][a-p])*

ascii16 is a binary encoding that is both ugly and easy to parse, using the first 16 characters of the alphabet.

Why?

In general, I think JSON is excellent. XML is bloated (header boiler-plate, number of reserved symbols) and inconsice, with multiple ways to represent the same data, and formatting can corrupt the data in the document. YAML is bloated, fragile, and inconsistent, mixing human-readable text with arcane symbolic sequences.

However, several JSON use cases are very difficult or impossible:

Serializing and deserializing polymorphic data

An object key-value pair could be used to identify an object type, but, since objects elements are unordered, a validated document could have the type-key occur after the object body, preventing efficient streaming.

An array can be used to guarantee the type identifier occurs before the body, but this is visually indistinguishable from other array uses and requires following a non-standard convention.
Distinguishing enumeration fields from free-entry strings
Formatting binary data
Writing a compliant, full parser

The numeric notation in the form 4e10 has limited value and complicates the parser. null must be accepted everywhere in the document, regardless of the data domain. UTF-16 escapes must be understood and converted.
Documenting templates

Workarounds involving ignored strings cause a multitude of problems, including increasing the likelihood of key collisions in objects, difficulty to use in array contexts, and requiring in-application comment filtering.

Workarounds involving preprocessors to remove comments increase processing complexity, software dependencies, and (not always accessible) pipeline changes, and still result in non-standard JSON files.

Implementations

luxem-c

This is a barebones C implementation, with no type translation or structure generation support.
luxem-python

The Python implementation wraps luxem-c and adds support for type translations and structure generation.
luxem-cxx

This is roughly equivalent to the Python version, with a C++ boxed-type implementation for building structures.
luxemj

Luxemj can only deserialize. It is based on the pidgoon parser generator and is fairly slow, but supports annotation-based structure deserialization.

Cool Tricks

Specifying a file type

(favnum v1.1.0) [
	2, 17, 11, 9, 2, 23
]

Merging sample sets

{x: 7, y: 3, weight: 1.29867},
{x: 1, y: -20, weight: 0.24234},

{x: -19, y: -22, weight: 0.33011},

=

{x: 7, y: 3, weight: 1.29867},
{x: 1, y: -20, weight: 0.24234},
{x: -19, y: -22, weight: 0.33011},

All three of the above are valid documents.

Minimalism

9, 2

Specification

The Pidgoon specification here should be treated as authoritative.

All documents should be UTF-8 with 0x0A line endings (linux-style).

The name luxem is always lowercase.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.gitignore		.gitignore
license.txt		license.txt
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

luxem 0.0.2

What is luxem?

Why?

Implementations

Cool Tricks

Specifying a file type

Merging sample sets

Minimalism

Specification

About

Releases

Packages

License

rendaw/luxem

Folders and files

Latest commit

History

Repository files navigation

luxem 0.0.2

What is luxem?

Why?

Implementations

Cool Tricks

Specifying a file type

Merging sample sets

Minimalism

Specification

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages