[RFC007] Migrate the typechecker to the new AST - Part I #2121

yannham · 2024-12-09T14:17:23Z

Migrate the typechecker to the new AST (Part I)

Content

After the parser, this PR undertakes the migration of the typechecker to operate on the new AST.

Actually swapping the typechecker isn't done as part of this PR, because this requires even more changes - in particular migrating the parts of NLS that handle the term visitor pattern, and keeping part of the mainline contract equality checking code because it's still used at runtime for contract deduplication.

In the meantime, this PR duplicates the whole typechecker under bytecode::typecheck temporarily, so that we can iterate there.

How to review

Most of those changes are pretty mechanic: we switched type signatures to use components from bytecode::ast::* instead of term::*, and then fixed the myriad of compiler errors. Because we duplicated the code, it's not easy to see the difference with the original version either; but most of the changes should really be boring.

Some parts that might be worth looking at:

all the Traverse mechanic has been moved to a new dedicated top-level module traverse (it was previously defined in term). As for combine::Combine, we added a second version of the trait for arena-allocated values: TraverseAlloc, and implemented it for all the new AST subcomponents.
typechecking records as defined in the new AST is not trivial, because all we have now is a list of path-defined fields, that might overlap in different ways. The machinery to handle record is now implemented in bytecode::typecheck::record. Most of it is a clean implementation of what we did in the parser before RFC007: transform a list of path-defined fields to nested maps. The typechecking part of bytecode::typecheck::record has been mostly taken from typecheck, and hasn't been substantially modified.
While I tried to keep this PR focused on doing the minimal changes so that the repo compiles again, saving more substantial refactoring/cleaning for later, it was just too tempting for typecheck::eq, which was previously complicated by the fact that it needs to handle both typechecking-time and run-time contract equality check. the new bytecode::typecheck::eq has been re-organized around traits instead of free-standing functions, plus other simplifications coming from the fact that runtime contract equality will have to be done entirely differently in the bytecode VM, so we can afford to be less generic here.

All in all, I believe that we'll have a net reduction of LoC once we scrap the original typechecker, because many edge cases just don't happen with the new, simpler AST.

github-actions · 2024-12-20T17:44:04Z

Bencher Report

Branch	rfc007/typechecking
Testbed	ubuntu-latest

Click to view all benchmark results

Benchmark	Latency	nanoseconds (ns)
fibonacci 10	📈 view plot 🚷 view threshold	488,230.00
foldl arrays 50	📈 view plot 🚷 view threshold	1,714,100.00
foldl arrays 500	📈 view plot 🚷 view threshold	6,764,600.00
foldr strings 50	📈 view plot 🚷 view threshold	6,987,500.00
foldr strings 500	📈 view plot 🚷 view threshold	60,716,000.00
generate normal 250	📈 view plot 🚷 view threshold	43,753,000.00
generate normal 50	📈 view plot 🚷 view threshold	1,943,700.00
generate normal unchecked 1000	📈 view plot 🚷 view threshold	3,244,200.00
generate normal unchecked 200	📈 view plot 🚷 view threshold	762,190.00
pidigits 100	📈 view plot 🚷 view threshold	3,167,500.00
pipe normal 20	📈 view plot 🚷 view threshold	1,501,600.00
pipe normal 200	📈 view plot 🚷 view threshold	9,967,800.00
product 30	📈 view plot 🚷 view threshold	851,120.00
scalar 10	📈 view plot 🚷 view threshold	1,544,300.00
sum 30	📈 view plot 🚷 view threshold	834,570.00

🐰 View full continuous benchmarking report in Bencher

jneem

That's a lot of changes! I left a comment mostly just to show that I read something 😉

Seriously, though, I read the parts you mentioned in the description, and skimmed the rest.

jneem · 2024-12-23T10:34:05Z

core/src/bytecode/typecheck/record.rs

+#[derive(Default)]
+pub(super) struct ResolvedRecord<'ast> {
+    /// The static fields of the record.
+    pub stat_fields: IndexMap<LocIdent, ResolvedField<'ast>>,


Maybe it's not important in this context, but in principle there are multiple locations for each field, right? It looks like ResolvedRecord::combine is just choosing one. (The LSP certainly wants them all, but I guess it isn't using this resolution anyway...)

Indeed. For now I'm just maintaining the old behavior, which was to use TermPos::None as soon as the parser had to merge multiple definitions either statically or dynamically, precisely because the RichTerm interface requires that you provide at most one position, which isn't possible. If you look at ResolvedField::pos,it uses a XOR so that we return a defined position iff there is only one position defined; if there is none or there are multiple ones, we bail out and return None.

We can certainly do better in the future, I suppose. Although having a multi-pieces definitions in a statically typed block is currently un-ergonomic enough (since we can't type merging correctly, everything has to be of type Dyn) that I don't expect it to happen much. Still, I expect to re-use the record resolution part for compilation, where you might want all the positions. And one day we might type & more precisely as well.

yannham mentioned this pull request Dec 16, 2024

Dynamic field are ignored by the typechecker #2124

Open

yannham force-pushed the rfc007/typechecking branch from bd062da to 39c022a Compare December 20, 2024 17:31

yannham added 20 commits December 20, 2024 18:34

Copy the code from typechecker, get rid of generic term env

54b4792

Switch to bytecode::ast::typ::*, add a whole cargaison of lifetimes

41ebdd2

Move Traverse in its own module

33faa04

[WIP] Pass missing allocator to conv functions

84bff33

Continue migration of the typechecker to the new AST

df14c2d

More typecheck conversion, typecheck::operation conversion

f7ca049

Introduce record-typechecking-related infrastructure

bd21a76

Some fixes related to term environment populating

43024ae

End of first step for record typechecking

adfb471

Implement Traverse for various new AST components

a45f5ed

Fix various compiler errors

681d6c8

Reset unintentional changes to mainline typecheck module

f4bfdaa

Fix more compiler errors (remaining: tc::eq and tc::error)

1cbb51c

Migrate type equality to the new AST

0eabdec

Fix compilation errors in bytecode::typecheck::error

38376b2

Fix compilation errors in bytecode::typecheck::subtyping

535cf75

Fix compilation errors in bytecode::typecheck::reporting

bb01014

Fix more compiler errors

e7192fd

Update unif to use new TypeEq trait

25ae6e7

Fix compilation errors and warnings

f5fca42

yannham force-pushed the rfc007/typechecking branch from 39c022a to f5fca42 Compare December 20, 2024 17:35

yannham changed the title ~~[RFC007] Migrate the typechecker to the new AST~~ [RFC007] Migrate the typechecker to the new AST - Part I Dec 20, 2024

yannham added 3 commits December 20, 2024 18:52

Fix typo in comment

10da66d

Fix clippy errors

30dfde7

Fix cargo doc warnings

3fb5b22

yannham marked this pull request as ready for review December 20, 2024 19:24

Fix more clippy warnings

5188d54

yannham requested a review from jneem December 20, 2024 19:33

jneem approved these changes Dec 23, 2024

View reviewed changes

yannham added this pull request to the merge queue Dec 23, 2024

Merged via the queue into master with commit 002f0fc Dec 23, 2024
6 checks passed

yannham deleted the rfc007/typechecking branch December 23, 2024 11:07

This was referenced Dec 23, 2024

[RFC007] Cleanup post "typechecker migration, part I" #2129

Merged

[RFC007] Migration of the typechecker - part II #2134

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC007] Migrate the typechecker to the new AST - Part I #2121

[RFC007] Migrate the typechecker to the new AST - Part I #2121

yannham commented Dec 9, 2024 •

edited

Loading

github-actions bot commented Dec 20, 2024 •

edited

Loading

jneem left a comment

jneem Dec 23, 2024

yannham Dec 23, 2024 •

edited

Loading

[RFC007] Migrate the typechecker to the new AST - Part I #2121

[RFC007] Migrate the typechecker to the new AST - Part I #2121

Conversation

yannham commented Dec 9, 2024 • edited Loading

Migrate the typechecker to the new AST (Part I)

Content

How to review

github-actions bot commented Dec 20, 2024 • edited Loading

Bencher Report

jneem left a comment

Choose a reason for hiding this comment

jneem Dec 23, 2024

Choose a reason for hiding this comment

yannham Dec 23, 2024 • edited Loading

Choose a reason for hiding this comment

yannham commented Dec 9, 2024 •

edited

Loading

github-actions bot commented Dec 20, 2024 •

edited

Loading

yannham Dec 23, 2024 •

edited

Loading