Support running btests on Windows #80

timwoj · 2022-12-12T18:45:32Z

This PR covers the majority of the changes that need to happen to the btest script in order to run tests on Windows. There are a few known issues that I'll go into at the end. The big differences here are:

Switching to the multiprocess module instead of the built-in multiprocessing module, since the default one has some problems with pickling our internal types.
Changing executing processes on Windows to wrap them in a simple bash script, which lets them run correctly under msys2 (the bash environment installed with Git on Windows).
Passing data from the test manager down into the child processes via a proxied dictionary. multiprocess on Windows doesn't seem to like our use of globals here and complains that it can't find objects. The proxied dictionary works around this, and is probably the more correct way of doing this anyway.

Known issues:

Running tests against debug builds hits an assertion down in the Windows OS libraries. I'm pretty sure this is a libkqueue bug, and I opened mheily/libkqueue#151 ("Closing a file descriptor on Windows might cause an assertion") to have them investigate. The easy workaround is to not close the event_queue file descriptor in iosource_mgr, and that can be used for testing.
Shutting down a run with the -j flag with ctrl-c doesn't work right. There's something different about signal masking on Windows. I spent a day trying to figure this one out and haven't solved it yet.
There's a problem with pickling our internal types using the dill library (which multiprocess uses): it changes the internal ID stored for the types. run_cmdseq previously used isinstance to check whether a command being run was a CmdSeq, but because the type IDs are different it was returning false (on all platforms). There's currently a workaround where we just check the type name, but it's likely possible to fix this by implementing custom pickling methods for those types.

These changes will require some modifications to the Zeek tree to work right as well, but this PR is independent of those.

ckreibich

This must have been pretty darn painful. Just a few quick comments here as you're finalizing this.


awelzel

I've made a bit of a pass.

awelzel · 2022-12-15T13:55:32Z

> Switching to the multiprocess module instead of the built-in multiprocessing module, since the default one has some problems with pickling our internal types.

From the naive side: Is there something we could fix or work around in the types involved? Adding a non-stdlib dependency when there wasn't one before seems unfortunate to me.

timwoj · 2022-12-16T23:33:35Z

I rebased and squashed down all of the fixup commits. I'm down to just two btests failing:

tests.diag-file: This one looks like the .stdout and .stderr files get reset between commands (meaning each time a bash script is fired up), which shouldn't be happening. This causes the output from one of the internal btest commands to get lost, which the test requires for success.
tests.environment: This one fails because the test calls pwd in the middle of it and compares it to the testbase value read from btest.cfg. On Windows, pwd will return a POSIX-style path (so /c/...) whereas btest currently uses pseudo-Windows-style paths internally (C:/...). It's making me want to switch everything over to use POSIX-style paths since it would help with some consistency between POSIX and Windows in other places, but the last time I tried to do that everything went south quickly.
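To make the path mismatch in tests.environment concrete, here is a minimal sketch of converting a pseudo-Windows-style path (C:/...) into the POSIX-style form that pwd prints under msys2 (/c/...). The helper name `to_posix_style` is hypothetical and is not taken from btest; it only illustrates the kind of conversion being discussed.

```python
import re


def to_posix_style(path: str) -> str:
    """Convert 'C:/foo' or 'C:\\foo' to '/c/foo'.

    Hypothetical helper for illustration only. Backslashes are always
    treated as separators, which is an assumption that suits Windows
    input but not POSIX file names that contain a literal backslash.
    """
    normalized = path.replace("\\", "/")
    match = re.match(r"^([A-Za-z]):(/.*)?$", normalized)
    if match is None:
        # No drive letter: nothing else to convert.
        return normalized
    drive = match.group(1).lower()
    rest = match.group(2) or "/"
    return f"/{drive}{rest}"


if __name__ == "__main__":
    print(to_posix_style("C:/Users/me/btest"))
```

Applying a conversion like this on only one side of a comparison is enough to make the two spellings of the same directory compare equal.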

I'll look over the rest of the comments above after I get back from the holiday break.

timwoj · 2023-01-04T23:15:02Z

> From the naive side: Is there something we fix/work-around in the types involved? Adding non-stdlib dependency when there wasn't one before seems unfortunate to me.

Prior to switching over to the other library, I did attempt to fix the types it was failing on. It turns into a game of whack-a-mole pretty fast, with one type after another failing for varying reasons. I got fairly far into it and then ran into a type that I couldn't figure out how to fix, and figured there had to be a better way to go about it than manually fixing every type and hoping I didn't miss something.

Using the 'spawn' method for multiprocessing causes the global state to get lost when moving from the parent into the child processes. Rebuilding it by looping over a subset and reinserting them into globals() ensures that they exist.
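The re-insertion step can be sketched roughly as follows. This is not btest's actual code: `SHARED_NAMES`, `restore_globals`, `worker`, and the `TestBase`/`Options` keys are illustrative assumptions, and a plain dict stands in for the proxied dictionary (which could come from something like `multiprocessing.Manager().dict()`).

```python
# Hypothetical subset of module-level names that child processes need.
SHARED_NAMES = ("TestBase", "Options")


def restore_globals(shared):
    """Child side: copy selected entries of `shared` into module globals.

    `shared` is any mapping; in a real run it would be the proxied
    dictionary handed down from the parent process.
    """
    for name in SHARED_NAMES:
        if name in shared:
            globals()[name] = shared[name]


def worker(shared):
    """Entry point of a child: rebuild the globals, then use them as before."""
    restore_globals(shared)
    # TestBase and Options now exist as module globals again.
    return f"{TestBase}:{Options['jobs']}"


if __name__ == "__main__":
    print(worker({"TestBase": "/tmp/tests", "Options": {"jobs": 4}}))
```

Restricting the loop to a fixed subset keeps unrelated keys in the shared mapping from leaking into the module namespace.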

This changes how runSubprocess works on Windows to insert all of the calls within a temporary bash script. This ensures that the entire environment is available when running the processes, which doesn't work when simply calling subprocess.check_call().
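As a rough illustration of the wrapping idea (not the actual runSubprocess implementation), the command can be written into a temporary script that is then handed to bash. The function names below are hypothetical, and the sketch assumes a `bash` executable is on PATH.

```python
import os
import tempfile


def write_wrapper_script(cmd: str, directory: str) -> str:
    """Write `cmd` into a temporary bash script and return the script path."""
    fd, path = tempfile.mkstemp(suffix=".sh", dir=directory, text=True)
    # Force "\n" line endings so bash does not trip over "\r" on Windows.
    with os.fdopen(fd, "w", newline="\n") as script:
        script.write("#!/usr/bin/env bash\n")
        script.write(cmd + "\n")
    return path


def wrapper_argv(script_path: str, bash: str = "bash") -> list:
    """Build the argument list for running the script through bash."""
    return [bash, script_path]


if __name__ == "__main__":
    import subprocess

    with tempfile.TemporaryDirectory() as tmp:
        script = write_wrapper_script("echo hello from bash", tmp)
        subprocess.call(wrapper_argv(script))
```

Running the script through bash, instead of passing the command directly to subprocess, means the shell sets up its own environment before the command executes.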


timwoj force-pushed the topic/timw/windows-support-2 branch 3 times, most recently from abd5f37 to 142cfba Compare December 12, 2022 19:56

ckreibich reviewed Dec 13, 2022

timwoj force-pushed the topic/timw/windows-support-2 branch from 754bc09 to dd84f74 Compare December 13, 2022 21:46

awelzel requested changes Dec 15, 2022

timwoj force-pushed the topic/timw/windows-support-2 branch 2 times, most recently from 10005c0 to 2338efc Compare December 16, 2022 23:07

timwoj force-pushed the topic/timw/windows-support-2 branch from 2338efc to 6596525 Compare December 16, 2022 23:51

Add some pycharm files to gitignore

9c135f6

timwoj force-pushed the topic/timw/windows-support-2 branch from 6596525 to 17df3c8 Compare January 4, 2023 23:04

timwoj force-pushed the topic/timw/windows-support-2 branch 2 times, most recently from 39e7e54 to 0e4e68d Compare January 4, 2023 23:44

awelzel reviewed Jan 5, 2023

timwoj force-pushed the topic/timw/windows-support-2 branch 7 times, most recently from 096dfca to 3a224d7 Compare January 6, 2023 22:32

bbannier reviewed Jan 9, 2023

timwoj force-pushed the topic/timw/windows-support-2 branch 5 times, most recently from c56c7bc to e56ecd0 Compare January 10, 2023 00:32

timwoj force-pushed the topic/timw/windows-support-2 branch 2 times, most recently from 44d2e48 to 66021f1 Compare January 13, 2023 22:19

timwoj and others added 25 commits January 13, 2023 15:33

Set git's autocrlf option to false when running tests on Github

e85a2b1

If this option isn't set, the Windows runners convert all line endings to \r\n when cloning. This breaks a few of the tests because the comparison sees the wrong line endings.

Rename WSL bash so it doesn't override Git bash for Windows CI builds

4710392

Move outputhandler creation to separate function

3b3df87

Re-indent everything to be under __main__

efd9b41

General minor code cleanup

faf018b

This is mostly suggestions from pycharm, but also includes some comments from when I was tracing through code.

Switch to https://pypi.org/project/multiprocess/ on Windows

45f26c1

The reason for this switch is primarily because the stock multiprocessing library has very poor support for pickling of non-primitive types on Windows.

Add -s/--set command-line argument for overriding config defaults

2d5d1c0

Avoid isinstance() to determine whether a cmd is a CmdSeq

c2661ae

As per the comment, some serializers/deserializers don't reproduce the identical type when deserializing, which makes isinstance() fail.
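The failure mode and the name-based workaround can be reproduced without any pickling at all. In the sketch below, the `CmdSeq` class is a stand-in with made-up fields, `make_lookalike` simulates a deserializer that rebuilds the class as a new object, and `is_cmdseq` is a hypothetical helper showing the type-name check.

```python
class CmdSeq:
    """Stand-in for btest's command-sequence type; the field is illustrative."""

    def __init__(self, cmds):
        self.cmds = cmds


def make_lookalike():
    """Build a second, distinct class that is also named 'CmdSeq'.

    This simulates a deserializer that recreates the class instead of
    referring back to the original class object.
    """
    return type("CmdSeq", (), {"__init__": CmdSeq.__init__})


def is_cmdseq(obj) -> bool:
    """Name-based check that survives the class being recreated."""
    return type(obj).__name__ == "CmdSeq"


if __name__ == "__main__":
    rebuilt = make_lookalike()(["cmd1", "cmd2"])
    print(isinstance(rebuilt, CmdSeq), is_cmdseq(rebuilt))
```

The name check is looser than isinstance(): any class that happens to be called CmdSeq passes, which is one reason custom pickling methods may be the cleaner long-term fix.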

Add method for normalizing paths on both Windows and POSIX

19c94bf

Fix running tests with dot-notation for their name

dbc8157

Use named pipes on Windows since AF_UNIX is not supported

915121c

Move option parsing to a method

01faf6c

Fix an error when attempting to delete the tmp dirs on Windows

641a0a8

Use binascii.crc32 for computing hashes for TEST-SERIALIZE commands

ca04709

Windows has some issue where `hash()` returns different values for the same string in the different child processes. crc32() returns the same values in each.
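A likely explanation, stated here as an assumption, is that Python randomizes string hashes per interpreter process unless PYTHONHASHSEED is set, and spawned children are fresh interpreters. A checksum over the encoded bytes has no such per-process seed. The sketch below uses a hypothetical `stable_hash` helper; how btest applies the value to TEST-SERIALIZE is not shown.

```python
import binascii


def stable_hash(text: str) -> int:
    """Return a CRC-32 of `text` that is identical in every process.

    Unlike the built-in hash(), the result depends only on the bytes of
    the UTF-8 encoded string.
    """
    return binascii.crc32(text.encode("utf-8"))


if __name__ == "__main__":
    # The same tag yields the same number in the parent and in any child.
    print(stable_hash("serialize-tag"))
```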

Force output to use unix-style line endings for consistency

59abbf5

Return error if trying to use Sphinx features on Windows

2e4a6fd

Add testing script to check for Windows, use it to disable some tests

df6c1be

Fix diff-remove-abspath to handle Windows drive letters

f63ffa1

Fix strip-test-base script to handle Windows paths correctly

3dd77f9

Fix tests.multiple-baseline-dirs btest to use pathsep

5f3704b

Open .stdout and .stderr in append mode

7d9e493

This fixes a problem on Windows where multiple TEST-EXEC statements in a test could cause those files to be overwritten by subsequent TEST-EXECs, causing failures.
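The difference between the two open modes can be shown with a small simulation. The `run_commands` function below is a hypothetical stand-in for several TEST-EXEC commands that each reopen the same output file; it is not btest code.

```python
import os
import tempfile


def run_commands(outputs, path, mode):
    """Simulate commands that each open `path` with `mode` and write output.

    Returns the final file contents so the two modes can be compared.
    """
    for text in outputs:
        with open(path, mode, newline="\n") as stream:
            stream.write(text + "\n")
    with open(path, newline="") as stream:
        return stream.read()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # "w" truncates on every open, so only the last command's output survives.
        print(repr(run_commands(["first", "second"], os.path.join(tmp, "w.stdout"), "w")))
        # "a" keeps earlier output and appends to it.
        print(repr(run_commands(["first", "second"], os.path.join(tmp, "a.stdout"), "a")))
```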

Add tests.environment-windows btest

f256b5a

The original tests.environment btest doesn't work on Windows due to some path differences in the output. This adds a new test that does the same things but performs some additional conversions in the test script itself to remove those differences.

Add Windows Caveats to README, add bash.exe check at startup

ff90899

timwoj force-pushed the topic/timw/windows-support-2 branch from 66021f1 to ff90899 Compare January 13, 2023 22:38

timwoj merged commit f313d13 into master Jan 13, 2023

timwoj deleted the topic/timw/windows-support-2 branch January 13, 2023 22:54
