Introduce sclang (interpreter) lexer, parser, and compiler regression tests #2751

mossheim · 2017-02-27T04:06:59Z

You may want to see this gist for my original proposal.

~~Also, this PR is apparently 1.4 mil new lines... sorry not sorry. :)~~

I'm going with a zip archive. These tests have to be done manually anyway.

Purpose

This is a major insurance PR. I want to make some pretty significant changes to the lexer/parser to solve some longstanding issues in SC, and obviously don't want to be doing it in the dark. This provides regression tests and related utilities that will allow us to examine the behavior of the lexer/parser/compiler under a wide and thorough (in some cases, complete) set of conditions. It will also be useful for anyone else who wants to make changes to the language in the future. These tests may seem excessive, but actually they are excessive, and that's OK.

Summary of additions

The main additions are (1) LexerParserCompilerTestUtils.sc, which provides the bulk of test code, file formatting, and IO functionality; and (2) TestLPCTestUtils.sc, which provides some basic functionality tests for (1). Whether or not that stays in this repository is up to you, although I like having tests (obviously).

The rest of the files are either test files or validation files. The test files are short bits of code that actually run the tests, and can also be used to generate the validation files. The validation files are the real regression data that future changes will be tested against. The test utilities provide the means to clearly display the list of differences, additions, and subtractions between test and validation outputs.
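The three categories of discrepancy mentioned above (missing from the test output, missing from the validation output, and different) could be computed roughly as follows. This is a minimal Python sketch, not the sclang code in LexerParserCompilerTestUtils.sc; the function name `diff_results` and the sample data are hypothetical.

```python
def diff_results(test, validation):
    """Compare two {input: output} mappings and classify every mismatch."""
    return {
        # Inputs the validation data knows about but the test run did not produce.
        "missing_from_test": sorted(k for k in validation if k not in test),
        # Inputs the test run produced that the validation data lacks.
        "missing_from_validation": sorted(k for k in test if k not in validation),
        # Inputs present in both whose recorded outputs disagree.
        "different": sorted(
            (k, test[k], validation[k])
            for k in test
            if k in validation and test[k] != validation[k]
        ),
    }


# Hypothetical sample data, for illustration only.
test_run = {"aaa": "ok", "aab": "ok", "aac": "changed"}
validation = {"aaa": "ok", "aac": "original", "aad": "ok"}

report = diff_results(test_run, validation)
for name, entries in report.items():
    print(len(entries), name, entries)
```

Keeping the three categories separate makes it clear whether a change altered existing behavior or only changed which inputs were covered.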

Test format

Generally, the operation is formalized as a series of interpretations or bytecodeifications of "all possible strings", where each string is made of a sequence, of fixed length, of symbols from a predefined alphabet. For a given all-possible-strings test, an optional prefix and/or suffix may be provided.

For instance, if the alphabet is all lowercase letters [a-z], the prefix "hello_", the suffix "_world", the sequence length 3, then the tests would cover:

hello_aaa_world
hello_aab_world
hello_aac_world
...
hello_mqx_world
...
hello_zzy_world
hello_zzz_world
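
The enumeration above can be sketched in a few lines. This is a Python illustration under the assumption that strings are generated in lexicographic order; the function name `all_possible_strings` is hypothetical and is not the name used in the sclang utilities.

```python
from itertools import product


def all_possible_strings(alphabet, length, prefix="", suffix=""):
    """Yield prefix + s + suffix for every sequence s of `length` symbols.

    Symbols may be longer than one character (for example "var" or "arg"),
    so each sequence is joined rather than indexed.
    """
    for symbols in product(alphabet, repeat=length):
        yield prefix + "".join(symbols) + suffix


lowercase = [chr(c) for c in range(ord("a"), ord("z") + 1)]
cases = list(all_possible_strings(lowercase, 3, "hello_", "_world"))

print(cases[0])    # hello_aaa_world
print(cases[-1])   # hello_zzz_world
print(len(cases))  # 17576, i.e. 26 ** 3
```

The number of cases is the alphabet size raised to the sequence length, which is why the sequence length has to stay small for large alphabets.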

Generally, two methods are used:

- Compile the string. If that succeeds, interpret it. Record the resulting object using .asString, and if applicable, the resulting object's class.
- Compile the string. If that succeeds, record its FunctionDef's bytecodes.

If any step fails, a unique ID indicating that either a compile-time or run-time error occurred is recorded. In this way, we can compare the output of the parser/lexer/compiler across versions of SC to see what input is legal, what is illegal, and how certain expressions compile (or don't compile).
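
As an analogy only, the two recording methods can be sketched with Python's own `compile` and `eval` standing in for the sclang compiler and interpreter. The marker `!cErr` appears in this thread's debug output; `!rErr` and both function names are hypothetical.

```python
COMPILE_ERROR = "!cErr"  # marker seen in the debug output quoted in this thread
RUNTIME_ERROR = "!rErr"  # hypothetical marker, chosen for this sketch


def record_result(source):
    """Method 1: compile, then interpret; record the value as text plus its class."""
    try:
        code = compile(source, "<test>", "eval")
    except (SyntaxError, ValueError):
        return [COMPILE_ERROR]
    try:
        value = eval(code, {"__builtins__": {}})
    except Exception:
        return [RUNTIME_ERROR]
    return [str(value), type(value).__name__]


def record_bytecode(source):
    """Method 2: compile only; record the raw bytecode as an uppercase hex string."""
    try:
        code = compile(source, "<test>", "eval")
    except (SyntaxError, ValueError):
        return COMPILE_ERROR
    return code.co_code.hex().upper()


print(record_result("1 + 2"))  # ['3', 'int']
print(record_result("1 +"))    # ['!cErr']
print(record_result("1 / 0"))  # ['!rErr']
```

Because a failure is recorded as a value rather than raised, legal and illegal inputs end up in the same table and can be compared across versions in one pass.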

Summary of tests

Lexer:
- strings up to length 3
- prefix/suffix combos for line and block comments, { }, code bits added before and after
Parser:
- strings up to length 8 with an alphabet consisting of parser-important things like "var", "arg", ";"
- a prefix/suffix combo for { }

See test files for full details, or I could add a .md file to this PR to explain what I've already explained here.

TODO

- Thorough review of .sc files
- Is this the right folder? I don't want to run these tests every time in Travis; they take almost an hour, and rightly so. They should really only apply when deeper layers of sclang have been modified.
- Validate across platforms ~~(unlikely to conflict)~~ (I was so naïve)
- Validate across locales (possible conflicts)
- Add tests targeting specific subroutines of the lexer (accidentals, floats, radix/hex notation) - that's why this is a work-in-progress
- Add tests that target the compiler by repurposing the lexer tests
- Finish documentation via readme
- Add notes elsewhere about the existence of these tests


nhthn · 2017-03-21T21:49:16Z

I already mentioned this on Gitter, but TestParser succeeds, and TestCompilerBrutal has two failures related to the presence of LID. So, all that remains to get the tests to pass are LID and the radix issue.

@snappizz

- Add a simple debugging method.
- Make doOutputsMatch check the input string for "LID" instead of the outputs. Also add debugging messages that alert the user when certain inputs are being considered equal that are not strictly equal.
- Make doOutputsMatch consider floating point numbers that are very close equal (1e-13). Solves an issue @snappizz had on Linux.
- Write tests for all changes.
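
The floating-point relaxation described above might look something like this. It is a Python sketch, not the actual `doOutputsMatch`; it assumes outputs are compared as strings and that 1e-13 is an absolute tolerance, neither of which this thread confirms. The name `outputs_match` is hypothetical.

```python
import math

TOLERANCE = 1e-13  # value quoted in the commit message; absolute tolerance is an assumption


def outputs_match(a, b, tolerance=TOLERANCE):
    """Return True if two recorded outputs are equal or numerically very close."""
    if a == b:
        return True
    try:
        x, y = float(a), float(b)
    except ValueError:
        return False  # at least one output is not a number, so strict inequality stands
    if math.isnan(x) and math.isnan(y):
        return True   # nan never compares equal to itself, so handle it explicitly
    if x == y:
        return True   # covers infinities spelled differently
    return abs(x - y) <= tolerance


print(outputs_match("1.0", "1.00000000000001"))  # True
print(outputs_match("1.0", "1.000001"))          # False
print(outputs_match("abc", "abd"))               # False
```

Every relaxation of this kind widens what counts as a pass, so it is worth logging each non-strict match.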

nhthn · 2017-03-21T22:26:12Z

I think it would be a good idea to add some comments elsewhere in the sclang source reminding future developers that these tests exist. I suggest advertising it to yylex() and the Bison source file at least.

mossheim · 2017-03-21T22:42:43Z

@snappizz I've added your suggestion to the main checklist at the top of this PR. I will get around to the documentation after tests pass & your code review if that's OK.

Ready to test again. I added more regression tests to TestLPCTestUtils as I solved each issue. It's still possible I missed something simple. In any case, you'll now get debugging messages when certain results are skipped for nan, LID, or floating-point precision reasons, which I think is a good thing to have. As the responsibilities of doOutputsMatch increase, I worry about it over-accepting results.

Also, per discussion on Gitter, this PR should be squash-merged. I'm a little sad that all my nice commit messages will disappear, but I think the code is well-commented where important or strange decisions had to be made.

nhthn · 2017-03-22T00:34:03Z

@brianlheim

> I'm a little sad that all my nice commit messages will disappear, but I think the code is well-commented where important or strange decisions had to be made.

You can rebase and still preserve some commit messages as desired.

LID tests are still failing for me, though:

~/git/supercollider/testsuite/sclang/lpc [sclang-tests L|✚ 1⚑ 2] 
17:32 $ tail -n +1 */*_diff
==> compiler/allChars_3_basicNoTCO_diff <==

0 entries were missing from the test file
-----------------------------------------

0 entries were missing from the validation file
-----------------------------------------------

1 entries were different (test vs validation)
---------------------------------------------
LID: "0000F2" vs "compile-error"

==> compiler/allChars_3_basicTCO_diff <==

0 entries were missing from the test file
-----------------------------------------

0 entries were missing from the validation file
-----------------------------------------------

1 entries were different (test vs validation)
---------------------------------------------
LID: "0000F2" vs "compile-error"

==> lexer/half_3_basic_diff <==

0 entries were missing from the test file
-----------------------------------------

0 entries were missing from the validation file
-----------------------------------------------

1 entries were different (test vs validation)
---------------------------------------------
LID: "LID:Meta_LID" vs "compile-error"

==> lexer/half_3_semanticPrefix_diff <==

0 entries were missing from the test file
-----------------------------------------

0 entries were missing from the validation file
-----------------------------------------------

1 entries were different (test vs validation)
---------------------------------------------
LID: "LID:Meta_LID" vs "compile-error"

==> lexer/half_3_semanticSuffix_diff <==

0 entries were missing from the test file
-----------------------------------------

0 entries were missing from the validation file
-----------------------------------------------

1 entries were different (test vs validation)
---------------------------------------------
LID: "unique:Symbol" vs "compile-error"

mossheim · 2017-03-22T16:49:13Z

OK, I defined a dummy LID class on my own machine and made sure it works now. You should see a couple lines like this while running:

[debug] LPCTestUtils: Ignoring a result because of LID class.	Input: [ L, I, D ]	Output 1: [ 4C4944, Meta_LID ]	Output 2: [ !cErr ]

I'll try to run these tests on Windows later today.
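
For reference, the `4C4944` in the debug line above is just "LID" written as hex, which is easy to check. The sketch below is Python, not the LPCTestUtils code. The skip rule is an assumption: this thread says only that the input string is checked for "LID", not whether that means equality or containment. The names `to_hex` and `should_ignore` are hypothetical.

```python
def to_hex(text):
    """Encode ASCII text as an uppercase hex string, two digits per character."""
    return text.encode("ascii").hex().upper()


LID_STRING = to_hex("LID")


def should_ignore(input_symbols):
    """Assumed rule: skip any comparison whose input contains the class name LID."""
    return "LID" in "".join(input_symbols)


print(LID_STRING)                        # 4C4944
print(should_ignore(["L", "I", "D"]))    # True
print(should_ignore(["L", "I", "E"]))    # False
```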

nhthn · 2017-03-22T20:21:45Z

All the tests pass now! I'll do code review over the next few days, and also see what happens if I mess with yylex().

- Use 'expected', 'actual', and 'diff' as root level directories to more clearly separate and label files.
- Rewrite directory behavior for cross-platform compatibility
- Add `safeMkdir` to LPCTU
- Formatting changes
- Add options to delete on finish and clobber existing files


mossheim · 2017-03-23T03:28:27Z

These tests now also pass on Windows. (Win10+VS 2013 build)
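
The Windows handling amounts to normalizing the platform-specific spellings before comparing: per the commit message in this PR, Windows prints inf as '1.#INF' and nan as '1.#IND'. A Python sketch of that idea follows. The table holds only the two spellings named in this PR, and the function names are hypothetical, not the helper that was factored out of `doOutputsMatch`.

```python
# Only the two spellings named in this PR; other variants are deliberately left out.
WINDOWS_SPELLINGS = {
    "1.#INF": "inf",
    "1.#IND": "nan",
}


def normalize_special(token):
    """Map a Windows spelling of inf/nan to the common one; pass other text through."""
    return WINDOWS_SPELLINGS.get(token, token)


def same_special_value(a, b):
    """Compare two printed values after normalizing platform-specific spellings."""
    return normalize_special(a) == normalize_special(b)


print(same_special_value("1.#INF", "inf"))  # True
print(same_special_value("1.#IND", "nan"))  # True
print(same_special_value("1.#INF", "nan"))  # False
```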

mossheim · 2017-03-23T03:31:49Z

LPCTestUtils has gotten pretty large (~900 loc). I could break out some of the file utilities (and corresponding tests) into a separate class...

nhthn

extremely sorry this took so long...

some of the files mix spaces and tabs, but we can fix that later.

mossheim · 2017-04-24T00:59:43Z

Wooooo. Yah I see one that I didn't intend. The rest are for alignment unless I'm missing something obvious. OK, will document and have ready for (squash) merge soon.

LFSaw · 2017-04-24T17:13:01Z

((( just want to say thanks for the massive work you're investing into this... sorry for the noise and keep it up, massively appreciated! :) )))

mossheim · 2017-05-05T21:04:54Z

OK, done with the readme. Since this has two approved reviews and doesn't do anything with the main lib, I'm assuming this is good to merge once the CI tests pass.

To reduce noise in the commit log, eliminate the snapshots that had all those text files I committed in the first stages of this project, and avoid a rebase, I'm going to merge this as a squashed commit. Thanks again everyone for your time and reviews!

Brian Heim added 30 commits February 26, 2017 17:43

Add LPCTestUtils.sc

cc077fb

Add TestLPCTestUtils.sc

aa79bd0

Add stub TestLexerBrutal.sc

a2b32b7

Add Brutal Lexer Tests

bbd1958

Change output directory to something more specific

0df3dff

Set TestLexerBrutal.sc to generate validation files

24af758

Write printDiffs to display results intelligently

343580d

Update LPCTestUtils.sc

6180ff9

- Move around var decls in testAllPossibleStrings
- Initialize diffs = []

TestLexerBrutal: fix bugs

ac8d4c4

- Make constructor use a uniquely named initializer
- Rename init -> initAlphabets
- Convert method names from symbols to strings (can't index into symbol)

Add convenience post to TestLexerBrutal

cd4704b

Add brutal lexer result files

37642bb

s/standard/expected in msg

ffae330

TestLexerBrutal: give a better info post

8ecbc11

TestLexerBrutal: switch to testing against validation files

8b7f073

TestLexerBrutal: add convenience post if no diffs found

562a2bb

TestLexerBrutal: post an extra blank line before test header

2657591

TestLexerBrutal: fix postln

27cf251

Add stub TestParserBrutal

a8931c5

LPCTestUtils: comment out some unnecessary posts

d756eeb

remove extra sem

d4a6d4f

Define full parser alpahbet

6e65ae9

Define small alphabet

9b60768

Define mini alphabet

4d3c0e4

Copy over checkDiffs and runParserTests from TestLexerBrutal

1ede0a1

TestParserBrutal: make runParserTests conform to alphabet types

bc359ea

TestParserBrutal: add comments

ee0f14d

TestParserBrutal: Factor out alphabet-specific testing

9249ac5

TestLexerBrutal: factor out alphabet testing

a1200aa

TestParserBrutal: add test cases

df0ab89

Move around diffs, fix errors

8bb12b0

LPCTestUtils: add constant fields and classvars

d91392f

- lidString: "LID" as a hex string
- strictOutputChecking: used in doOutputsMatch to turn on/off strict checking
- maxline increased to 4096

mossheim added 2 commits March 21, 2017 18:31

LPCTU & TestLPCTU: cosmetic changes, test updates

c5faf14

Surface LPCTU options in test_script.scd

7bf926d

mossheim added 2 commits March 22, 2017 12:44

AbstractLPCBrutalTest: checkDiffs -> handleDiffs

1d628d5

LPCTU: change lidString to the correct string, update tests

f39c6e1

mossheim added 7 commits March 22, 2017 18:33

deleteOnFinish -> deleteActualFilesOnFinish

4ef8203

LPCTU: small directory handling changes

bd5af3a

- fix error causing typo
- fix diff file naming
- fix behavior of overwriteFiles

Update test data archive and readme

25e2c00

- Rezip and rename to reflect new directory naming system
- Update readme with new information/instructions

Remove _correct suffix from test files

24a8373

This suffix is no longer needed with the new directory structure

Remove .DS_Store file from zip archive

221859c

LPCTU: Deal with windows representation of inf/nan

a935db2

Windows prints inf as '1.#INF' and nan as '1.#IND'. This commit adds code and tests for `doOutputsMatch` to handle this. Also factors some of the code out to a new function.

nhthn removed the Work In Progress (WIP) - don't merge yet label Apr 23, 2017

nhthn approved these changes Apr 23, 2017


spaces -> tabs

f68678f

Improve readme

da0b8cd

mossheim merged commit fc3d732 into supercollider:master May 6, 2017

mossheim deleted the topic/sclang-tests branch May 6, 2017 19:06
