I noticed that for the "Scanning: Lexical Errors" stage, the run scripts for the Gleam track pipe the program's output through grep to filter out the compiler messages, as a stopgap until the gleam CLI has a flag to suppress them.
However, these grep filters seem to prevent exit code 65 from being returned. The program's stdout is piped into grep's stdin, and by default the shell reports the exit status of the last command in a pipeline. grep succeeds, so the script exits with 0 instead of 65.
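
To illustrate, here is a minimal sketch of the behaviour and one possible workaround. It is not the actual run script: `fake_interpreter` and the `"Compiling"` pattern are hypothetical stand-ins for the real program invocation and the real grep filter.

```sh
#!/bin/sh
# Hypothetical stand-in for the interpreter: prints a compiler-style message
# and one line of real output, then fails with exit code 65.
fake_interpreter() {
  echo "  Compiling example"
  echo "EOF  null"
  return 65
}

# Problem: the pipeline's exit status is grep's, so the 65 is lost.
fake_interpreter | grep -v "Compiling"
plain_status=$?
echo "plain pipeline status: $plain_status"   # prints 0

# Possible workaround: capture the output and the exit status first,
# then filter the captured output separately.
fixed_status=0
output=$(fake_interpreter) || fixed_status=$?
printf '%s\n' "$output" | grep -v "Compiling"
echo "preserved status: $fixed_status"        # prints 65
```

A real script would then finish with `exit "$fixed_status"`. This version buffers all output before filtering, which is probably fine for short test runs. If the run scripts use bash, reading `${PIPESTATUS[0]}` right after the pipeline, or enabling `set -o pipefail`, might be simpler, though with `pipefail` a grep that selects no lines (exit status 1) would also make the pipeline fail.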