Flush error messages incrementally after processing a file #4396

msullivan · 2017-12-20T01:47:33Z

In order to avoid duplicate error messages for errors produced in both
load_graph() and process_graph() and to prevent misordered error
messages in a number of places, lists of error messages are now
tracked per-file.

These lists are collected and printed out when a file is complete. To
maintain consistency with clients that use .messages() (namely,
tests), messages are generated file-at-a-time even when not printing
them out incrementally.

Fixes #1294

JukkaL

Looks good! I mostly left minor notes, but I'd like to see some more rigorous testing. Also, have you verified manually that streaming can produce errors faster using a significant codebase (such as mypy itself)?

JukkaL · 2017-12-20T18:09:20Z

mypy/errors.py

+
+        Use a form suitable for displaying to the user.
+        """
+        self.new_messages()


Add comment about new_messages() storing new messages as a side effect (or potentially rename the method to something that makes this more explicit).

JukkaL · 2017-12-20T18:34:09Z

mypy/test/testerrorstream.py

+        if msgs:
+            a.append('==== Errors flushed ====')
+            a += msgs
+    plugin = ChainedPlugin(options, [LoggingPlugin(options, flush_errors), DefaultPlugin(options)])


Style nit: Add empty line after nested function for clarity.

JukkaL · 2017-12-20T18:37:22Z

test-data/unit/errorstream.test

+-- starting with "----" that are not ignored. The first two dashes of these
+-- lines are interpreted as escapes and removed.
+
+[case testErrorStream]


Add test case with an import cycle? You can perhaps use "deferred nodes" to interleave messages from two modules (like 'error from module a', 'error from module b', 'error from module a') and then you can test that the errors are grouped by file correctly.

JukkaL · 2017-12-20T18:42:40Z

mypy/main.py

    try:
-        res = type_check_only(sources, bin_dir, options)
+        res = type_check_only(sources, bin_dir, options, flush_errors)


I'd prefer if the main body of test cases (mypy/test/testscheck.py) would use streamin 8000 g errors (without testing that flushing happens as expected, i.e. we'd still only test that the sequence of errors in the output is as expected). Not sure if this is feasible without many changes to test cases. If not, at least it would be good to do a one-off manual check that streaming produces the same errors as not streaming in test cases.

JukkaL · 2017-12-20T18:48:15Z

test-data/unit/errorstream.test

+[out]
+==== Errors flushed ====
+a.py:1: error: Unsupported operand types for + ("int" and "str")
+==== Blocking error ====


What about also adding a third error that would be get reported after the blocking error (but doesn't get reported, because there was a blocking error I assume)?

msullivan · 2017-12-21T22:20:40Z

Time to report an error in mypy/errors.py was 4.3s with this patch and 11.8s without it.

gvanrossum

You may take my comments about the brittleness of the API with a grain of salt -- as we discussed off-line there isn't a great alternative. If you see ways to make it less brittle without having to change various call sites I'm all for it though!

gvanrossum · 2018-01-02T21:35:58Z

mypy/build.py


+
+def flush_compile_errors(f: F) -> F:


I suggest not to make this a decorator at all, but a wrapper function. You could rename build -> _build and then create a new build function with the same signature that calls _build and catches CompileError etc. You wouldn't need a type variable and the code smell would be less -- no higher-order function, no functools, no TypeVar, no cast, no kwargs.get(). (The price would be more repetition in the argument lists but that's all KISS. :-)

gvanrossum · 2018-01-02T21:40:47Z

mypy/main.py

+            for m in a:
+                f.write(m + '\n')
+        except BrokenPipeError:
+            pass


I'd sys.exit(1) here.

gvanrossum · 2018-01-02T22:17:26Z

mypy/errors.py

+    # files were processed.
+    error_info_map = None  # type: Dict[str, List[ErrorInfo]]
+
+    # The size of error_info the last time that error messages were flushed


There is no error_info any more. :-) More seriously, this design feels a bit brittle, but I'm not sure how to remove that feeling -- I was thinking of having two maps, one with flushed errors and one with "new" errors, where the new_* method transfers from the latter to the former, but that would require a whole bunch of updates to e.g. num_errors etc.

But perhaps the flushed errors are not used any more (other than being counted, and that's just used as a Boolean)? That might simplify things a bit? (It seems messages() just returns formatted_messages so it doesn't need the flushed errors either.)

gvanrossum · 2018-01-02T22:30:23Z

mypy/errors.py

@@ -302,40 +317,41 @@ def add_error_info(self, info: ErrorInfo) -> None:
            if info.message in self.only_once_messages:
                return
            self.only_once_messages.add(info.message)
-        self.error_info.append(info)
+        self._add_error_info(info)
        self.error_files.add(file)


The whole variable error_files is redundant, it should always be equal to set(self.error_info_map.keys()). (But see my comment about the design of the latter.)

gvanrossum · 2018-01-02T22:38:36Z

mypy/errors.py

        super().__init__('\n'.join(messages))
        self.messages = messages
        self.use_stdout = use_stdout
        self.module_with_blocker = module_with_blocker
+        self.num_already_seen = num_already_seen


This adds another ugly wart to the API (see the code in build.py that uses this variable). It might be prettier to have another List[str] attribute unflushed_messages that's set here. (Yet another place where I don't like indices pointing into arrays. :-)

gvanrossum · 2018-01-02T23:20:34Z

test-data/unit/errorstream.test

@@ -0,0 +1,72 @@
+-- Test cases for incremental error streaming. Each test case consists of two
+-- sections.
+-- The first section contains [case NAME] followed by the input code, while


This sentence is redundant, since all .test files work that way.

gvanrossum · 2018-01-02T23:20:40Z

test-data/unit/errorstream.test

+-- a plugin when a call to it is checked, which can be used to verify that
+-- error messages are printed before doing later typechecking work.
+--
+-- The input file name in errors is "file".


What does this mean?

gvanrossum · 2018-01-02T23:20:54Z

test-data/unit/errorstream.test

+--
+-- The input file name in errors is "file".
+--
+-- Comments starting with "--" in this file will be ignored, except for lines


This is also redundant.

mypy/test/testerrorstream.py

+        if msgs:
+            logged_messages.append('==== Errors flushed ====')
+            logged_messages.extend(msgs)
+        if is_real:


This block can also be indented.

gvanrossum · 2018-01-02T23:24:21Z

mypy/test/testerrorstream.py

+    logged_messages = []  # type: List[str]
+    real_messages = []  # type: List[str]
+
+    def flush_errors(msgs: List[str], serious: bool, is_real: bool=True) -> None:


There should be spaces around the = (it's a special case in PEP 8 :-).

gvanrossum · 2018-01-03T15:35:22Z

Another thought: let BuildResult only collect messages that haven't been flushed yet. This means that in general there are three ways you can get messages: (1) via the flush_errors callback, if set; (2) via BuildResult, if no flush_errors callback is set; (3) via CompileError. This way you can keep the old way of keeping track of messages, and you don't need to cache formatted messages. The tests simply pass flush_errors=None and get the message from BuildResult.

9E88

msullivan · 2018-01-04T00:59:16Z

The benefit of the caching scheme and returning the messages even when streaming is on is that it makes it easy to ensure that the streaming and the fixed interfaces return the same messages in the same order and also to test that. If we think that is important, ditching the caching would I think require some other machinery.

gvanrossum · 2018-01-04T01:48:26Z

I'm pretty confident about the ordering regardless, so I'd rather do without the cache scheme etc.

msullivan · 2018-01-04T20:35:45Z

Getting to the ordering to match without caching will require some new machinery not in the current patches: the errors are streamed out in the order that SCCs are processed, but this might not match the order in the OrderedDict if errors were generated at parse time. The machinery might be simpler than the caching though, so.

In order to avoid duplicate error messages for errors produced in both load_graph() and process_graph() and to prevent misordered error messages in a number of places, lists of error messages are now tracked per-file. These lists are collected and printed out when a file is complete. To maintain consistency with clients that use .messages() (namely, tests), messages are generated file-at-a-time even when not printing them out incrementally. Fixes #1294

msullivan · 2018-01-04T21:30:43Z

Nevermind, I have what I think is a good plan.

gvanrossum · 2018-01-04T22:42:14Z

Aren't parse-time errors always fatal (blocking)?

msullivan · 2018-01-05T00:16:19Z

Actual parse errors are blocking but the first pass of semanti 10000 c analysis is done immediately after parsing, and those can be nonblocking.

gvanrossum

I like this version a lot better! But I still have a bunch of questions and suggestions.

gvanrossum · 2018-01-05T22:21:31Z

mypy/build.py

@@ -25,6 +25,7 @@
 import time
 from os.path import dirname, basename
 import errno
+from functools import wraps


You don't need this any more.

gvanrossum · 2018-01-05T22:47:18Z

mypy/build.py

@@ -703,6 +743,9 @@ def add_stats(self, **kwds: Any) -> None:
    def stats_summary(self) -> Mapping[str, object]:
        return self.stats

+    def error_flush(self, msgs: List[str], serious: bool=False) -> None:


Consider getting rid of this and just inlining the only call site to add serious=False?

gvanrossum · 2018-01-05T23:15:12Z

mypy/errors.py

@@ -554,7 +596,8 @@ def report_internal_error(err: Exception, file: Optional[str], line: int,
    # Dump out errors so far, they often provide a clue.
    # But catch unexpected errors rendering them.
    try:
-        for msg in errors.messages():
+        errors.flushed_files = set()  # Print out already flushed messages too


Have you tried to provoke this? ISTM that in the only use case where it matters (real users running into a crash) this will just print the entire list of errors twice, potentially just confusing everyone. Or is there a unit test that needs this?

Also, flushed_files feels like an internal attribute of the Errors class -- if you really need this consider making it a flag to new_messages().

My thought was that I wanted to always make sure that all of the messages printed, even in the cases where the messages were being buffered in build.build. But I think you are right and we would rather lose some messages while running tests than confuse matters by printing duplicate messages in actual use.

gvanrossum · 2018-01-05T23:59:06Z

mypy/main.py

+
+    messages = []
+
+    def flush_errors(a: List[str], serious: bool) -> None:


Can you rename a to something longer?

gvanrossum · 2018-01-08T22:02:20Z

mypy/build.py

@@ -1973,6 +2016,10 @@ def write_cache(self) -> None:
    def dependency_priorities(self) -> List[int]:
        return [self.priorities.get(dep, PRI_HIGH) for dep in self.dependencies]

+    def generate_unused_ignore_notes(self) -> None:
+        if self.options.warn_unused_ignores:


Since you've made this effectively into a per-module option, please add it to the list of such in options.py.

gvanrossum · 2018-01-08T22:17:23Z

mypy/errors.py

@@ -90,15 +93,17 @@ class Errors:
    current error context (nested imports).
    """

-    # List of generated error messages.
-    error_info = None  # type: List[ErrorInfo]
+    # Map from files to generated error messages. Is an OrderedDict so


I wonder if it would be safer to use the module ID rather than the file as the key? Because add_error_info() doesn't call remove_path_prefix(). And IIRC sometimes different passes have different ideas about the filename (normalized or not). However the old code makes the same assumption about error_files I suppose.

Not all error infos have a module, unfortunately.

gvanrossum · 2018-01-08T22:20:32Z

mypy/errors.py


    def raise_error(self) -> None:
        """Raise a CompileError with the generated messages.

        Render the messages suitable for displaying.
        """
-        raise CompileError(self.messages(),
+        # self.new_messages() will format all messages that haven't already
+        # been returned from a new_module_messages() call.


s/new_module_messages/???/

gvanrossum · 2018-01-08T22:22:42Z

mypy/errors.py

@@ -511,6 +547,12 @@ class CompileError(Exception):

    It can be a parse, semantic analysis, type check or other
    compilation-related error.
+
+    CompileErrors raised from an errors object carry all of the


This comment is very helpful. But perhaps a form of it would also be useful in the except clause in build.build, where the logic had me baffled for a bit.

gvanrossum · 2018-01-08T22:29:36Z

mypy/test/testerrorstream.py

+                    alt_lib_path=test_temp_dir,
+                    flush_errors=flush_errors)
+    except CompileError as e:
+        pass


Shouldn't you at least assert that there are no messages in the error object?

gvanrossum · 2018-01-08T22:31:09Z

test-data/unit/check-kwargs.test

        pass
+
+[out]


Consider adding a comment (--) explaining why the errors appear out of order?

gvanrossum

OK, I am happy with the code now. Can you update the docs per my suggestion?

gvanrossum · 2018-01-09T22:35:03Z

mypy/options.py

@@ -34,6 +34,7 @@ class Options:
        "show_none_errors",
        "warn_no_return",
        "warn_return_any",
+        "warn_unused_ignores",


Oh, now the docs also need to be updated (it has separate sections for global and per-module flags).

gvanrossum · 2018-01-09T23:21:44Z

Congrats! A nice piece of work. I'll merge the other thing too as soon as I remember what it was.

marcintustin · 2018-03-07T00:19:21Z

This is breaking pytest integration, any chance of a bugfix release any time soon?

gvanrossum · 2018-03-07T01:16:15Z

@marcintustin Please file a separate bug report with more details.

marcintustin · 2018-03-07T01:20:35Z

@gvanrossum I'm not saying that this PR is causing a problem. I'm saying that the lack of a release including this PR is causing a problem. You still want a separate bug report?

JelleZijlstra · 2018-03-07T01:37:44Z

There has been a release since this was merged, so I'm not sure what you're referring to.

marcintustin · 2018-03-07T02:46:32Z

@JelleZijlstra @gvanrossum Well I suspect then that I've misunderstood the chatter on the issue where this is linked as the cause of the pytest integration problem. :( Apologies; I'll chase up on the integration first.

emmatyping · 2018-03-07T02:54:14Z

@marcintustin this PR broke things as filed in #4681. That issue is missing that it causes errors in pytest-mypy. ~~I'm going to go ahead and submit a fix for it~~ (No fix is needed on the mypy side)

marcintustin · 2018-03-07T02:56:17Z

@ethanhs You're quite right. Thanks for clarifying.

msullivan requested review from JukkaL and gvanrossum December 20, 2017 01:47

JukkaL reviewed Dec 20, 2017

View reviewed changes

ilevkivskyi mentioned this pull request Dec 27, 2017

Fix crash due to checking type variable values too early #4384

Merged

mitar mentioned this pull request Dec 29, 2017

Wrong filenames reported for recursive type not supported error #4413

Closed

gvanrossum requested changes Jan 2, 2018

View reviewed changes

msullivan added 10 commits January 4, 2018 13:04

Add more tests and stream blocking errors as well

76f945f

Check for streaming errors matching in testcheck

310611d

Don't use variable type annotations

059de14

Flush the file buffers after writing for better behavior when wrapped

c9a5a96

ditch the decorator

76bb7f8

Eliminate the indexing

5c9abec

Drop plugin part of the test, do test cleanup

844d5ca

Remove error_files

c9d2355

fix a variable name

699ba0b

msullivan added 2 commits January 4, 2018 14:59

Ditch the caching system, run all error aggregation through the callback

376982d

Get rid of the .messages() method

caa8477

msullivan force-pushed the error_flush branch from 0285070 to caa8477 Compare January 4, 2018 23:58

Key errors based on origin file

92dfc2d

msullivan mentioned this pull request Jan 8, 2018

Exit with code 2 on blocking errors #4443

Merged

gvanrossum requested changes Jan 8, 2018

View reviewed changes

Perform various cleanups

cc64198

gvanrossum approved these changes Jan 9, 2018

View reviewed changes

update warn_unused_ignores documentation

548078c

gvanrossum merged commit 10522cf into master Jan 9, 2018

petr-muller mentioned this pull request Mar 5, 2018

required argument 'flush_errors' error with latest version of mypy realpython/pytest-mypy#6

Closed

derlih mentioned this pull request Mar 6, 2018

In type_check_only function flush_errors parameter should be optional #4681

Closed

gvanrossum mentioned this pull request Mar 8, 2018

Streaming output for daemon mode #4702

Closed


		messages = []

		def flush_errors(a: List[str], serious: bool) -> None:

		pass

		[out]

Uh oh!

Flush error messages incrementally after processing a file #4396

Flush error messages incrementally after processing a file #4396

Uh oh!

Conversation

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!