Feature Name: expressive_diagnostics
Start Date: 2023-08-07
RFC PR: FuelLabs/sway-rfcs#30
Sway Issue: FuelLabs/sway#5079

Summary

Expressive diagnostics will provide detailed, helpful, and human friendly diagnostics (warnings and errors) to Sway programmers. The diagnostic messages will be focused on the code that the programmer wrote and will lead the programmer to a possible issue resolution. At the same time, the internal implementation of diagnostics will empower Sway language developers to easily define detailed and helpful diagnostics. The implementation, the Diagnostics API, will also enforce consistent warning and error reporting.

Motivation

There is overwhelming evidence, based on experience^1,2,3,4 and research^5,6, that the quality of compiler diagnostics has a significant influence on:

the programmer's perception of the language
the steepness of the language learning curve
and the programmer's productivity.

Experience from several communities, like Rust², Elm³, and C++^1,5,7, to name a few, has shown that improving diagnostics had a great impact on the three points listed above.

At the moment, Sway diagnostics vary significantly in level of detail, wording, and helpfulness. We have messages that strive to be helpful and give hints to the programmer. E.g.:

Generic type \"{name}\" is not in scope. Perhaps you meant to specify type parameters in \
    the function signature? For example: \n`fn \
    {fn_name}<{comma_separated_generic_params}>({args}) -> ... `

The way we structure those messages is, however, not standardized. E.g., here is another attempt to provide a detailed error explanation, formatted and conveyed in a completely different way:

expected: {expected} \n\
    found:    {given} \n\
    help:     The definition of this {decl_type} must \
    match the one in the {interface_name} declaration.

At the moment, although detailed and helpful, these messages are "packed" within a single piece of text pointing to a single place in code. This significantly limits the amount of helpful information that can be provided to the programmer. It also limits the presentation of the context in which the diagnostic occurs. The overall context usually spans several points in code. We want to be able to place the useful information at the exact places in code that are relevant to the diagnostic.

Sway programmers are also confronted with short, sometimes cryptic error messages, containing jargon used by compiler developers. E.g.:

Invalid value \"{value}\"

Constant requires expression.

Internally, expressive diagnostics will empower Sway language developers to easily define detailed and helpful diagnostics that will be focused on the exact code that the programmer wrote. The Diagnostics API will also provide a unified way to define diagnostics. (At the moment we have three different approaches for defining diagnostics. CompileError, CompileWarning, and ParseError are representatives of those different approaches.)

Externally, we expect to see the effects experienced in other communities:

better perception of Sway as a language
easier language learning curve
increased efficiency in resolving compiler errors and warnings.

Guide-level explanation

Diagnostics will be displayed in different clients. E.g.:

forc CLI output.
VS Code hints.
VS Code compiler output.
VS Code Problems.

Each client can choose which part of diagnostic to show and how.

When using the forc CLI, Sway programmers will get detailed diagnostics consisting of the following elements:

Level: Error or Warning.
Code: Unique code of the diagnostic.
Reason: Short description of the diagnostic, not related to the specific issue that the programmer caused. The reason answers the question "Why is this an error or warning?" E.g., because "Match pattern variable is already defined". It points out the general language rule that was violated.
Issue: Description of the concrete issue caused by the programmer, placed in the source code. E.g., "Variable "x" is already defined in this match arm."
Hints: Detailed descriptions of the diagnostic, placed in the source code. They point to other places in code that give additional contextual information about the issue.
Help: Additional friendly information that helps to better understand and solve the issue. Same as hints, help entries can be related to a place in code, or they can be placed in the footnotes.
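The elements listed above can be sketched as a small data model. This is only an illustration of how the parts relate, not the actual Diagnostic struct of the Sway compiler. The names Location, Label, header, and sample, the file name main.sw, and the hint and help texts are assumptions made for this example, and E4001 is used purely as a sample code.

```rust
/// Severity of a diagnostic.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Level {
    Error,
    Warning,
}

/// Hypothetical stand-in for a place in code; a real compiler would use a span type.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Location {
    pub file: String,
    pub line: usize,
}

/// A message that is optionally attached to a place in code.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Label {
    pub location: Option<Location>,
    pub text: String,
}

/// Illustrative model of the elements: level, code, reason, issue, hints, help.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Diagnostic {
    pub level: Level,
    pub code: String,
    pub reason: String,
    pub issue: Label,
    pub hints: Vec<Label>,
    pub help: Vec<Label>,
}

impl Diagnostic {
    /// One-line header built from level, code, and reason.
    pub fn header(&self) -> String {
        let level = match self.level {
            Level::Error => "error",
            Level::Warning => "warning",
        };
        format!("{}[{}]: {}", level, self.code, self.reason)
    }
}

/// Builds a sample diagnostic; the code, hint, and help texts are made up.
pub fn sample() -> Diagnostic {
    let place = Location { file: "main.sw".to_string(), line: 12 };
    Diagnostic {
        level: Level::Error,
        code: "E4001".to_string(),
        reason: "Match pattern variable is already defined".to_string(),
        issue: Label {
            location: Some(place.clone()),
            text: "Variable \"x\" is already defined in this match arm.".to_string(),
        },
        hints: vec![Label {
            location: Some(place),
            text: "This is the first definition of the variable \"x\".".to_string(),
        }],
        help: vec![Label {
            location: None, // Help placed in the footnotes.
            text: "Consider renaming one of the variables.".to_string(),
        }],
    }
}
```

A client such as a CLI could print the header first and then decide which of the remaining elements to show, in line with the idea that each client chooses which parts of a diagnostic to display.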

When using IDEs like VS Code, Sway programmers will have an experience similar to the one offered by rust-analyzer.

A popup in the editor could provide the reason, issue, hints, as well as the help, similar to this Rust example:

The programmer will have the option to open the full compiler diagnostic, getting the same output as when using the forc CLI, similar to this Rust example:

Diagnostics will also be displayed in VS Code Problems:

Wording guidelines

When defining diagnostics, it is important to have in mind all the clients listed above. A diagnostic must not be optimized for one output, e.g., the CLI, while being hard to understand in another, e.g., VS Code hints.

To ensure consistency and apply best practices⁵ the diagnostics will follow these guidelines:

Reason will be short and point out the language constraint which was violated. It will not end with a full stop, to emphasize brevity.
Issue will be short and point out the specific situation encountered in code. It is written in plain English, using proper punctuation and grammar rules.
Reason and issue are given in plain English, free of, e.g., compiler jargon.
Hints and help are also written in plain English, using proper punctuation and grammar rules.
Hints and help try to give as much useful context as possible and to be as specific as possible.
Help in footnotes should be used rarely, only for general explanations and suggestions. Preferably, help should be related to a place in code.
Identifier and type names in messages are enclosed in "double quotes". E.g., "x" or "(u32, bool)".
Code samples in messages are enclosed in `grave accents`. E.g., `let x = 0`.
Articles "the" and "a/an" are not used at the beginning of a sentence. E.g., "Variable "X" is already..." instead of "The variable "X" is already...". They can be used in formulations like "This is the original declaration...".
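Some of these guidelines are mechanical enough to be checked automatically. The following is a minimal sketch of such a check, assuming a hypothetical helper check_reason that is not part of any existing Sway tooling. It covers only two rules: a reason does not end with a full stop, and a message does not start with an article.

```rust
/// Returns true if the first word of the text is an article.
fn starts_with_article(text: &str) -> bool {
    let first = text.split_whitespace().next().unwrap_or("");
    matches!(first.to_lowercase().as_str(), "the" | "a" | "an")
}

/// Returns the wording guideline violations found in a reason text.
/// Hypothetical helper; only two of the guidelines are checked.
pub fn check_reason(reason: &str) -> Vec<&'static str> {
    let mut problems = Vec::new();
    if reason.trim_end().ends_with('.') {
        problems.push("reason must not end with a full stop");
    }
    if starts_with_article(reason) {
        problems.push("message must not start with an article");
    }
    problems
}
```

Guidelines such as "free of compiler jargon" still need human review; a check like this could only catch the purely formal rules.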

To avoid the unnecessary complexity that comes from a high number of diagnostic reasons (both for Sway language developers and Sway programmers), we will introduce new error codes restrictively. Reusing existing error codes and reasons will be the preferable option. To communicate specific cases we will use hints and help.

Here is an example. The existing error message:

Name "MyName" is defined multiple times.

could, depending on the context, have the following form:

Reason: Item names must be unique
Issue:  There is already an imported struct with the name "MyName"
Hints:  [error] This is the enum "MyName" that has the same name as the imported struct.
        [info]  The struct "MyName" gets imported here.
        [info]  This is the original declaration of the imported struct "MyName".
Help:   Items like structs, enums, traits, and ABIs must have a unique name in scope.
        Consider renaming the enum "MyName" or using an alias for the imported struct.

Diagnostic codes

A diagnostic code uniquely identifies a particular reason. Considering that we already have these diagnostic areas:

Lexical analysis
Parsing
Parse tree conversion
Type checking
Semantic analysis
Warnings

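the pragmatic way to ensure uniqueness and assignment of a new code number is to have a code made of three parts: a letter denoting the level (E for errors, W for warnings), a digit denoting the area, and a three-digit number unique within that area.

E.g., E4001 would be the first error in the semantic analysis area.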
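The code scheme can be sketched as a small formatting function. The sketch assumes that a code consists of a two-character area prefix followed by a three-digit number, and that areas are numbered from zero in the order listed above. Both assumptions are inferred from the sample codes E4001 and W0001 used in this RFC. The names Area and format_code are hypothetical.

```rust
/// Diagnostic areas, in the order listed above.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Area {
    LexicalAnalysis,
    Parsing,
    ParseTreeConversion,
    TypeChecking,
    SemanticAnalysis,
    Warnings,
}

impl Area {
    /// Assumed prefix: level letter plus area digit.
    fn prefix(self) -> &'static str {
        match self {
            Area::LexicalAnalysis => "E0",
            Area::Parsing => "E1",
            Area::ParseTreeConversion => "E2",
            Area::TypeChecking => "E3",
            Area::SemanticAnalysis => "E4",
            Area::Warnings => "W0",
        }
    }
}

/// Formats a diagnostic code, or returns None if the number does not fit
/// into three digits. Numbering is assumed to start at 1.
pub fn format_code(area: Area, number: u16) -> Option<String> {
    if number == 0 || number > 999 {
        return None;
    }
    Some(format!("{}{:03}", area.prefix(), number))
}
```

With this scheme, assigning a new code only requires knowing the highest number already used within one area, which keeps the assignment local to that area.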

Diagnostic codes will be used to:

point to detailed diagnostic explanation similar to Rust online help for error messages.
suppress warnings by using a Sway equivalent of Rust's #[allow].

In the case of warnings, in addition to the code explained above, a diagnostic will have a human-readable identifier which will allow us to have human-readable, self-documenting #[allow]s, similar to the Clippy lint identifiers.

When defining warnings via proc-macros, the human-readable identifier will be defined, next to the code:

    ...
    #[warning(
        reason(1, name_is_not_idiomatic, "Name is not idiomatic"),
    ...

This identifier can then be used in #[allow]s:

#[allow(name_is_not_idiomatic)]
type int8_t = i8;
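On the compiler side, suppression by identifier could boil down to filtering the collected warnings. The following sketch assumes a simplified Warning type and a hypothetical filter_allowed function; how the allowed identifiers are collected from #[allow] attributes and scoped to items is left out, and the second identifier in the usage below is invented.

```rust
use std::collections::HashSet;

/// Simplified warning carrying both the code and the human-readable identifier.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Warning {
    pub code: &'static str,
    pub identifier: &'static str,
    pub message: String,
}

/// Keeps only the warnings whose identifier is not listed in `allowed`.
/// Hypothetical helper; attribute scoping is not modeled.
pub fn filter_allowed(warnings: Vec<Warning>, allowed: &HashSet<&str>) -> Vec<Warning> {
    warnings
        .into_iter()
        .filter(|w| !allowed.contains(w.identifier))
        .collect()
}
```

Matching on the identifier rather than on the numeric code is what makes the #[allow]s self-documenting; the code remains available for pointing to the detailed explanation.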

In addition, we will add an option to the Sway compiler to treat warnings as errors.

Reference-level explanation

We already have the Diagnostic struct modeled after the definition given above. This allows us to enforce the usage of expressive diagnostics via API that models the explained approach.

forc uses the ToDiagnostic trait, implemented at the moment only by CompileError and CompileWarning, to render expressive diagnostics for error and warning messages that support them.

Both implementations of ToDiagnostic provide a fallback that renders a diagnostic indistinguishable from the existing error and warning messages. This ensures backward compatibility and a gradual switch to expressive diagnostics.
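The fallback idea can be illustrated with a simplified sketch. The trait ToSimpleDiagnostic, the struct SimpleDiagnostic, and the enum SampleError below are hypothetical and much smaller than the real ToDiagnostic, Diagnostic, and CompileError; the old-style message for the shadowing error is invented. The point is only that variants without an expressive definition fall back to their existing single message.

```rust
use std::fmt;

/// Simplified diagnostic: a reason is present only for expressive diagnostics.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct SimpleDiagnostic {
    pub reason: Option<String>,
    pub issue: String,
    pub hints: Vec<String>,
}

/// Hypothetical, reduced counterpart of a to-diagnostic conversion trait.
pub trait ToSimpleDiagnostic {
    fn to_diagnostic(&self) -> SimpleDiagnostic;
}

pub enum SampleError {
    /// Old-style error with a single message.
    InvalidValue { value: String },
    /// Error already migrated to an expressive diagnostic.
    ConstantsCannotBeShadowed { name: String },
}

/// Old-style messages, as they would be produced today.
impl fmt::Display for SampleError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            SampleError::InvalidValue { value } => write!(f, "Invalid value \"{}\"", value),
            SampleError::ConstantsCannotBeShadowed { name } => {
                write!(f, "Constant \"{}\" cannot be shadowed", name)
            }
        }
    }
}

impl ToSimpleDiagnostic for SampleError {
    fn to_diagnostic(&self) -> SimpleDiagnostic {
        match self {
            SampleError::ConstantsCannotBeShadowed { name } => SimpleDiagnostic {
                reason: Some("Constants cannot be shadowed".to_string()),
                issue: format!("Variable \"{}\" shadows constant with the same name", name),
                hints: vec![format!("Shadowing via variable \"{}\" happens here.", name)],
            },
            // Fallback: reuse the existing message unchanged, without reason or hints.
            other => SimpleDiagnostic {
                reason: None,
                issue: other.to_string(),
                hints: Vec::new(),
            },
        }
    }
}
```

Because the fallback arm covers every variant that has no dedicated arm, diagnostics can be migrated one at a time without touching the others.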

At the moment, Diagnostic instances can be created only imperatively, by using the Diagnostics API and manually creating the whole Diagnostic structure, Reason, Code, Issue, Hints, etc. This is cumbersome and requires writing boilerplate code. It can be compared to creating error messages without thiserror as we do in CompileWarning.

The proposal is to have a proc-macro for declarative definition of an expressive diagnostic.

E.g., the const shadowing diagnostic given above would be defined as follows:

#[derive(Diagnostic, Debug, Clone, PartialEq, Eq, Hash)]
#[diagnostic(area = DiagnosticArea::SemanticAnalysis)]
pub enum CompileError {
    ...
    #[error(
        reason(1, "Constants cannot be shadowed"),
        issue(
            name.span(),
            "{variable_or_constant} \"{name}\" shadows {}constant with the same name",
            if constant_decl.is_some() { "imported " } else { "" }
        ),
        info(
            constant_span.clone(),
            "Constant \"{name}\" {} here{}.",
            if constant_decl.is_some() { "gets imported" } else { "is declared" },
            if *is_alias { " as alias" } else { "" }
        ),
        info(
            constant_decl.map(|x| x.clone()),
            "This is the original declaration of the imported constant \"{name}\"."
        ),
        error(
            name.span(),
            "Shadowing via {} \"{name}\" happens here.", 
            if variable_or_constant == "Variable" { "variable" } else { "new constant" }
        ),
        help("Unlike variables, constants cannot be shadowed by other constants or variables."),
        help(
            match (variable_or_constant.as_str(), constant_decl.is_some()) {
                ("Variable", false) => format!("Consider renaming either the variable \"{name}\" or the constant \"{name}\"."),
                ("Constant", false) => "Consider renaming one of the constants.".to_string(),
                (variable_or_constant, true) => format!(
                    "Consider renaming the {} \"{name}\" or using {} for the imported constant.",
                    variable_or_constant.to_lowercase(),
                    if *is_alias { "a different alias" } else { "an alias" }
                ),
                _ => unreachable!("We can have only the listed combinations: variable/constant shadows a non imported/imported constant.")
            }
        )
    )]
    ConstantsCannotBeShadowed {
        variable_or_constant: String,
        name: Ident,
        constant_span: Span,
        constant_decl: Option<Span>,
        is_alias: bool,
    },
    ...
}

The issue and infos point to places in code and are commonly called labels. The places in code they refer to are given by Spans. The issue and info elements of #[error] take a Span or an Option<Span> as their first argument, as seen above. If the passed Option<Span> is None, the label is considered not to be in source code.

For hints, not being in source code means that they are not rendered at all. The Diagnostics API will ignore them and clients (CLI, LSP) will not be aware of them.

For the issue, clients will always display it as a part of the diagnostic description. If it is in code, clients can choose to additionally display it in code. E.g., forc always shows the issue text as a part of the diagnostic description, but if the issue is in code and there is a hint pointing to the same place in code, the issue will not be rendered as a label in code. This overloading of the issue text by a hint is visible in the example above and is enforced via the Diagnostics API. It gives us the flexibility to phrase the diagnostic description and the hints independently, while allowing backward compatibility with the current warnings and errors that have only a single message and span.

Using None to denote non-existence of an element allows declarative definition of all the hints, knowing that some of them might not be shown, depending on the concrete Span value passed to the enum variant.
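The two rules described above, dropping hints that are not in code and letting a hint overload the issue label, can be sketched as follows. Span is reduced here to a byte range and labels_in_code is a hypothetical helper, so this is an illustration of the rules rather than the actual Diagnostics API.

```rust
/// Hypothetical, simplified span: a byte range in a single source file.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Span {
    pub start: usize,
    pub end: usize,
}

/// A label is in code only if it has a span.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Label {
    pub span: Option<Span>,
    pub text: String,
}

/// Returns the labels to render in code: the issue, unless a hint points to
/// the same place, followed by all hints that are in code.
pub fn labels_in_code(issue: &Label, hints: &[Label]) -> Vec<Label> {
    // Hints without a span are ignored entirely.
    let hints: Vec<Label> = hints.iter().filter(|h| h.span.is_some()).cloned().collect();
    // The issue is overloaded if any remaining hint has the same span.
    let overloaded = hints.iter().any(|h| h.span == issue.span);
    let mut labels = Vec::new();
    if issue.span.is_some() && !overloaded {
        labels.push(issue.clone());
    }
    labels.extend(hints);
    labels
}
```

The issue text itself is not lost when it is overloaded; it is still shown as a part of the diagnostic description, only not as a label in code.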

Our Diagnostic derive and thiserror's Error derive would coexist until the "old-style" diagnostics are fully replaced with expressive diagnostics. The existing fallback mechanism would still be generated by the Diagnostic derive.

Treating warnings as errors should be straightforward. It requires an additional compilation flag whose check will be added at final receivers of the diagnostics, before the next step in the compilation pipeline. E.g., before we generate IR we treat warnings as errors and stop the IR generation.
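A check of this kind could look like the following sketch. The Diagnostics struct and the function check_before_next_step are hypothetical and use plain strings instead of real diagnostics; the sketch only shows where the flag comes into play.

```rust
/// Diagnostics collected so far, reduced to plain messages for this sketch.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Diagnostics {
    pub errors: Vec<String>,
    pub warnings: Vec<String>,
}

/// Returns Ok if the next compilation step may run, otherwise the diagnostics
/// that block it. With `warnings_as_errors` set, warnings block as well.
pub fn check_before_next_step(
    diagnostics: &Diagnostics,
    warnings_as_errors: bool,
) -> Result<(), Vec<String>> {
    let mut blocking = diagnostics.errors.clone();
    if warnings_as_errors {
        blocking.extend(diagnostics.warnings.iter().cloned());
    }
    if blocking.is_empty() {
        Ok(())
    } else {
        Err(blocking)
    }
}
```

Placing the check at the final receivers of the diagnostics means that the code producing warnings does not need to know about the flag at all.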

Drawbacks

The only drawback I can think of is the time and effort needed to fully roll out expressive diagnostics. This means replacing all of the existing diagnostics with their expressive counterparts.

From the experience gained while working on supporting multi-span errors and warnings, I can say that crafting a good expressive diagnostic takes time, as does adjusting the existing e2e tests. Coding the diagnostics themselves is a negligible effort.

But this replacement can and should be done gradually, as a side task. It should be distributed among Sway language developers and other teams that want to contribute. The same approach was taken, e.g., by the Rust and Elm communities.

Rationale and alternatives

Not implementing expressive diagnostics would mean falling behind the modern approach to compiler diagnostics. Expressive diagnostics are becoming a de facto standard in compiler development (see the Prior art section below). To position Sway as a modern and developer-friendly language, we have to follow this standard. I don't see how we can choose not to implement expressive diagnostics.

As for the proposed design, the approach and its alternatives were briefly discussed when implementing support for multi-span errors and warnings.

Alternative 1: Using generic diagnostic structure

This approach would mean having the possibility to define an arbitrary diagnostic, similar to an arbitrary output that can be achieved by using Snippet, Slice, Annotation, and other abstractions provided by annotate-snippets. We have concluded that this additional freedom would not give us any more expressive power, but would just create additional confusion when defining diagnostics. It would also very likely lead to inconsistencies similar to those we have now (different wording, different ways to explain an issue, etc.).

Alternative 2: Evaluating and using `miette` crate

The miette crate provides a holistic library for defining, reporting, and rendering diagnostics. We have concluded that using miette would mean changing our existing infrastructure that already works well, plus losing flexibility if we decide to do things differently than envisioned by miette. The conclusion was to treat miette as an inspiration for our own API.

Prior art

Expressive diagnostics

Expressive diagnostics are gaining relevance in, or are already a significant part of, compilers like:

Rust²
GCC^5,7
Clang¹
Elm³

Microsoft is putting effort into improving diagnostics in their VC++ compiler⁵.

Java error messages, traditionally known to be terse and difficult for novices and students, are enhanced with tools like Decaf or Expresso.

Using macros for diagnostic definition

thiserror and its macros are the de facto standard for defining error messages in Rust.

miette follows the same approach for defining rich diagnostics, with its diagnostic macro:

#[diagnostic(
    code(oops::my::bad),
    url(docsrs),
    help("try doing it better next time?")
)]

Using diagnostic codes

Unique diagnostic codes can be found in many compilers including Rust, Clang, and C#. They can be used to:

point to detailed diagnostic explanation
suppress warnings

Rust takes advantage of both possibilities by offering online help for error messages and a possibility to suppress warnings by using #[allow].

Unresolved questions

The RFC has no unresolved questions. During the discussion, the following questions were resolved:

the structure of the Diagnostics API.
the usage of symbolic warning identifiers, e.g., name_is_not_idiomatic, in addition to warning codes, e.g., W0001.

Future possibilities

Once we get online documentation for diagnostics (as proposed in Sway 3512) we can extend diagnostic messages with links to a detailed explanation by adding an element like:

info: For more information see: https://.../E4001

Similar to Rust, we can also add the explain command to forc. E.g.:

forc explain E4001

Once The Sway Book and The Sway Reference become stable we can also enhance Diagnostic by adding references to the documentation. E.g.:

#[error(
    reason(1, "Constants cannot be shadowed"),
    ...
    book("Shadowing", "relative/path/to/chapter/shadowing"),
    ref("Constants", "relative/path/to/chapter/constants"),
    ...
)]

This would render additional info lines. E.g.:

info: For more information, see:
      - the chapter "Shadowing" in The Sway Book: https://.../shadowing
      - the chapter "Constants" in The Sway Reference: https://.../constants

References

[1] Expressive Diagnostics
[2] Shape of errors to come
[3] Compiler Errors for Humans
[4] Compilers as Assistants
[5] Concepts Error Messages for Humans
[6] Compiler Error Messages Considered Unhelpful: The Landscape of Text-Based Programming Error Message Research
[7] Usability improvements in GCC 8
