[AST] AutoType within AutoTypeLoc is missing deduced type. #42259

sam-mccall · 2019-08-07T13:13:34Z


Bugzilla Link	42914
Version	trunk
OS	Linux
CC	@gislan,@mizvekov,@zygoloid,@HighCommander4

Extended Description

the program auto x = 4; has an AST that looks like:

VarDecl type=T1

typeloc type=T2
integerliteral

T1 is correctly an auto type that wraps int.
However T2 is undeduced: it's ASTContext::getAutoDeductType().

This irregularity makes it harder/slower to write tools that (e.g.) care what an auto under the cursor expands to.

The text was updated successfully, but these errors were encountered:

HighCommander4 · 2020-01-02T07:21:34Z

I investigated this a bit. What seems to be happening is:

At the time the VarDecl node is created, we haven't seen the
initializer (and thus computed the deduced type) yet, so
we set the VarDecl's type to an empty DeducedType. The
TypeSourceInfo stored in the DeclaratorDecl is set based on
this as well.
Later, we encounter the initializer and compute the deduced
type. This happens in Sema::DeduceVariableDeclarationType().
We update the deduced type on the VarDecl via
ValueDecl::setType() [1]. This updates ValueDecl::DeclType,
but it does not update DeclaratorDecl::DeclInfo, which
continues to point to the TypeSourceInfo wrapping the old,
empty DeducedType.
When RecursiveASTVisitor (which is what clangd uses to
get at the TypeLoc) encounters a DeclaratorDecl, it prefers
to get the TypeLoc via the TypeSourceInfo if there is one [2],
only falling back to ValueDecl::DeclType otherwise.
Therefore, it gives us the undeduced, empty TypeLoc.

[1]

llvm-project/clang/lib/Sema/SemaDecl.cpp

Line 11353 in a2976c4

VDecl->setType(DeducedType);

[2]

llvm-project/clang/include/clang/AST/RecursiveASTVisitor.h

Line 1950 in a2976c4

TRY_TO(TraverseTypeLoc(D->getTypeSourceInfo()->getTypeLoc()));

HighCommander4 · 2020-01-02T07:24:28Z

Naively, I would think the solution ought to be that, in addition to calling VDecl->setType(), Sema::DeduceVariableDeclarationType() should also call VDecl->setTypeSourceInfo().

However, I don't at this stage understand how these types (Type / TypeLoc / TypeSourceInfo) interact well enough to know how to construct a TypeSourceInfo in that place.

sam-mccall · 2020-01-02T14:37:46Z

I heard that Ilya Biryukov and Richard Smith had a conversation about this bug, and it wasn't totally trivial.
But I'm not sure that was written down anywhere, it'd be good to get Richard's take (even if second-hand) on what needs to happen here.

zygoloid · 2020-01-10T22:47:23Z

A couple of issues:

For functions with deduced return types, it's important that the AST representation for the TypeSourceInfo does not contain the deduced type. Redeclaration matching requires that we preserve the non-deduced auto in the declared return type. So tooling that wants to inspect auto types needs to be able to look into the type, not just the type source info, in general.
For variables with deduced types, it's important to serialization that we don't include the deduced type in the type source info. We load the type source info early (before merging), and doing so must not deserialize (say) a lambda used to initialize the variable, because that would introduce a deserialization cycle. We could perhaps avoid that for variables, by removing the type-checks we perform when merging declarations of variables across modules (those checks are strictly-speaking wrong anyway, since they mean we won't merge 'extern int n;' with 'auto n = 0;').

So, I think we could change our model so that we store the deduced type in the TypeSourceInfo of a variable. It would require some contortions in deserialization, and perhaps elsewhere. But I don't think we want to change our model for functions, where it's important that we preserve the declared type. So far, the consistency argument (the TypeSourceInfo should always represent the declared type) has won out over the convenience-for-tooling argument, especially given that tooling will need to deal with the deduced-type-is-not-in-the-TSI case anyway to properly handle functions with deduced return types. But perhaps someone can come up with a smart design that gives us the best of both worlds (perhaps we could represent the auto type in the TSI as a type that is canonically an undeduced auto but that tracks the deduced type via some type sugar?)

HighCommander4 · 2020-01-10T23:13:32Z

especially given that tooling will need to deal with the
deduced-type-is-not-in-the-TSI case anyway to properly handle functions
with deduced return types.

Note, it's RecursiveASTVisitor that ignores the type and visits the TSI in the case where we have a TSI.

Would it be reasonable to change the behaviour of RecursiveASTVisitor here? The actual tooling use cases (at least, the ones we've come across in clangd that I'm aware of) should "just work" if RAV visited the deduced type.

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AST] AutoType within AutoTypeLoc is missing deduced type. #42259

[AST] AutoType within AutoTypeLoc is missing deduced type. #42259

sam-mccall commented Aug 7, 2019

HighCommander4 commented Jan 2, 2020

HighCommander4 commented Jan 2, 2020

sam-mccall commented Jan 2, 2020

zygoloid mannequin commented Jan 10, 2020

HighCommander4 commented Jan 10, 2020

[AST] AutoType within AutoTypeLoc is missing deduced type. #42259

[AST] AutoType within AutoTypeLoc is missing deduced type. #42259

Comments

sam-mccall commented Aug 7, 2019

Extended Description

HighCommander4 commented Jan 2, 2020

HighCommander4 commented Jan 2, 2020

sam-mccall commented Jan 2, 2020

zygoloid mannequin commented Jan 10, 2020

HighCommander4 commented Jan 10, 2020