Data Extraction: Declarations, Tactics, and InfoTrees¶

LeanInteract makes it easy to extract rich data from elaboration, including declarations, tactics, and detailed InfoTrees.

Declarations¶

Set declarations=True to retrieve a list of DeclarationInfo for each declaration introduced in your Lean code. Full details about the fields in DeclarationInfo can be found in the API reference.

from lean_interact import LeanServer, LeanREPLConfig, Command
from lean_interact.interface import CommandResponse

code = """
theorem ex (n : Nat) : n = 5 → n = 5 := by
  intro h; exact h
"""

server = LeanServer(LeanREPLConfig())
res = server.run(Command(cmd=code, declarations=True))
assert isinstance(res, CommandResponse)
for d in res.declarations:
    print(f"Full name: `{d.full_name}`")
    print(f"Kind: `{d.kind}`")
    print(f"Signature: `{d.signature}`")
    print(f"Value: `{d.value}`")
    print(f"Binders: `{d.binders}`")

Full name: `ex`
Kind: `theorem`
Signature: `DeclSignature(pp='(n : Nat) : n = 5 → n = 5', constants=['n', 'Nat'], range=Range(start=Pos(line=2, column=11), synthetic=False, finish=Pos(line=2, column=36)))`
Value: `DeclValue(pp=':= by\n  intro h; exact h', constants=['h'], range=Range(start=Pos(line=2, column=37), synthetic=False, finish=Pos(line=3, column=18)))`
Binders: `DeclBinders(pp='(n : Nat)', groups=['(n : Nat)'], map=[BinderView(id='n', binderInfo='default', type='Nat')], range=Range(start=Pos(line=2, column=11), synthetic=False, finish=Pos(line=2, column=20)))`

For files:

from lean_interact import FileCommand
res = server.run(FileCommand(path="myfile.lean", declarations=True))

Tip: See examples/extract_mathlib_decls.py for a scalable, per-file parallel extractor over Mathlib.

Tactics¶

Use all_tactics=True to collect tactic applications with their goals and used constants.

resp = server.run(Command(cmd=code, all_tactics=True))
for t in resp.tactics:
    print(t.tactic, "::: used:", t.used_constants)

intro h ::: used: ['instOfNatNat', 'Nat', 'OfNat.ofNat', 'Eq']
exact h ::: used: []

InfoTrees¶

Request infotree to obtain structured elaboration information. Accepted values include "full", "tactics", "original", and "substantive". See InfoTreeOptions for details.

from lean_interact.interface import InfoTree, InfoTreeOptions

res = server.run(Command(cmd=code, infotree=InfoTreeOptions.full))
trees: list[InfoTree] = res.infotree or []

# Example: iterate over all command-level nodes and print their kind
for tree in trees:
    for cmd_node in tree.commands():
        print(cmd_node.kind, cmd_node.node.stx)

CommandInfo Syntax(kind='Lean.Parser.Command.declaration', pp='theorem ex (n : Nat) : n = 5 → n = 5 := by intro h; exact h', range=Range(start=Pos(line=2, column=0), synthetic=False, finish=Pos(line=3, column=18)), arg_kinds=['Lean.Parser.Command.declModifiers', 'Lean.Parser.Command.theorem'])
CommandInfo Syntax(kind='Lean.Parser.Command.eoi', pp='', range=Range(start=Pos(line=4, column=0), synthetic=False, finish=Pos(line=4, column=0)), arg_kinds=['«»'])

Root goals and messages¶

You can also ask for root_goals=True to retrieve initial goals for declarations (even if already proved).

print(server.run(Command(cmd=code, root_goals=True)))

CommandResponse(env=3, sorries=[Sorry(start_pos=Pos(line=2, column=40), end_pos=Pos(line=2, column=40), proof_state=2, goal='n : Nat\n⊢ n = 5 → n = 5')])