Tree-Sitter-Lean4
Lean4 is a programming language, commonly used for mathematical theorem proving as a proof assistant.
This project contains:
- Tree-Sitter grammar for parsing Lean 4 source code.
- Rust crate that can be used in combination with
tree-sittercrate with bindings for parsing Lean - Nix flake for using the compiled parser in other tools in the Tree-Sitter ecosystem such as AST-Grep
Important: Lean is a very extensible language. Therefore, the Tree-Sitter grammar is of limited use. For parsing advanced Lean programs you will need to use the Lean kernel. See also Metaprogramming in Lean.
Usage
Rust
Add this crate as a normal dependency in your Cargo.toml file.
During the first build, you need this:
- C code generator
tree-sitter(only if thesrc/parser.cis missing) - C compiler
cc
Then instantiate the parser:
use ;
let mut parser = new;
parser.set_language.expect;
See Tree-Sitter-Rust.
Nix Flake
This is the recommended approach for reproducible builds.
{
inputs = {
nixpkgs.url = "github:NixOS/nixpkgs/nixos-unstable";
tree-sitter-lean.url = "github:wvhulle/tree-sitter-lean";
};
outputs = { nixpkgs, tree-sitter-lean, ... }:
let
system = "x86_64-linux";
pkgs = nixpkgs.legacyPackages.${system};
in
{
devShells.${system}.default = pkgs.mkShell {
# If you want to use ast-grep (see next section)
packages = [ pkgs.ast-grep ];
shellHook = ''
# Create a symlink in the working directory for easy access (add to .gitignore)
ln -sf ${tree-sitter-lean.packages.${system}.grammar}/parser tree-sitter-lean.so
'';
};
};
}
See nixpkgs tree-sitter documentation for more Tree-Sitter with Nix examples.
Push build cache to public cache server:
|
Development
Read writing the grammar. Be careful of adding too many conflicts in the grammar as they cause exponential growth of the state space.
Use the tree-sitter CLI:
Add tests for the Tree-Sitter grammar to ./test/corpus. See writing tests
License
Based on https://github.com/Julian/lean.nvim. Grammar was completely rewritten to be simpler.
MIT