1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713
//! Allows programmatically invoking parol from a `build.rs` script
//!
//! The process of invoking a grammar starts with a [`struct@Builder`] and one of two output modes:
//! 1. Cargo build script output mode, via [Builder::with_cargo_script_output] (easiest)
//! 2. Explicitly specifying an output directory via [Builder::with_explicit_output_dir]
//!
//! ## Cargo integration
//! If this API detects it is running inside a
//! [Cargo `build.rs` script](https://doc.rust-lang.org/stable/cargo/reference/build-scripts.html),
//! then it implicitly enables cargo integration.
//!
//! This has Cargo *automatically* regenerate the parser sources whenever the grammar changes. This
//! is done by implicitly outputting the appropriate
//! [`rerun-if-changed=<grammar>`](https://doc.rust-lang.org/stable/cargo/reference/build-scripts.html#change-detection)
//! instructions to Cargo.
//!
//! ### Defaults
//! When using [`Builder::with_cargo_script_output`], a number of reasonable defaults are set:
//!
//! By default, the output directory is set to the `OUT_DIR` environment variable.
//! By default, the generated parser name is `parser.rs` and the generated grammar action file is `
//!
//! You can
//! ```ignore
//! mod parser {
//! include!(concat!(env!("OUT_DIR"), "/parser.rs"));
//! }
//! ```
//!
//! ### Tradeoffs
//! The disadvantage of using this mode (or using Cargo build scripts in general),
//! is that it adds the `parol` crate as an explicit build dependency.
//!
//! Although this doesn't increase the runtime binary size, it does increase the initial compile
//! times.
//! If someone just wants to `cargo install <your crate>`, Cargo will have to download and execute
//! `parol` to generate your parser code.
//!
//! Contributors to your project (who modify your grammar) will have to download and invoke parol
//! anyways, so this cost primarily affects initial compile times. Also cargo is very intelligent
//! about caching build script outputs.
//!
//! Despite the impact on initial compiles, this is somewhat traditional in the Rust community.
//! It's [the recommended way to use `bindgen`](https://rust-lang.github.io/rust-bindgen/library-usage.html)
//! and it's the only way to use [`pest`](https://pest.rs/).
//!
//! If you are really concerned about compile times, you can use explicit output (below) to avoid
//! invoking pest.
//!
//! ## Explicitly controlling Output Locations
//! If you want more control over the location of generated grammar files,
//! you can invoke [`Builder::with_explicit_output_dir`] to explicitly set an output directory.
//!
//! In addition you must explicitly name your output parser and action files,
//! or the configuration will give an error.
//!
//! This is used to power the command line `parol` tool, and is useful for additional control.
//!
//! Any configured *output* paths (including generated parsers, expanded grammars, etc)
//! are resolved relative to this base output using [Path::join]. This means that specifying
//! absolute paths overrides this explicit base directory.
//!
//! The grammar input file is resolved in the regular manner.
//! It does not use the "output" directory.
//!
//! ### Interaction with version control
//! When using [`Builder::with_cargo_script_output`], the output is put in a subdir of the `target`
//! directory and excluded from version control.
//!
//! This is useful if you want to ignore changes in generated code.
//!
//! However, when specifying an explicit output directory (with [`Builder::with_explicit_output_dir`]),
//! you may have to include the generated sources explicitly into the build process. One way is
//! indicated above where the include! macro is used.
//!
//! Otherwise, you would probably set the output to a sub-directory of `src`.
//! This means that files are version controlled and you would have to commit them whenever changes
//! are made.
//!
//! ## Using the CLI directly
//! Note that explicitly specifying the output directory doesn't avoid running parol on `cargo
//! install`.
//!
//! It does not increase the initial build speed, and still requires compiling and invoking `parol`.
//!
//! If you really want to avoid adding `parol` as a build dependency,
//! you need to invoke the CLI manually to generate the parser sources ahead of time.
//!
//! Using a build script requires adding a build dependency, and cargo will unconditionally execute
//! build scripts on first install.
//! While Cargo's build script caching is excellent, it only activates on recompiles.
//!
//! As such, using the CLI manually is really the only way to improve (initial) compile times.
//!
//! It is (often) not worth it, because it is inconvenient, and the impact only happens on *initial* compiles.
//!
//! ## API Completeness
//! Anything you can do with the main `parol` executable, you should also be able to do with this API.
//!
//! That is because the main executable is just a wrapper around the API
//!
//! However, a couple more advanced features use unstable/internal APIs (see below).
//!
//! As a side note, the CLI does not require you to specify an output location.
//! You can run `parol -f grammar.parol` just fine and it will generate no output.
//!
//! In build scripts, this is typically a mistake (so it errors by default).
//! If you want to disable this sanity check, use [`Builder::disable_output_sanity_checks`]
//!
//! ### Internal APIs
//! The main `parol` command needs a couple of features that do not fit nicely into this API (or interact closely with the crate's internals).
//!
//!
//! Because of that, there are a number of APIs explicitly marked as unstable or internal.
//! Some of these are public and some are private.
//!
//! Expect breaking changes both before and after 1.0 (but especially before).
#![deny(
missing_docs, // Building should be documented :)
)]
use std::collections::BTreeMap;
use std::convert::TryFrom;
use std::path::{Path, PathBuf};
use std::{env, fs};
use crate::parser::parol_grammar::Production;
use crate::{
GrammarConfig, GrammarTypeInfo, LookaheadDFA, ParolGrammar, UserTraitGeneratorBuilder, MAX_K,
};
use clap::{Parser, ValueEnum};
use id_tree::Tree;
use parol_macros::parol;
use parol_runtime::{ParseTreeType, Result};
/// Contains all attributes that should be inserted optionally on top of the generated trait source.
/// * Used in the Builder API. Therefore it mus be public
#[derive(Clone, Debug, Parser, ValueEnum)]
pub enum InnerAttributes {
/// Suppresses clippy warnings like these: `warning: this function has too many arguments (9/7)`
AllowTooManyArguments,
}
impl std::fmt::Display for InnerAttributes {
fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
match self {
InnerAttributes::AllowTooManyArguments => {
write!(f, "#![allow(clippy::too_many_arguments)]")
}
}
}
}
/// The default maximum lookahead
///
/// This is used both for the CLI and for the builder.
pub const DEFAULT_MAX_LOOKAHEAD: usize = 5;
/// The default name of the generated grammar module.
pub const DEFAULT_MODULE_NAME: &str = "grammar";
/// The default name of the user type that implements grammar parsing.
pub const DEFAULT_USER_TYPE_NAME: &str = "Grammar";
fn is_build_script() -> bool {
// Although only `OUT_DIR` is necessary for our purposes, it's possible someone else set it.
// Check for a second one to make sure we're actually running under cargo
// See full list of environment variables here: https://is.gd/K6LyzQ
env::var_os("OUT_DIR").is_some() && env::var_os("CARGO_MANIFEST_DIR").is_some()
}
/// Builds the configuration for generating and analyzing `parol` grammars.
///
/// A grammar file is required for almost all possible operations (set with [Builder::grammar_file])
///
/// Does not actually generate anything until finished.
#[derive(Clone)]
pub struct Builder {
/// The base output directory
output_dir: PathBuf,
grammar_file: Option<PathBuf>,
/// Output file for expanded grammar
expanded_grammar_output_file: Option<PathBuf>,
/// Output file for the generated parser source
parser_output_file: Option<PathBuf>,
/// Output file for the generated actions files.
actions_output_file: Option<PathBuf>,
user_type_name: String,
module_name: String,
cargo_integration: bool,
max_lookahead: usize,
/// By default, we want to require that the parser output file is specified.
/// Otherwise we're just wasting time outputting to /dev/null.
///
/// The CLI needs to be able to override this (mostly for debugging), hence the option.
output_sanity_checks: bool,
/// Enables auto-generation of expanded grammar's semantic actions - experimental
auto_generate: bool,
/// Internal debugging for CLI.
debug_verbose: bool,
/// Generate range information for AST types
range: bool,
/// Inner attributes to insert at the top of the generated trait source.
inner_attributes: Vec<InnerAttributes>,
/// Used for auto generation of user's grammar semantic action trait
productions: Vec<Production>,
}
impl Builder {
/// Create a new builder fr use in a Cargo build script (`build.rs`).
///
/// This is the recommended default way to get started.
///
/// All the outputs are set relative to the `OUT_DIR` environment variable,
/// as is standard for [Cargo build script outputs](https://doc.rust-lang.org/stable/cargo/reference/build-scripts.html#outputs-of-the-build-script).
///
/// This sets sensible defaults for every output file name.
///
/// | Method name | CLI Option | Default (relative) name |
/// | -------------------------------|----------------------|-------------------------|
/// | `parser_output_file` | `--parser` or `-p` | "parser.rs" |
/// | `actions_output_file` | `--actions` or `-a` | "grammar_trait.rs" |
/// | `expanded_grammar_output_file` | `--expanded` or `-e` | "grammar-exp.par" |
///
///
/// See the module documentation for how to include these files into your project.
///
/// Panics if used outside of a cargo build script.
pub fn with_cargo_script_output() -> Self {
assert!(is_build_script(), "Cannot use outside of a cargo script");
// Don't worry! $OUT_DIR is unique for every
let out_dir = env::var_os("OUT_DIR").unwrap();
let mut builder = Self::with_explicit_output_dir(out_dir);
// Set those reasonable defaults we promised
builder
.parser_output_file("parser.rs")
.actions_output_file("grammar_trait.rs")
.expanded_grammar_output_file("grammar-exp.par");
// Cargo integration should already be enabled (because we are a build script)
assert!(builder.cargo_integration);
builder
}
/// Internal utility to resolve a path relative to the output directory
fn resolve_output_path(&self, p: impl AsRef<Path>) -> PathBuf {
self.output_dir.join(p)
}
/// Create a new builder with an explicitly specified output directory.
///
/// This requires that output files be specified explicitly,
/// unless this check is disabled with [`Builder::disable_output_sanity_checks`]
///
/// If this detects running inside a build script,
/// it will automatically enable cargo integration.
///
/// If output files are specified using absolute paths,
/// it overrides this explicit output dir.
///
/// See module docs on "explicit output mode" for more details.
pub fn with_explicit_output_dir(output: impl AsRef<Path>) -> Self {
/*
* Most of these correspond to CLI options.
*/
Builder {
output_dir: PathBuf::from(output.as_ref()),
grammar_file: None,
cargo_integration: is_build_script(),
debug_verbose: false,
range: false,
max_lookahead: DEFAULT_MAX_LOOKAHEAD,
module_name: String::from(DEFAULT_MODULE_NAME),
user_type_name: String::from(DEFAULT_USER_TYPE_NAME),
// In this mode, the user must specify explicit outputs.
// The default is /dev/null (`None`)
parser_output_file: None,
actions_output_file: None,
expanded_grammar_output_file: None,
auto_generate: false,
inner_attributes: Vec::new(),
// By default, we require that output files != /dev/null
output_sanity_checks: true,
productions: Vec::new(),
}
}
/// By default, we require that the generated parser and action files are not discarded.
///
/// This disables that check (used for the CLI).
///
/// NOTE: When using [`Builder::with_cargo_script_output`], these are automatically inferred.
pub fn disable_output_sanity_checks(&mut self) -> &mut Self {
self.output_sanity_checks = false;
self
}
/// Set the output location for the generated parser.
///
/// If you are using [Builder::with_cargo_script_output],
/// the default output is "$OUT_DIR/parser.rs".
///
/// If you are using an explicitly specified output directory, then this option is *required*.
pub fn parser_output_file(&mut self, p: impl AsRef<Path>) -> &mut Self {
self.parser_output_file = Some(self.resolve_output_path(p));
self
}
/// Set the actions output location for the generated parser.
///
/// If you are using [Builder::with_cargo_script_output],
/// the default output is "$OUT_DIR/grammar_trait.rs".
///
/// If you are using an explicitly specified output directory, then this option is *required*.
pub fn actions_output_file(&mut self, p: impl AsRef<Path>) -> &mut Self {
self.actions_output_file = Some(self.resolve_output_path(p));
self
}
/// Set the actions output location for the generated parser.
///
/// If you are using [Builder::with_cargo_script_output],
/// the default output is "$OUT_DIR/grammar-exp.par".
///
/// Otherwise, this is ignored.
pub fn expanded_grammar_output_file(&mut self, p: impl AsRef<Path>) -> &mut Self {
self.expanded_grammar_output_file = Some(self.resolve_output_path(p));
self
}
/// Explicitly enable/disable cargo integration.
///
/// This is automatically set to true if you are running a build script,
/// and is `false` otherwise.
pub fn set_cargo_integration(&mut self, enabled: bool) -> &mut Self {
self.cargo_integration = enabled;
self
}
/// Set the grammar file used as input for parol.
///
/// This is required for most operations.
///
/// Does not check that the file exists.
pub fn grammar_file(&mut self, grammar: impl AsRef<Path>) -> &mut Self {
self.grammar_file = Some(PathBuf::from(grammar.as_ref()));
self
}
/// Set the name of the user type that implements the language processing
pub fn user_type_name(&mut self, name: &str) -> &mut Self {
self.user_type_name = name.into();
self
}
/// Set the name of the user module that implements the language processing
///
/// This is the module that contains the [Self::user_type_name]
pub fn user_trait_module_name(&mut self, name: &str) -> &mut Self {
self.module_name = name.into();
self
}
/// Set the maximum lookahead for the generated parser.
///
/// If nothing is specified, the default lookahead is [DEFAULT_MAX_LOOKAHEAD].
///
/// Returns a [BuilderError] if the lookahead is greater than [crate::MAX_K].
pub fn max_lookahead(&mut self, k: usize) -> std::result::Result<&mut Self, BuilderError> {
if k > MAX_K {
return Err(BuilderError::LookaheadTooLarge);
}
self.max_lookahead = k;
Ok(self)
}
/// Debug verbose information to the standard output
///
/// This is an internal method, and is only intended for the CLI.
#[doc(hidden)]
pub fn debug_verbose(&mut self) -> &mut Self {
self.debug_verbose = true;
self
}
/// Generate range information for AST types
///
pub fn range(&mut self) -> &mut Self {
self.range = true;
self
}
/// Inserts the given inner attributes at the top of the generated trait source.
pub fn inner_attributes(&mut self, inner_attributes: Vec<InnerAttributes>) -> &mut Self {
self.inner_attributes = inner_attributes;
self
}
/// Enables the auto-generation of expanded grammar's semantic actions - experimental
///
pub fn enable_auto_generation(&mut self) -> &mut Self {
self.auto_generate = true;
self
}
/// Begin the process of generating the grammar
/// using the specified listener (or None if no listener is desired).
///
/// Returns an error if the build is *configured* incorrectly.
/// In a build script, this is typically a programmer error.
pub fn begin_generation_with<'l>(
&mut self,
listener: Option<&'l mut dyn BuildListener>,
) -> std::result::Result<GrammarGenerator<'l>, BuilderError> {
/*
* For those concerned about performance:
*
* The overhead of all these copies and dyn dispatch is marginal
* in comparison to the actual grammar generation.
*/
let grammar_file = self
.grammar_file
.as_ref()
.ok_or(BuilderError::MissingGrammarFile)?
.clone();
if self.output_sanity_checks {
// Check that we have outputs
if self.parser_output_file.is_none() {
return Err(BuilderError::MissingParserOutputFile);
} else if self.actions_output_file.is_none() {
return Err(BuilderError::MissingActionOutputFile);
}
// Missing expanded grammar file is fine. They might not want that.
}
Ok(GrammarGenerator {
listener: MaybeBuildListener(listener),
grammar_file,
builder: self.clone(),
state: None,
grammar_config: None,
lookahead_dfa_s: None,
})
}
/// Generate the parser, writing it to the pre-configured output files.
pub fn generate_parser(&mut self) -> Result<()> {
self.begin_generation_with(None)
.map_err(|e| parol!("Misconfigured parol generation: {}", e))?
.generate_parser()
}
}
/// Represents in-process grammar generation.
///
/// Most of the time you will want to use [Builder::generate_parser] to bypass this completely.
///
/// This is an advanced API, and unless stated otherwise, all its methods are unstable (see module docs).
///
/// The lifetime parameter `'l` refers to the lifetime of the optional listener.
pub struct GrammarGenerator<'l> {
/// The build listener
///
/// This is a fairly advanced feature
listener: MaybeBuildListener<'l>,
grammar_file: PathBuf,
builder: Builder,
state: Option<State>,
grammar_config: Option<GrammarConfig>,
lookahead_dfa_s: Option<BTreeMap<String, LookaheadDFA>>,
}
impl GrammarGenerator<'_> {
/// Generate the parser, writing it to the pre-configured output files.
pub fn generate_parser(&mut self) -> Result<()> {
self.parse()?;
self.expand()?;
self.post_process()?;
self.write_output()?;
Ok(())
}
//
// Internal APIs
//
#[doc(hidden)]
pub fn parse(&mut self) -> Result<()> {
assert_eq!(self.state, None);
let input = fs::read_to_string(&self.grammar_file).map_err(|e| {
parol!(
"Can't read grammar file {}: {}",
self.grammar_file.display(),
e
)
})?;
if self.builder.cargo_integration {
println!("cargo:rerun-if-changed={}", self.grammar_file.display());
}
let mut parol_grammar = ParolGrammar::new();
let syntax_tree = crate::parser::parse(&input, &self.grammar_file, &mut parol_grammar)?;
self.builder.productions = parol_grammar.productions.clone();
self.listener
.on_initial_grammar_parse(&syntax_tree, &parol_grammar)?;
self.grammar_config = Some(GrammarConfig::try_from(parol_grammar)?);
self.state = Some(State::Parsed);
Ok(())
}
#[doc(hidden)]
pub fn expand(&mut self) -> Result<()> {
assert_eq!(self.state, Some(State::Parsed));
let grammar_config = self.grammar_config.as_mut().unwrap();
// NOTE: it's up to the listener to add appropriate error context
self.listener
.on_intermediate_grammar(IntermediateGrammar::Untransformed, &*grammar_config)?;
let cfg = crate::check_and_transform_grammar(&grammar_config.cfg)?;
// To have at least a preliminary version of the expanded grammar,
// even when the next checks fail, we write out the expanded grammar here.
// In most cases it will be overwritten further on.
if let Some(ref expanded_file) = self.builder.expanded_grammar_output_file {
fs::write(
expanded_file,
crate::render_par_string(grammar_config, /* add_index_comment */ true)?,
)
.map_err(|e| parol!("Error writing left-factored grammar! {}", e))?;
}
// Exchange original grammar with transformed one
grammar_config.update_cfg(cfg);
self.listener
.on_intermediate_grammar(IntermediateGrammar::Transformed, &*grammar_config)?;
if let Some(ref expanded_file) = self.builder.expanded_grammar_output_file {
fs::write(
expanded_file,
crate::render_par_string(grammar_config, /* add_index_comment */ true)?,
)
.map_err(|e| parol!("Error writing left-factored grammar!: {}", e))?;
}
self.state = Some(State::Expanded);
Ok(())
}
#[doc(hidden)]
pub fn post_process(&mut self) -> Result<()> {
assert_eq!(self.state, Some(State::Expanded));
let grammar_config = self.grammar_config.as_mut().unwrap();
self.lookahead_dfa_s = Some(
crate::calculate_lookahead_dfas(grammar_config, self.builder.max_lookahead).map_err(
|e| parol!("Lookahead calculation for the given grammar failed!: {}", e),
)?,
);
if self.builder.debug_verbose {
print!(
"Lookahead DFAs:\n{:?}",
self.lookahead_dfa_s.as_ref().unwrap()
);
}
// Update maximum lookahead size for scanner generation
grammar_config.update_lookahead_size(
self.lookahead_dfa_s
.as_ref()
.unwrap()
.iter()
.max_by_key(|(_, dfa)| dfa.k)
.unwrap()
.1
.k,
);
if self.builder.debug_verbose {
print!("\nGrammar config:\n{:?}", grammar_config);
}
self.state = Some(State::PostProcessed);
Ok(())
}
#[doc(hidden)]
pub fn write_output(&mut self) -> Result<()> {
assert_eq!(self.state, Some(State::PostProcessed));
let grammar_config = self.grammar_config.as_mut().unwrap();
let lexer_source = crate::generate_lexer_source(grammar_config)
.map_err(|e| parol!("Failed to generate lexer source!: {}", e))?;
let user_trait_generator = UserTraitGeneratorBuilder::default()
.user_type_name(self.builder.user_type_name.clone())
.module_name(&self.builder.module_name)
.auto_generate(self.builder.auto_generate)
.range(self.builder.range)
.inner_attributes(self.builder.inner_attributes.clone())
.productions(self.builder.productions.clone())
.grammar_config(grammar_config)
.build()
.unwrap();
let mut type_info: GrammarTypeInfo =
GrammarTypeInfo::try_new(&self.builder.user_type_name)?;
let user_trait_source = user_trait_generator.generate_user_trait_source(&mut type_info)?;
if let Some(ref user_trait_file_out) = self.builder.actions_output_file {
fs::write(user_trait_file_out, user_trait_source)
.map_err(|e| parol!("Error writing generated user trait source!: {}", e))?;
crate::try_format(user_trait_file_out)?;
} else if self.builder.debug_verbose {
println!("\nSource for semantic actions:\n{}", user_trait_source);
}
let ast_type_has_lifetime = type_info.symbol_table.has_lifetime(type_info.ast_enum_type);
let parser_source = crate::generate_parser_source(
grammar_config,
&lexer_source,
self.builder.auto_generate,
&self.builder.user_type_name,
&self.builder.module_name,
self.lookahead_dfa_s.as_ref().unwrap(),
ast_type_has_lifetime,
)?;
if let Some(ref parser_file_out) = self.builder.parser_output_file {
fs::write(parser_file_out, parser_source)
.map_err(|e| parol!("Error writing generated lexer source!: {}", e))?;
crate::try_format(parser_file_out)?;
} else if self.builder.debug_verbose {
println!("\nParser source:\n{}", parser_source);
}
self.state = Some(State::Finished);
Ok(())
}
}
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum State {
Parsed,
Expanded,
PostProcessed,
Finished,
}
/// A build listener, for advanced customization of the parser generation.
///
/// This is used by the CLI to implement some of its more advanced options (without cluttering up the main interface).
///
/// The details of this trait are considered unstable.
#[allow(
unused_variables, // All these variables are going to be unused because these are NOP impls....
missing_docs, // This is fine because this is internal.
)]
pub trait BuildListener {
fn on_initial_grammar_parse(
&mut self,
syntax_tree: &Tree<ParseTreeType>,
grammar: &ParolGrammar,
) -> Result<()> {
Ok(())
}
fn on_intermediate_grammar(
&mut self,
stage: IntermediateGrammar,
config: &GrammarConfig,
) -> Result<()> {
Ok(())
}
}
#[derive(Default)]
struct MaybeBuildListener<'l>(Option<&'l mut dyn BuildListener>);
impl<'l> BuildListener for MaybeBuildListener<'l> {
fn on_initial_grammar_parse(
&mut self,
syntax_tree: &Tree<ParseTreeType>,
grammar: &ParolGrammar,
) -> Result<()> {
if let Some(ref mut inner) = self.0 {
inner.on_initial_grammar_parse(syntax_tree, grammar)
} else {
Ok(())
}
}
fn on_intermediate_grammar(
&mut self,
stage: IntermediateGrammar,
config: &GrammarConfig,
) -> Result<()> {
if let Some(ref mut inner) = self.0 {
inner.on_intermediate_grammar(stage, config)
} else {
Ok(())
}
}
}
/// Marks an intermediate stage of the grammar, in between the various transformations that parol does.
///
/// The last transformation is returned by [IntermediateGrammar::LAST]
///
/// This enum gives some degree of access to the individual transformations that parol does.
/// As such, the specific variants are considered unstable.
#[non_exhaustive]
#[derive(Copy, Clone, Debug, PartialEq, Eq, PartialOrd, Ord)]
pub enum IntermediateGrammar {
/// Writes the untransformed parsed grammar
///
/// NOTE: This is different then the initially parsed syntax tree
Untransformed,
/// Writes the transformed parsed grammar
Transformed,
}
impl IntermediateGrammar {
/// The last transformation.
pub const LAST: IntermediateGrammar = IntermediateGrammar::Transformed;
}
/// An error that occurs configuring the [struct@Builder].
#[derive(Debug, thiserror::Error)]
#[non_exhaustive]
pub enum BuilderError {
/// Indicates that the operation needs a grammar file as input,
/// but that one has not been specified.
#[error("Missing an input grammar file")]
MissingGrammarFile,
/// Indicates that no parser output file has been specified.
///
/// This would discard the generated parser, which is typically a mistake.
#[error("No parser output file specified")]
MissingParserOutputFile,
/// Indicates that no parser output file has been specified.
///
/// This would discard the generated parser, which is typically a mistake.
#[error("No action output file specified")]
MissingActionOutputFile,
/// Indicates that the specified lookahead is too large
#[error("Maximum lookahead is {}", MAX_K)]
LookaheadTooLarge,
}