CypherQuery

Struct CypherQuery 

Source
pub struct CypherQuery { /* private fields */ }
Expand description

A Cypher query that can be executed against Lance datasets

Implementations§

Source§

impl CypherQuery

Source

pub fn new(query: &str) -> Result<Self>

Create a new Cypher query from a query string

Source

pub fn with_config(self, config: GraphConfig) -> Self

Set the graph configuration for this query

Source

pub fn with_parameter<K, V>(self, key: K, value: V) -> Self
where K: Into<String>, V: Into<Value>,

Add a parameter to the query

Source

pub fn with_parameters(self, params: HashMap<String, Value>) -> Self

Add multiple parameters to the query

Source

pub fn query_text(&self) -> &str

Get the original query text

Source

pub fn ast(&self) -> &CypherAST

Get the parsed AST

Source

pub fn config(&self) -> Option<&GraphConfig>

Get the graph configuration

Source

pub fn parameters(&self) -> &HashMap<String, Value>

Get query parameters

Source

pub async fn execute( &self, datasets: HashMap<String, RecordBatch>, strategy: Option<ExecutionStrategy>, ) -> Result<RecordBatch>

Execute the query against provided in-memory datasets

This method uses the DataFusion planner by default for comprehensive query support including joins, aggregations, and complex patterns. You can optionally specify a different execution strategy.

§Arguments
  • datasets - HashMap of table name to RecordBatch (nodes and relationships)
  • strategy - Optional execution strategy (defaults to DataFusion)
§Returns

A single RecordBatch containing the query results

§Errors

Returns error if query parsing, planning, or execution fails

§Example
use std::collections::HashMap;
use arrow::record_batch::RecordBatch;
use lance_graph::query::CypherQuery;

// Create in-memory datasets
let mut datasets = HashMap::new();
datasets.insert("Person".to_string(), person_batch);
datasets.insert("KNOWS".to_string(), knows_batch);

// Parse and execute query
let query = CypherQuery::parse("MATCH (p:Person)-[:KNOWS]->(f) RETURN p.name, f.name")?
    .with_config(config);
// Use the default DataFusion strategy
let result = query.execute(datasets, None).await?;
// Use the Simple strategy explicitly
let result = query.execute(datasets, Some(ExecutionStrategy::Simple)).await?;
Source

pub async fn explain( &self, datasets: HashMap<String, RecordBatch>, ) -> Result<String>

Explain the query execution plan using in-memory datasets

Returns a formatted string showing the query execution plan at different stages:

  • Graph Logical Plan (graph-specific operators)
  • DataFusion Logical Plan (optimized relational plan)
  • DataFusion Physical Plan (execution plan with optimizations)

This is useful for understanding query performance, debugging, and optimization.

§Arguments
  • datasets - HashMap of table name to RecordBatch (nodes and relationships)
§Returns

A formatted string containing the execution plan at multiple levels

§Errors

Returns error if planning fails

§Example
use std::collections::HashMap;
use arrow::record_batch::RecordBatch;
use lance_graph::query::CypherQuery;

// Create in-memory datasets
let mut datasets = HashMap::new();
datasets.insert("Person".to_string(), person_batch);
datasets.insert("KNOWS".to_string(), knows_batch);

let query = CypherQuery::parse("MATCH (p:Person) WHERE p.age > 30 RETURN p.name")?
    .with_config(config);

let plan = query.explain(datasets).await?;
println!("{}", plan);
Source

pub async fn to_sql( &self, datasets: HashMap<String, RecordBatch>, ) -> Result<String>

Convert the Cypher query to a DataFusion SQL string

This method generates a SQL string that corresponds to the DataFusion logical plan derived from the Cypher query. It uses the datafusion-sql unparser.

WARNING: This method is experimental and the generated SQL dialect may change.

Case Sensitivity Limitation: All table names in the generated SQL are lowercased (e.g., Person becomes person, Company becomes company), due to the internal handling of DataFusion’s SQL unparser. Note that this only affects the SQL string representation - actual query execution with execute() handles case-sensitive labels correctly.

If you need case-sensitive table names in the SQL output, consider:

  • Using lowercase labels consistently in your Cypher queries and table names
  • Post-processing the SQL string to replace table names with the correct case
§Arguments
  • datasets - HashMap of table name to RecordBatch (nodes and relationships)
§Returns

A SQL string representing the query

Source

pub async fn execute_with_context( &self, ctx: SessionContext, ) -> Result<RecordBatch>

Execute query with a DataFusion SessionContext, automatically building the catalog

This is a convenience method that builds the graph catalog by querying the SessionContext for table schemas. The GraphConfig determines which tables to look up (node labels and relationship types).

This method is ideal for integrating with DataFusion’s rich data source ecosystem (CSV, Parquet, Delta Lake, Iceberg, etc.) without manually building a catalog.

§Arguments
  • ctx - DataFusion SessionContext with pre-registered tables
§Returns

Query results as an Arrow RecordBatch

§Errors

Returns error if:

  • GraphConfig is not set (use .with_config() first)
  • Required tables are not registered in the SessionContext
  • Query execution fails
§Example
use datafusion::execution::context::SessionContext;
use datafusion::prelude::CsvReadOptions;
use lance_graph::{CypherQuery, GraphConfig};

// Step 1: Create GraphConfig
let config = GraphConfig::builder()
    .with_node_label("Person", "person_id")
    .with_relationship("KNOWS", "src_id", "dst_id")
    .build()?;

// Step 2: Register data sources in DataFusion
let ctx = SessionContext::new();
ctx.register_csv("Person", "data/persons.csv", CsvReadOptions::default()).await?;
ctx.register_parquet("KNOWS", "s3://bucket/knows.parquet", Default::default()).await?;

// Step 3: Execute query (catalog is built automatically)
let query = CypherQuery::parse("MATCH (p:Person)-[:KNOWS]->(f) RETURN p.name")?
    .with_config(config);
let result = query.execute_with_context(ctx).await?;
§Note

The catalog is built by querying the SessionContext for schemas of tables mentioned in the GraphConfig. Table names must match between GraphConfig (node labels/relationship types) and SessionContext (registered table names).

Source

pub async fn execute_with_catalog_and_context( &self, catalog: Arc<dyn GraphSourceCatalog>, ctx: SessionContext, ) -> Result<RecordBatch>

Execute query with an explicit catalog and session context

This is the most flexible API for advanced users who want to provide their own catalog implementation or have fine-grained control over both the catalog and session context.

§Arguments
  • catalog - Graph catalog containing node and relationship schemas for planning
  • ctx - DataFusion SessionContext with registered data sources for execution
§Returns

Query results as an Arrow RecordBatch

§Errors

Returns error if query parsing, planning, or execution fails

§Example
use std::sync::Arc;
use datafusion::execution::context::SessionContext;
use lance_graph::source_catalog::InMemoryCatalog;
use lance_graph::query::CypherQuery;

// Create custom catalog
let catalog = InMemoryCatalog::new()
    .with_node_source("Person", custom_table_source);

// Create SessionContext
let ctx = SessionContext::new();
ctx.register_table("Person", custom_table).unwrap();

// Execute with explicit catalog and context
let query = CypherQuery::parse("MATCH (p:Person) RETURN p.name")?
    .with_config(config);
let result = query.execute_with_catalog_and_context(Arc::new(catalog), ctx).await?;
Source

pub async fn execute_simple( &self, datasets: HashMap<String, RecordBatch>, ) -> Result<RecordBatch>

Execute simple single-table queries (legacy implementation)

This method supports basic projection/filter/limit workflows on a single table. For full query support including joins and complex patterns, use execute() instead.

Note: This implementation is retained for backward compatibility and simple use cases.

Source

pub fn referenced_node_labels(&self) -> Vec<String>

Get all node labels referenced in this query

Source

pub fn referenced_relationship_types(&self) -> Vec<String>

Get all relationship types referenced in this query

Source

pub fn variables(&self) -> Vec<String>

Get all variables used in this query

Trait Implementations§

Source§

impl Clone for CypherQuery

Source§

fn clone(&self) -> CypherQuery

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for CypherQuery

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

Source§

impl<T> ErasedDestructor for T
where T: 'static,