align_spans | Converts the spans defined in text to those defined in original_text.
|
align_spans_by_mapping | Converts the spans by the given mapping.
Generally speaking, the character correspondence between two texts is not
necessarily surjective or injective, and may not even be a mathematical map:
a character in textA may have no correspondence in textB, or may have
multiple correspondences in textB. Thus, mapping should be provided as
Vec<Vec<Span>>.
|
get_original_spans | Returns the span indices of original_text from the tokens based on the
shortest edit script (SES).
|
lift_span_index | Converts span indices to target_spans-based indices.
Expects target_spans to be sorted and non-overlapping.
|
lift_spans_index | Converts the indices of multiple spans to target_spans-based indices.
|
remove_span_overlaps | Removes overlapping spans from the given spans.
Earlier and longer spans take priority: if two spans overlap, the one that
starts first is kept. If two spans overlap and start at the same position,
the longer one is kept.
|
remove_span_overlaps_idx | Removes overlapping spans from the given spans and returns the indices of the spans that are kept.
Earlier and longer spans take priority: if two spans overlap, the one that
starts first is kept. If two spans overlap and start at the same position,
the longer one is kept.
|
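
The priority rule described for remove_span_overlaps and remove_span_overlaps_idx can be illustrated with a minimal sketch. This is not the crate's implementation: the Span alias (a half-open (start, end) pair of usize) and the sketch_ function names are assumptions made for illustration only.

```rust
use std::cmp::Reverse;

// Assumption: a span is a half-open (start, end) pair of indices.
type Span = (usize, usize);

/// Hypothetical helper: returns the indices (into `spans`) of the spans
/// that survive overlap removal.
fn sketch_remove_span_overlaps_idx(spans: &[Span]) -> Vec<usize> {
    let mut order: Vec<usize> = (0..spans.len()).collect();
    // Earlier start first; for equal starts, the longer span (larger end) first.
    order.sort_by_key(|&i| (spans[i].0, Reverse(spans[i].1)));

    let mut kept = Vec::new();
    let mut cursor = 0; // end position of the most recently kept span
    for i in order {
        let (start, end) = spans[i];
        // Keep a span only if it starts at or after the end of the last kept one.
        if start >= cursor {
            kept.push(i);
            cursor = end;
        }
    }
    kept
}

/// Hypothetical helper: same rule, but returns the surviving spans themselves.
fn sketch_remove_span_overlaps(spans: &[Span]) -> Vec<Span> {
    sketch_remove_span_overlaps_idx(spans)
        .into_iter()
        .map(|i| spans[i])
        .collect()
}
```

Under these assumptions, the input [(0, 2), (0, 3), (2, 4), (5, 7)] keeps (0, 3) over (0, 2) because it is longer at the same start, drops (2, 4) because it overlaps (0, 3), and keeps (5, 7). The sketch therefore returns the spans [(0, 3), (5, 7)] and the indices [1, 3].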