| align_spans | Converts the spans defined in text to those defined in original_text.
|
| align_spans_by_mapping | Converts the spans by the given mapping.
Generally speaking, the character correspondence between two texts is not
necessarily surjective, not injective, not even a methematical map - some
character in textA may not have a correspondence in textB, or may have
multiple correspondences in textB. Thus, mapping should be provided as
Vec<Vec<Span>>.
|
| get_original_spans | Returns the span indices of original_text from the tokens based on the
shortest edit script (SES).
|
| lift_span_index | Convert span indices to target_spans based indices.
Expects target_spans is sorted and not overlapping.
|
| lift_spans_index | Convert spans indices on target_spans
|
| remove_span_overlaps | Remove overlapping spans from given spans.
First, longest spans are remained - if the two spans are overlapped, the
first span will be remained. If the two spans are overlapped and their start
positions are same, the longer span will be remained.
|
| remove_span_overlaps_idx | Remove overlapping spans from given spans, and returns remained span indices.
First, longest spans are remained - if the two spans are overlapped, the
first span will be remained. If the two spans are overlapped and their start
positions are same, the longer span will be remained.
|