Expand description
Encode extraction results into 128-float feature vectors.
§Missing data strategy
0.0means “absent/unknown” for all optional features.NodeFlagsbits distinguish “value is genuinely zero” from “unknown”:features[48] = 0.0+HAS_PRICE→ price is genuinely $0 (free)features[48] = 0.0+ noHAS_PRICE→ price not found on page
- Non-USD currency: stored raw in original currency. No conversion.
- Price ranges (“$200–$350”): low end stored in
features[48]. - Text ratings (“Excellent”): mapped to numeric 0.0–1.0.
Structs§
- Feature
Encode Result - Result of feature encoding, including computed flags.
Functions§
- encode_
features - Encode extraction results + navigation info into a 128-float feature vector.
- encode_
features_ from_ patterns - Encode features from pattern-based extraction (Layer 1.5) without browser rendering.
- encode_
features_ from_ structured_ data - Encode features from structured data (Layer 1) without browser rendering.
- encode_
features_ with_ flags - Encode extraction results and also return computed
NodeFlags. - merge_
features - Merge features from multiple sources, picking the highest-confidence value per dimension.