Skip to main content

Module origins

Module origins 

Source
Expand description

Functionalities related to managing software origins in the Software Heritage graph.

Software origins (origins for short) represent places used to develop and distribute software source code, such as version control system repositories and (source) package repositories. Origins are represented in the Software Heritage graph as nodes with type NodeType::Origin.

Statics§

PROTOCOL_RE
Regular expression matching any URI/URL protocol prefix, e.g., “http://”.

Functions§

fuzzy_find_origins
Search if a given set of origins, specified by URL, exist in the graph.
normalize_origin_url
Normalize an origin URL to a canonical origin identifier.