Struct git_pack::index::File [−][src]
pub struct File { /* fields omitted */ }
Expand description
A representation of a pack index file
Implementations
Instantiation
Iteration and access
Returns 20 bytes sha1 at the given index in our list of (sorted) sha1 hashes. The index ranges from 0 to self.num_objects()
Panics
If index
is out of bounds.
Returns the offset into our pack data file at which to start reading the object at index
.
Panics
If index
is out of bounds.
Returns the CRC32 of the object at the given index
.
Note: These are always present for index version 2 or higher.
Panics
If index
is out of bounds.
Returns the index
of the given SHA1 for use with the oid_at_index()
,
pack_offset_at_index()
or crc32_at_index()
.
An iterator over all Entries
of this index file.
Traversal with index
pub fn traverse_with_index<P, Processor, E>(
&self,
check: SafetyCheck,
thread_limit: Option<usize>,
new_processor: impl Fn() -> Processor + Send + Clone,
progress: P,
pack: &File,
should_interrupt: Arc<AtomicBool>
) -> Result<(ObjectId, Outcome, P), Error<E>> where
P: Progress,
Processor: FnMut(Kind, &[u8], &Entry, &mut <<P as Progress>::SubProgress as Progress>::SubProgress) -> Result<(), E>,
E: Error + Send + Sync + 'static,
pub fn traverse_with_index<P, Processor, E>(
&self,
check: SafetyCheck,
thread_limit: Option<usize>,
new_processor: impl Fn() -> Processor + Send + Clone,
progress: P,
pack: &File,
should_interrupt: Arc<AtomicBool>
) -> Result<(ObjectId, Outcome, P), Error<E>> where
P: Progress,
Processor: FnMut(Kind, &[u8], &Entry, &mut <<P as Progress>::SubProgress as Progress>::SubProgress) -> Result<(), E>,
E: Error + Send + Sync + 'static,
Iterate through all decoded objects in the given pack
and handle them with a Processor
, using an index to reduce waste
at the cost of memory.
For more details, see the documentation on the traverse()
method.
Verify and validate the content of the index file
pub fn traverse_with_lookup<P, C, Processor, E>(
&self,
new_processor: impl Fn() -> Processor + Send + Clone,
new_cache: impl Fn() -> C + Send + Clone,
progress: P,
pack: &File,
_: Options
) -> Result<(ObjectId, Outcome, P), Error<E>> where
P: Progress,
C: DecodeEntry,
E: Error + Send + Sync + 'static,
Processor: FnMut(Kind, &[u8], &Entry, &mut <<P as Progress>::SubProgress as Progress>::SubProgress) -> Result<(), E>,
pub fn traverse_with_lookup<P, C, Processor, E>(
&self,
new_processor: impl Fn() -> Processor + Send + Clone,
new_cache: impl Fn() -> C + Send + Clone,
progress: P,
pack: &File,
_: Options
) -> Result<(ObjectId, Outcome, P), Error<E>> where
P: Progress,
C: DecodeEntry,
E: Error + Send + Sync + 'static,
Processor: FnMut(Kind, &[u8], &Entry, &mut <<P as Progress>::SubProgress as Progress>::SubProgress) -> Result<(), E>,
Iterate through all decoded objects in the given pack
and handle them with a Processor
using a cache to reduce the amount of
waste while decoding objects.
For more details, see the documentation on the traverse()
method.
Traversal of pack data files using an index file
pub fn traverse<P, C, Processor, E>(
&self,
pack: &File,
progress: Option<P>,
new_processor: impl Fn() -> Processor + Send + Clone,
new_cache: impl Fn() -> C + Send + Clone,
_: Options
) -> Result<(ObjectId, Outcome, Option<P>), Error<E>> where
P: Progress,
C: DecodeEntry,
E: Error + Send + Sync + 'static,
Processor: FnMut(Kind, &[u8], &Entry, &mut DoOrDiscard<<<P as Progress>::SubProgress as Progress>::SubProgress>) -> Result<(), E>,
pub fn traverse<P, C, Processor, E>(
&self,
pack: &File,
progress: Option<P>,
new_processor: impl Fn() -> Processor + Send + Clone,
new_cache: impl Fn() -> C + Send + Clone,
_: Options
) -> Result<(ObjectId, Outcome, Option<P>), Error<E>> where
P: Progress,
C: DecodeEntry,
E: Error + Send + Sync + 'static,
Processor: FnMut(Kind, &[u8], &Entry, &mut DoOrDiscard<<<P as Progress>::SubProgress as Progress>::SubProgress>) -> Result<(), E>,
Iterate through all decoded objects in the given pack
and handle them with a Processor
.
The return value is (pack-checksum, Outcome
, progress
), thus the pack traversal will always verify
the whole packs checksum to assure it was correct. In case of bit-rod, the operation will abort early without
verifying all objects using the interrupt mechanism mechanism.
Algorithms
Using the Options::algorithm
field one can chose between two algorithms providing different tradeoffs. Both invoke
new_processor()
to create functions receiving decoded objects, their object kind, index entry and a progress instance to provide
progress information.
Algorithm::DeltaTreeLookup
builds an index to avoid any unnecessary computation while resolving objects, avoiding the need for a cache entirely, renderingnew_cache()
unused. One could also calltraverse_with_index()
directly.Algorithm::Lookup
uses a cache created bynew_cache()
to avoid having to re-compute all bases of a delta-chain while decoding objects. One could also calltraverse_with_lookup()
directly.
Use thread_limit
to further control parallelism and check
to define how much the passed
objects shall be verified beforehand.
Verify and validate the content of the index file
Returns the trailing hash stored at the end of this index file.
It’s a hash over all bytes of the index.
Returns the hash of the pack data file that this index file corresponds to.
It should crate::data::File::checksum()
of the corresponding pack data file.
pub fn verify_checksum(
&self,
progress: impl Progress,
should_interrupt: &AtomicBool
) -> Result<ObjectId, Error>
pub fn verify_checksum(
&self,
progress: impl Progress,
should_interrupt: &AtomicBool
) -> Result<ObjectId, Error>
Validate that our index_checksum()
matches the actual contents
of this index file, and return it if it does.
The most thorough validation of integrity of both index file and the corresponding pack data file, if provided. Returns the checksum of the index file, the traversal outcome and the given progress if the integrity check is successful.
If pack
is provided, it is expected (and validated to be) the pack belonging to this index.
It will be used to validate internal integrity of the pack before checking each objects integrity
is indeed as advertised via its SHA1 as stored in this index, as well as the CRC32 hash.
The last member of the Option is a function returning an implementation of crate::cache::DecodeEntry
to be used if
the index::traverse::Algorithm
is Lookup
.
To set this to None
, use None::<(_, _, _, fn() -> crate::cache::Never)>
.
The thread_limit
optionally specifies the amount of threads to be used for the pack traversal.
make_cache
is only used in case a pack
is specified, use existing implementations in the crate::cache
module.
Tradeoffs
The given progress
is inevitably consumed if there is an error, which is a tradeoff chosen to easily allow using ?
in the
error case.
Various ways of writing an index file from pack entries
pub fn write_data_iter_to_stream<F, F2>(
kind: Version,
make_resolver: F,
entries: impl Iterator<Item = Result<Entry, Error>>,
thread_limit: Option<usize>,
root_progress: impl Progress,
out: impl Write,
should_interrupt: &AtomicBool
) -> Result<Outcome, Error> where
F: FnOnce() -> Result<F2>,
F2: for<'r> Fn(EntryRange, &'r mut Vec<u8>) -> Option<()> + Send + Clone,
pub fn write_data_iter_to_stream<F, F2>(
kind: Version,
make_resolver: F,
entries: impl Iterator<Item = Result<Entry, Error>>,
thread_limit: Option<usize>,
root_progress: impl Progress,
out: impl Write,
should_interrupt: &AtomicBool
) -> Result<Outcome, Error> where
F: FnOnce() -> Result<F2>,
F2: for<'r> Fn(EntryRange, &'r mut Vec<u8>) -> Option<()> + Send + Clone,
Write information about entries
as obtained from a pack data file into a pack index file via the out
stream.
The resolver produced by make_resolver
must resolve pack entries from the same pack data file that produced the
entries
iterator.
kind
is the version of pack index to produce, use crate::index::Version::default()
if in doubt.
tread_limit
is used for a parallel tree traversal for obtaining object hashes with optimal performance.
root_progress
is the top-level progress to stay informed about the progress of this potentially long-running
computation.
Remarks
- neither in-pack nor out-of-pack Ref Deltas are supported here, these must have been resolved beforehand.
make_resolver()
will only be called after the iterator stopped returning elements and produces a function that provides all bytes belonging to a pack entry writing them to the given mutable outputVec
. It should returnNone
if the entry cannot be resolved from the pack that produced theentries
iterator, causing the write operation to fail.
Basic file information