Module tfhe_cuda_backend::cuda_bind

source ·

Functions§

cleanup_cuda_apply_bivariate_lut_kb_64^⚠
cleanup_cuda_apply_univariate_lut_kb_64^⚠
cleanup_cuda_full_propagation^⚠
cleanup_cuda_integer_bitop^⚠
cleanup_cuda_integer_comparison^⚠
cleanup_cuda_integer_div_rem^⚠
cleanup_cuda_integer_mult^⚠
cleanup_cuda_integer_radix_arithmetic_scalar_shift^⚠
cleanup_cuda_integer_radix_cmux^⚠
cleanup_cuda_integer_radix_logical_scalar_shift^⚠
cleanup_cuda_integer_radix_overflowing_sub^⚠
cleanup_cuda_integer_radix_scalar_mul^⚠
cleanup_cuda_integer_radix_scalar_rotate^⚠
cleanup_cuda_integer_radix_shift_and_rotate^⚠
cleanup_cuda_integer_radix_sum_ciphertexts_vec^⚠
cleanup_cuda_multi_bit_programmable_bootstrap^⚠
This cleanup function frees the data for the multi-bit PBS on GPU contained in pbs_buffer for 64-bit inputs.
cleanup_cuda_programmable_bootstrap^⚠
This cleanup function frees the data for the low latency PBS on GPU contained in pbs_buffer for 32 or 64-bit inputs.
cleanup_cuda_propagate_single_carry^⚠
cuda_add_lwe_ciphertext_vector_64^⚠
Perform the addition of two u64 input LWE ciphertext vectors.
cuda_add_lwe_ciphertext_vector_plaintext_vector_64^⚠
Perform the addition of a u64 input LWE ciphertext vector with a u64 input plaintext vector.
cuda_apply_bivariate_lut_kb_64^⚠
cuda_apply_univariate_lut_kb_64^⚠
cuda_bitop_integer_radix_ciphertext_kb_64^⚠
cuda_cmux_integer_radix_ciphertext_kb_64^⚠
cuda_comparison_integer_radix_ciphertext_kb_64^⚠
cuda_convert_lwe_ciphertext_vector_to_cpu_64^⚠
Copy number_of_cts LWE ciphertext represented with 64 bits in the standard domain from the GPU to the CPU gpu_index using the stream v_stream. All ciphertexts must be concatenated.
cuda_convert_lwe_ciphertext_vector_to_gpu_64^⚠
Copy number_of_cts LWE ciphertext represented with 64 bits in the standard domain from the CPU to the GPU gpu_index using the stream v_stream. All ciphertexts must be concatenated.
cuda_convert_lwe_multi_bit_programmable_bootstrap_key_64^⚠
Copy a multi-bit bootstrap key src represented with 64 bits in the standard domain from the CPU to the GPU gpu_index using the stream v_stream. The resulting bootstrap key dest on the GPU is an array of uint64_t values.
cuda_convert_lwe_programmable_bootstrap_key_64^⚠
Copy a bootstrap key src represented with 64 bits in the standard domain from the CPU to the GPU gpu_index using the stream v_stream, and convert it to the Fourier domain on the GPU. The resulting bootstrap key dest on the GPU is an array of f64 values.
cuda_create_stream^⚠
Create a new Cuda stream on GPU gpu_index
cuda_destroy_stream^⚠
Destroy the Cuda stream v_stream
cuda_drop^⚠
Free memory for pointer ptr on GPU gpu_index synchronously
cuda_drop_async^⚠
Free memory for pointer ptr on GPU gpu_index asynchronously, using stream v_stream
cuda_full_propagation_64_inplace^⚠
cuda_get_max_shared_memory^⚠
Get the maximum amount of shared memory on GPU gpu_index
cuda_get_number_of_gpus^⚠
Get the total number of Nvidia GPUs detected on the platform
cuda_integer_div_rem_radix_ciphertext_kb_64^⚠
cuda_integer_mult_radix_ciphertext_kb_64^⚠
cuda_integer_radix_arithmetic_scalar_shift_kb_64_inplace^⚠
cuda_integer_radix_logical_scalar_shift_kb_64_inplace^⚠
cuda_integer_radix_overflowing_sub_kb_64^⚠
cuda_integer_radix_scalar_rotate_kb_64_inplace^⚠
cuda_integer_radix_shift_and_rotate_kb_64_inplace^⚠
cuda_integer_radix_sum_ciphertexts_vec_kb_64^⚠
cuda_keyswitch_lwe_ciphertext_vector_64^⚠
Perform keyswitch on a batch of 64 bits input LWE ciphertexts.
cuda_malloc_async^⚠
Allocate size memory on GPU gpu_index asynchronously
cuda_memcpy_async_gpu_to_gpu^⚠
Copy size memory asynchronously from src to dest on the same GPU gpu_index using the Cuda stream v_stream.
cuda_memcpy_async_to_cpu^⚠
Copy size memory asynchronously from src on GPU gpu_index to dest on CPU using the Cuda stream v_stream.
cuda_memcpy_async_to_gpu^⚠
Copy size memory asynchronously from src on CPU to dest on GPU gpu_index using the Cuda stream v_stream.
cuda_memset_async^⚠
Copy size memory asynchronously from src on CPU to dest on GPU gpu_index using the Cuda stream v_stream.
cuda_mult_lwe_ciphertext_vector_cleartext_vector_64^⚠
Perform the multiplication of a u64 input LWE ciphertext vector with a u64 input cleartext vector.
cuda_multi_bit_programmable_bootstrap_lwe_ciphertext_vector_64^⚠
Perform bootstrapping on a batch of input u64 LWE ciphertexts using the multi-bit algorithm.
cuda_negate_integer_radix_ciphertext_64_inplace^⚠
cuda_negate_lwe_ciphertext_vector_64^⚠
Perform the negation of a u64 input LWE ciphertext vector.
cuda_programmable_bootstrap_lwe_ciphertext_vector_64^⚠
Perform bootstrapping on a batch of input u64 LWE ciphertexts.
cuda_propagate_single_carry_kb_64_inplace^⚠
cuda_scalar_addition_integer_radix_ciphertext_64_inplace^⚠
cuda_scalar_bitop_integer_radix_ciphertext_kb_64^⚠
cuda_scalar_comparison_integer_radix_ciphertext_kb_64^⚠
cuda_scalar_multiplication_integer_radix_ciphertext_64_inplace^⚠
cuda_setup_multi_gpu^⚠
cuda_synchronize_device^⚠
Synchronize all streams on GPU gpu_index
cuda_synchronize_stream^⚠
Synchronize Cuda stream
scratch_cuda_apply_bivariate_lut_kb_64^⚠
scratch_cuda_apply_univariate_lut_kb_64^⚠
scratch_cuda_full_propagation_64^⚠
scratch_cuda_integer_div_rem_radix_ciphertext_kb_64^⚠
scratch_cuda_integer_mult_radix_ciphertext_kb_64^⚠
scratch_cuda_integer_radix_arithmetic_scalar_shift_kb_64^⚠
scratch_cuda_integer_radix_bitop_kb_64^⚠
scratch_cuda_integer_radix_cmux_kb_64^⚠
scratch_cuda_integer_radix_comparison_kb_64^⚠
scratch_cuda_integer_radix_logical_scalar_shift_kb_64^⚠
scratch_cuda_integer_radix_overflowing_sub_kb_64^⚠
scratch_cuda_integer_radix_scalar_rotate_kb_64^⚠
scratch_cuda_integer_radix_shift_and_rotate_kb_64^⚠
scratch_cuda_integer_radix_sum_ciphertexts_vec_kb_64^⚠
scratch_cuda_integer_scalar_mul_kb_64^⚠
scratch_cuda_multi_bit_programmable_bootstrap_64^⚠
This scratch function allocates the necessary amount of data on the GPU for the multi-bit PBS on 64-bit inputs into pbs_buffer.
scratch_cuda_programmable_bootstrap_64^⚠
This scratch function allocates the necessary amount of data on the GPU for the low latency PBS on 64-bit inputs, into pbs_buffer. It also configures SM options on the GPU in case FULLSM or PARTIALSM mode are going to be used.
scratch_cuda_propagate_single_carry_kb_64_inplace^⚠

Module tfhe_cuda_backend::cuda_bindCopy item path

Functions§

Module tfhe_cuda_backend::cuda_bind