libcudf  24.04.00
Files | Typedefs | Enumerations | Functions

Files

file  hashing.hpp
 

Typedefs

using cudf::hash_value_type = uint32_t
 Type of hash value.
 

Enumerations

enum class  cudf::hash_id { cudf::HASH_IDENTITY = 0 , cudf::HASH_MURMUR3 , cudf::HASH_SPARK_MURMUR3 , cudf::HASH_MD5 }
 Identifies the hash function to be used. More...
 

Functions

std::unique_ptr< columncudf::hash (table_view const &input, hash_id hash_function=hash_id::HASH_MURMUR3, uint32_t seed=DEFAULT_HASH_SEED, rmm::cuda_stream_view stream=cudf::get_default_stream(), rmm::mr::device_memory_resource *mr=rmm::mr::get_current_device_resource())
 Computes the hash value of each row in the input set of columns. More...
 

Detailed Description

Enumeration Type Documentation

◆ hash_id

enum cudf::hash_id
strong

Identifies the hash function to be used.

Enumerator
HASH_IDENTITY 

Identity hash function that simply returns the key to be hashed.

HASH_MURMUR3 

Murmur3 hash function.

HASH_SPARK_MURMUR3 

Spark Murmur3 hash function.

HASH_MD5 

MD5 hash function.

Definition at line 41 of file hashing.hpp.

Function Documentation

◆ hash()

std::unique_ptr<column> cudf::hash ( table_view const &  input,
hash_id  hash_function = hash_id::HASH_MURMUR3,
uint32_t  seed = DEFAULT_HASH_SEED,
rmm::cuda_stream_view  stream = cudf::get_default_stream(),
rmm::mr::device_memory_resource *  mr = rmm::mr::get_current_device_resource() 
)

Computes the hash value of each row in the input set of columns.

Deprecated:
Since 23.08
Parameters
inputThe table of columns to hash
hash_functionThe hash function enum to use
seedOptional seed value to use for the hash function
streamCUDA stream used for device memory operations and kernel launches
mrDevice memory resource used to allocate the returned column's device memory
Returns
A column where each row is the hash of a column from the input