How to write aggregate functions
How to write aggregate functions
Databend allows us to write custom aggregate functions through rust code. It's not an easy way because you need to be a rustacean first. Databend has a plan to support writing UDAFs in other languages(like js, web assembly) in the future.
In this section we will talk about how to write aggregate functions in Databend.
AggregateFunction trait introduction
All aggregate functions implement AggregateFunction
trait, and we register them into a global static FunctionFactory
, the factory is just an index map and the key is the name of the aggregate function.
note
Function name in Databend is case-insensitive.
pub trait AggregateFunction: fmt::Display + Sync + Send {
fn name(&self) -> &str;
fn return_type(&self) -> Result<DataType>;
fn nullable(&self, _input_schema: &DataSchema) -> Result<bool>;
fn init_state(&self, place: StateAddr);
fn state_layout(&self) -> Layout;
fn accumulate(&self, _place: StateAddr, _arrays: &[Series], _input_rows: usize) -> Result<()>;
fn accumulate_keys(
&self,
_places: &[StateAddr],
_offset: usize,
_arrays: &[Series],
_input_rows: usize,
) -> Result<()>;
fn serialize(&self, _place: StateAddr, _writer: &mut BytesMut) -> Result<()>;
fn deserialize(&self, _place: StateAddr, _reader: &mut &[u8]) -> Result<()>;
fn merge(&self, _place: StateAddr, _rhs: StateAddr) -> Result<()>;
fn merge_result(&self, _place: StateAddr) -> Result<DataValue>;
}
Understand the functions
- The function
name
indicates the name of this function, such assum
,min
. - The function
return_type
indicates the return type of the function, it may vary with different arguments, such assum(int8)
->int64
,sum(uint8)
->uint64
,sum(float64)
->float64
. - The function
nullable
indicates whether thereturn_type
is nullable or not.
Before we start to introduce the function init_state
, let's ask a question first:
what's aggregate function state?
It indicates the temporary results of an aggregate function. Because an aggregate function accumulates data in columns block by block and there will be some intermediate results after the aggregation. Therefore, the state must be mergeable, serializable.
For example, in the avg
aggregate function, we can represent the state like:
struct AggregateAvgState<T: BinarySer + BinaryDe> {
pub value: T,
pub count: u64,
}
- The function
init_state
initializes the aggregate function state, we ensure the memory is already allocated, and we just need to initial the state with the initial value. - The function
state_layout
indicates the memory layout of the state. - The function
accumulate
is used in aggregation with a single batch, which means the whole block can be aggregated in a single state, no other keys. The SQL query, which applies aggregation without group-by columns, will hit this function.
Noted that the argument _arrays
is the function arguments, we can safely get the array by index without index bound check because we must validate the argument numbers and types in function constructor.
The _input_rows
is the rows of the current block, and it may be useful when the _arrays
is empty, e.g., count()
function.
- The function
accumulate_keys
is similar to accumulate, but we must take into consideration the keys and offsets, for which each key represents a unique memory address named place. - The function
serialize
serializes state into binary. - The function
deserialize
deserializes state from binary. - The function
merge
, can be used to merge other state into current state. - The function
merge_result
, can be used to represent the aggregate function state into one-row field.
Example
Let's take an example of aggregate function sum
.
It's declared as AggregateSumFunction<T, SumT>
, we can accept varying integer types like u8
, i8
. T
and SumT
is logic types which implement DFPrimitiveType
. e.g., T
is u8
and SumT
must be u64
.
Also, we can dispatch it using macros by matching the types of the arguments. Take a look at the with_match_primitive_type
to understand the dispatch macros.
The AggregateSumState
will be
struct AggregateSumState<T> {
pub value: Option<T>,
}
The generic T
is from SumT
, the Option<T>
can return null
if nothing is passed into this function.
Let's take into the function accumulate_keys
, because this is the only function that a little hard to understand in this case.
The places
is the memory address of the first state in this row, so we can get the address of AggregateSumState<T>
using places[row] + offset
, then using place.get::<AggregateSumState<SumT>>()
to get the value of the corresponding state.
Since we already know the array type of this function, we can safely cast it to arrow's PrimitiveArray<T>
, here we make two branches to reduce the branch prediction of CPU, null
and no_null
. In no_null
case, we just iterate the array and apply the sum
, this is good for compiler to optimize the codes into vectorized codes.
Ok, this example is pretty easy. If you already read this, you may have the ability to write a new function.
Refer to other examples
As you see, adding a new aggregate function in Databend is not as hard as you think.
Before you start to add one, please refer to other aggregate function examples, such as min
, count
, max
, avg
.
Testing
To be a good engineer, don't forget to test your codes, please add unit tests and stateless tests after you finish the new aggregate functions.
Summary
We welcome all community users to contribute more powerful functions to Databend. If you find any problems, feel free to open an issue in GitHub, we will use our best efforts to help you.