iceberg-cpp
Loading...
Searching...
No Matches
Classes
strict_metrics_evaluator.h File Reference
#include <memory>
#include "iceberg/expression/expression.h"
#include "iceberg/iceberg_export.h"
#include "iceberg/result.h"
#include "iceberg/type_fwd.h"

Go to the source code of this file.

Classes

class  iceberg::StrictMetricsEvaluator
 Evaluates an Expression against DataFile. More...
 

Detailed Description

Evaluates an Expression on a DataFile to test whether all rows in the file match.

This evaluation is strict: it returns true if all rows in a file must match the expression. For example, if a file's ts column has min X and max Y, this evaluator will return true for ts < Y+1 but not for ts < Y-1.

Files are passed to #eval(ContentFile), which returns true if all rows in the file must contain matching rows and false if the file may contain rows that do not match.

Due to the comparison implementation of ORC stats, for float/double columns in ORC files, if the first value in a file is NaN, metrics of this file will report NaN for both upper and lower bound despite that the column could contain non-NaN data. Thus in some scenarios explicitly checks for NaN is necessary in order to not include files that may contain rows that don't match.