iceberg-cpp
Loading...
Searching...
No Matches
Classes | Public Member Functions | Protected Member Functions | Protected Attributes | List of all members
iceberg::SnapshotUpdate Class Referenceabstract

Base class for operations that produce snapshots. More...

#include <snapshot_update.h>

Inheritance diagram for iceberg::SnapshotUpdate:
iceberg::PendingUpdate iceberg::ErrorCollector iceberg::FastAppend

Classes

struct  ApplyResult
 Result of applying a snapshot update. More...
 

Public Member Functions

Kind kind () const override
 Return the kind of this pending update.
 
bool IsRetryable () const override
 Whether this update can be retried after a commit conflict.
 
auto & DeleteWith (this auto &self, std::function< Status(const std::string &)> delete_func)
 Set a callback to delete files instead of the table's default.
 
auto & StageOnly (this auto &self)
 Stage a snapshot in table metadata, but not update the current snapshot id.
 
auto & SetTargetBranch (this auto &self, const std::string &branch)
 Perform operations on a particular branch.
 
auto & Set (this auto &self, const std::string &property, const std::string &value)
 Set a summary property.
 
Result< ApplyResultApply ()
 Apply the update's changes to create a new snapshot.
 
Status Finalize (Result< const TableMetadata * > commit_result) override
 Finalize the snapshot update, cleaning up any uncommitted files.
 
- Public Member Functions inherited from iceberg::PendingUpdate
virtual Status Commit ()
 Apply the pending changes and commit.
 
 PendingUpdate (const PendingUpdate &)=delete
 
PendingUpdateoperator= (const PendingUpdate &)=delete
 
 PendingUpdate (PendingUpdate &&) noexcept=default
 
PendingUpdateoperator= (PendingUpdate &&) noexcept=default
 
- Public Member Functions inherited from iceberg::ErrorCollector
 ErrorCollector (ErrorCollector &&)=default
 
ErrorCollectoroperator= (ErrorCollector &&)=default
 
 ErrorCollector (const ErrorCollector &)=default
 
ErrorCollectoroperator= (const ErrorCollector &)=default
 
template<typename... Args>
auto & AddError (this auto &self, ErrorKind kind, const std::format_string< Args... > fmt, Args &&... args)
 Add a specific error and return reference to derived class.
 
auto & AddError (this auto &self, Error err)
 Add an existing error object and return reference to derived class.
 
auto & AddError (this auto &self, std::unexpected< Error > err)
 Add an unexpected result's error and return reference to derived class.
 
bool has_errors () const
 Check if any errors have been collected.
 
size_t error_count () const
 Get the number of errors collected.
 
Status CheckErrors () const
 Check for accumulated errors and return them if any exist.
 
void ClearErrors ()
 Clear all accumulated errors.
 
const std::vector< Error > & errors () const
 Get read-only access to all collected errors.
 

Protected Member Functions

 SnapshotUpdate (std::shared_ptr< TransactionContext > ctx)
 
Result< std::vector< ManifestFile > > WriteDataManifests (std::span< const std::shared_ptr< DataFile > > files, const std::shared_ptr< PartitionSpec > &spec, std::optional< int64_t > data_sequence_number=std::nullopt)
 Write data manifests for the given data files.
 
Result< std::vector< ManifestFile > > WriteDeleteManifests (std::span< const std::shared_ptr< DataFile > > files, const std::shared_ptr< PartitionSpec > &spec)
 Write delete manifests for the given delete files.
 
const std::string & target_branch () const
 
bool can_inherit_snapshot_id () const
 
const std::string & commit_uuid () const
 
int32_t manifest_count () const
 
int32_t attempt () const
 
int64_t target_manifest_size_bytes () const
 
virtual void CleanUncommitted (const std::unordered_set< std::string > &committed)=0
 Clean up any uncommitted manifests that were created.
 
virtual std::string operation ()=0
 A string that describes the action that produced the new snapshot.
 
virtual Status Validate (const TableMetadata &current_metadata, const std::shared_ptr< Snapshot > &snapshot)
 Validate the current metadata.
 
virtual Result< std::vector< ManifestFile > > Apply (const TableMetadata &metadata_to_update, const std::shared_ptr< Snapshot > &snapshot)=0
 Apply the update's changes to the given metadata and snapshot.
 
virtual std::unordered_map< std::string, std::string > Summary ()=0
 Get the summary map for this operation.
 
virtual bool CleanupAfterCommit () const
 Check if cleanup should happen after commit.
 
int64_t SnapshotId ()
 Get or generate the snapshot ID for the new snapshot.
 
Status DeleteFile (const std::string &path)
 Delete a file at the given path.
 
std::string ManifestPath ()
 
std::string ManifestListPath ()
 
SnapshotSummaryBuildersummary_builder ()
 
- Protected Member Functions inherited from iceberg::PendingUpdate
 PendingUpdate (std::shared_ptr< TransactionContext > ctx)
 
const TableMetadatabase () const
 

Protected Attributes

SnapshotSummaryBuilder summary_
 
- Protected Attributes inherited from iceberg::PendingUpdate
std::shared_ptr< TransactionContextctx_
 
- Protected Attributes inherited from iceberg::ErrorCollector
std::vector< Errorerrors_
 

Additional Inherited Members

- Public Types inherited from iceberg::PendingUpdate
enum class  Kind : uint8_t {
  kExpireSnapshots , kSetSnapshot , kUpdateLocation , kUpdatePartitionSpec ,
  kUpdatePartitionStatistics , kUpdateProperties , kUpdateSchema , kUpdateSnapshot ,
  kUpdateSnapshotReference , kUpdateSortOrder , kUpdateStatistics
}
 

Detailed Description

Base class for operations that produce snapshots.

This class provides common functionality for creating new snapshots, including manifest list writing and cleanup.

Member Function Documentation

◆ Apply() [1/2]

Result< SnapshotUpdate::ApplyResult > iceberg::SnapshotUpdate::Apply ( )

Apply the update's changes to create a new snapshot.

This method validates the changes, applies them to the metadata, and creates a new snapshot without committing it. The snapshot is stored internally and can be accessed after Apply() succeeds.

Returns
A result containing the new snapshot, or an error

◆ Apply() [2/2]

virtual Result< std::vector< ManifestFile > > iceberg::SnapshotUpdate::Apply ( const TableMetadata metadata_to_update,
const std::shared_ptr< Snapshot > &  snapshot 
)
protectedpure virtual

Apply the update's changes to the given metadata and snapshot.

Parameters
metadata_to_updateThe base table metadata to apply changes to
snapshotSnapshot to apply the changes to
Returns
A vector of manifest files for the new snapshot

Implemented in iceberg::FastAppend.

◆ CleanUncommitted()

virtual void iceberg::SnapshotUpdate::CleanUncommitted ( const std::unordered_set< std::string > &  committed)
protectedpure virtual

Clean up any uncommitted manifests that were created.

Manifests may not be committed if Apply is called multiple times because a commit conflict has occurred. Implementations may keep around manifests because the same changes will be made by both Apply calls. This method instructs the implementation to clean up those manifests and passes the paths of the manifests that were actually committed.

Parameters
committedA set of manifest paths that were actually committed

Implemented in iceberg::FastAppend.

◆ CleanupAfterCommit()

virtual bool iceberg::SnapshotUpdate::CleanupAfterCommit ( ) const
inlineprotectedvirtual

Check if cleanup should happen after commit.

Returns
True if cleanup should happen after commit

Reimplemented in iceberg::FastAppend.

◆ DeleteFile()

Status iceberg::SnapshotUpdate::DeleteFile ( const std::string &  path)
protected

Delete a file at the given path.

Parameters
pathThe path of the file to delete
Returns
A status indicating the result of the deletion

◆ DeleteWith()

auto & iceberg::SnapshotUpdate::DeleteWith ( this auto &  self,
std::function< Status(const std::string &)>  delete_func 
)
inline

Set a callback to delete files instead of the table's default.

Parameters
delete_funcA function used to delete file locations
Returns
Reference to this for method chaining
Note
Cannot be called more than once

◆ Finalize()

Status iceberg::SnapshotUpdate::Finalize ( Result< const TableMetadata * >  commit_result)
overridevirtual

Finalize the snapshot update, cleaning up any uncommitted files.

Reimplemented from iceberg::PendingUpdate.

◆ IsRetryable()

bool iceberg::SnapshotUpdate::IsRetryable ( ) const
inlineoverridevirtual

Whether this update can be retried after a commit conflict.

Implements iceberg::PendingUpdate.

◆ kind()

Kind iceberg::SnapshotUpdate::kind ( ) const
inlineoverridevirtual

Return the kind of this pending update.

Implements iceberg::PendingUpdate.

◆ operation()

virtual std::string iceberg::SnapshotUpdate::operation ( )
protectedpure virtual

A string that describes the action that produced the new snapshot.

Returns
A string operation name

Implemented in iceberg::FastAppend.

◆ Set()

auto & iceberg::SnapshotUpdate::Set ( this auto &  self,
const std::string &  property,
const std::string &  value 
)
inline

Set a summary property.

Parameters
propertyThe property name
valueThe property value
Returns
Reference to this for method chaining

◆ SetTargetBranch()

auto & iceberg::SnapshotUpdate::SetTargetBranch ( this auto &  self,
const std::string &  branch 
)
inline

Perform operations on a particular branch.

Parameters
branchWhich is name of SnapshotRef of type branch
Returns
Reference to this for method chaining

◆ StageOnly()

auto & iceberg::SnapshotUpdate::StageOnly ( this auto &  self)
inline

Stage a snapshot in table metadata, but not update the current snapshot id.

Returns
Reference to this for method chaining

◆ Summary()

virtual std::unordered_map< std::string, std::string > iceberg::SnapshotUpdate::Summary ( )
protectedpure virtual

Get the summary map for this operation.

Returns
A map of summary properties

Implemented in iceberg::FastAppend.

◆ Validate()

virtual Status iceberg::SnapshotUpdate::Validate ( const TableMetadata current_metadata,
const std::shared_ptr< Snapshot > &  snapshot 
)
inlineprotectedvirtual

Validate the current metadata.

Child operations can override this to add custom validation.

Parameters
current_metadataCurrent table metadata to validate
snapshotEnding snapshot on the lineage which is being validated

◆ WriteDataManifests()

Result< std::vector< ManifestFile > > iceberg::SnapshotUpdate::WriteDataManifests ( std::span< const std::shared_ptr< DataFile > >  files,
const std::shared_ptr< PartitionSpec > &  spec,
std::optional< int64_t >  data_sequence_number = std::nullopt 
)
protected

Write data manifests for the given data files.

Parameters
filesData files to write
specThe partition spec to use
data_sequence_numberOptional data sequence number for the files
Returns
A vector of manifest files

◆ WriteDeleteManifests()

Result< std::vector< ManifestFile > > iceberg::SnapshotUpdate::WriteDeleteManifests ( std::span< const std::shared_ptr< DataFile > >  files,
const std::shared_ptr< PartitionSpec > &  spec 
)
protected

Write delete manifests for the given delete files.

Parameters
filesDelete files to write
specThe partition spec to use
Returns
A vector of manifest files

The documentation for this class was generated from the following files: