event_reader.parquet_block_data_store

Documentation for eth_defi.event_reader.parquet_block_data_store Python module.

Parquet dataset backed storage for block data, such as block headers or trades.

Classes

ParquetDatasetBlockDataStore

Store block data as Parquet dataset.

Exceptions

NoGapsWritten

Do not allow gaps in data.

exception NoGapsWritten

Bases: Exception

Do not allow gaps in data.

__init__(*args, **kwargs)
__new__(**kwargs)
add_note()

Exception.add_note(note) – add a note to the exception

with_traceback()

Exception.with_traceback(tb) – set self.__traceback__ to tb and return self.

class ParquetDatasetBlockDataStore

Bases: eth_defi.event_reader.block_data_store.BlockDataStore

Store block data as Parquet dataset.

  • Partitions are keyed by block number.

  • Partitioning allows fast incremental updates by overwriting only the last two partitions.

Parameters
  • path – Directory where the dataset and its metadata file are stored

  • partition_size – Number of blocks per partition

__init__(path, partition_size=100000)
Parameters
  • path (pathlib.Path) – Directory where the dataset and its metadata file are stored

  • partition_size – Number of blocks per partition
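
A minimal construction sketch, assuming a writable dataset directory; the path below is hypothetical and the partition size shown is simply the documented default:

    from pathlib import Path

    from eth_defi.event_reader.parquet_block_data_store import ParquetDatasetBlockDataStore

    # Hypothetical directory for the dataset and its metadata file
    store = ParquetDatasetBlockDataStore(Path("/tmp/block-data"), partition_size=100_000)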

is_virgin()

Check whether this store contains any stored data.

Returns

There is data to load.

Return type

bool
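
For example, a sketch that chooses between a full backfill and loading existing data; read_block_headers() is a hypothetical helper, and the example assumes is_virgin() returns True when nothing has been written yet:

    if store.is_virgin():
        # Nothing stored yet - do a full backfill (hypothetical helper)
        df = read_block_headers(start_block=1)
        store.save(df)
    else:
        df = store.load()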

load(since_block_number=0)

Load data from the Parquet dataset.

Parameters

since_block_number (int) – May return rows earlier than this if the block falls in the middle of a partition

Return type

pandas.core.frame.DataFrame
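
A sketch of an incremental read; because data is partitioned by block number, the result may include rows before the requested block:

    # Load data from block 14,000,000 onwards; rows from earlier in the same
    # partition may also be returned
    df = store.load(since_block_number=14_000_000)
    print(f"Loaded {len(df)} rows")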

save(df, since_block_number=0, check_contains_all_blocks=True)

Save all data to the Parquet dataset.

If block data has already been written, existing data is overwritten on a per-partition basis.

Parameters
  • since_block_number (int) – Write only data from this block number onwards (inclusive)

  • check_contains_all_blocks – Check that we have at least one data record for every block. Note that trades might not happen on every block.

  • df (pandas.core.frame.DataFrame) – Block data to write
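
A sketch of a full save, assuming one row per block with a block_number column; the column names here are illustrative and not mandated by this documentation:

    import pandas as pd

    # Illustrative block header data - the column layout is an assumption
    df = pd.DataFrame({
        "block_number": [1, 2, 3],
        "block_hash": ["0xaa..", "0xbb..", "0xcc.."],
        "timestamp": [1_600_000_000, 1_600_000_012, 1_600_000_024],
    })

    # With check_contains_all_blocks=True (the default), missing blocks are
    # presumably reported via NoGapsWritten
    store.save(df)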

save_incremental(df)

Write all partitions that are missing from the data already on disk.

  • At minimum, the last two partitions are written

  • There might be gaps in the data we are writing

  • There might be gaps in the data already written on disk

  • Heuristics are used to decide what to write

Parameters

df (pandas.core.frame.DataFrame) – Block data to write

Return type

Tuple[int, int]
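
A sketch of topping up an existing dataset; it is assumed the data frame holds the rows collected so far, possibly overlapping what is already on disk, and the store works out which partitions actually need rewriting:

    # Only missing partitions and the last partitions get (re)written;
    # the call returns a pair of integers (see Return type above)
    written = store.save_incremental(df)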

peak_last_block()

Return the last block number stored on the disk.

Return type

Optional[int]
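
For example, a sketch of resuming from the last stored block; None means nothing has been stored yet:

    last_block = store.peak_last_block()
    start_block = 1 if last_block is None else last_block + 1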