K - Type of a key in upstream data.V - Type of a value in upstream data.C - type of a partition context.public class SimpleLabeledDatasetDataBuilder<K,V,C extends Serializable> extends Object implements PartitionDataBuilder<K,V,C,SimpleLabeledDatasetData>
data builder that makes SimpleLabeledDatasetData.| Constructor and Description |
|---|
SimpleLabeledDatasetDataBuilder(Preprocessor<K,V> vectorizer)
Constructs a new instance of partition
data builder that makes SimpleLabeledDatasetData. |
| Modifier and Type | Method and Description |
|---|---|
SimpleLabeledDatasetData |
build(LearningEnvironment env,
Iterator<UpstreamEntry<K,V>> upstreamData,
long upstreamDataSize,
C ctx)
Builds a new partition
data from a partition upstream data and partition context. |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitandThen, buildpublic SimpleLabeledDatasetDataBuilder(Preprocessor<K,V> vectorizer)
data builder that makes SimpleLabeledDatasetData.vectorizer - Function that extracts labeled vectors from an upstream data.public SimpleLabeledDatasetData build(LearningEnvironment env, Iterator<UpstreamEntry<K,V>> upstreamData, long upstreamDataSize, C ctx)
data from a partition upstream data and partition context.
Important: there is no guarantee that there will be no more than one UpstreamEntry with given key,
UpstreamEntry should be thought rather as a container saving all data from upstream, but omitting uniqueness
constraint. This constraint is omitted to allow upstream data transformers in DatasetBuilder replicating
entries. For example it can be useful for bootstrapping.build in interface PartitionDataBuilder<K,V,C extends Serializable,SimpleLabeledDatasetData>env - Learning environment.upstreamData - Partition upstream data.upstreamDataSize - Partition upstream data size.ctx - Partition context.data.
GridGain In-Memory Computing Platform : ver. 8.9.26 Release Date : October 16 2025