Package org.apache.beam.runners.dataflow
Class BatchStatefulParDoOverrides
java.lang.Object
    org.apache.beam.runners.dataflow.BatchStatefulParDoOverrides
public class BatchStatefulParDoOverrides extends java.lang.Object
PTransformOverrideFactories that expand to correctly implement stateful ParDo using the window-unaware BatchViewOverrides.GroupByKeyAndSortValuesOnly to linearize processing per key.
For the Fn API, the PTransformOverrideFactory is only required to perform per-key grouping and expansion.
This implementation relies on implementation details of the Dataflow runner, specifically the standard fusion behavior of ParDo transforms following a GroupByKey.
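The idea of linearizing processing per key can be illustrated without any Beam dependency. The following is a minimal sketch in plain Java, not the runner's implementation: the class LinearizePerKeySketch, the Element type, and the running-sum "state" are all hypothetical, and sorting each key's values by timestamp is an assumption made for illustration. It groups elements by key, sorts the values within each key, and then feeds them one at a time to a step that keeps state per key.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; none of these names exist in Beam.
final class LinearizePerKeySketch {

    /** Hypothetical element: a key, a timestamp used for sorting, and a value. */
    static final class Element {
        final String key;
        final long timestamp;
        final int value;

        Element(String key, long timestamp, int value) {
            this.key = key;
            this.timestamp = timestamp;
            this.value = value;
        }
    }

    /**
     * Groups by key, sorts each group by timestamp (an assumption), and then
     * processes each group sequentially while keeping a running sum as state.
     * Returns, for every key, the sums emitted after each element.
     */
    static Map<String, List<Integer>> runningSumPerKey(List<Element> input) {
        // Step 1: group by key (a stand-in for the grouping transform).
        Map<String, List<Element>> grouped = new TreeMap<>();
        for (Element e : input) {
            grouped.computeIfAbsent(e.key, k -> new ArrayList<>()).add(e);
        }

        Map<String, List<Integer>> output = new TreeMap<>();
        for (Map.Entry<String, List<Element>> entry : grouped.entrySet()) {
            // Step 2: sort the values within this key only.
            List<Element> values = entry.getValue();
            values.sort(Comparator.comparingLong(e -> e.timestamp));

            // Step 3: process one element at a time; the state belongs to this key.
            int state = 0;
            List<Integer> emitted = new ArrayList<>();
            for (Element e : values) {
                state += e.value;
                emitted.add(state);
            }
            output.put(entry.getKey(), emitted);
        }
        return output;
    }

    public static void main(String[] args) {
        List<Element> input = Arrays.asList(
            new Element("a", 3L, 10),
            new Element("b", 1L, 5),
            new Element("a", 1L, 1),
            new Element("a", 2L, 2));
        System.out.println(runningSumPerKey(input));
    }
}
```

Because every element for a key is visited in a single sequential loop, the per-key state never needs concurrent access, which is the property the grouping step is meant to provide.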
-
Nested Class Summary
Nested Classes
Modifier and Type: static class
Class: BatchStatefulParDoOverrides.BatchStatefulDoFn<K,V,OutputT>
Description: A key-preserving DoFn that explodes an iterable that has been grouped by key and window.
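To make "key-preserving" and "explodes an iterable" concrete, here is a small plain-Java sketch. It is not the BatchStatefulDoFn source: the class ExplodeGroupedSketch and its explode method are hypothetical, and windowing is left out entirely. It only shows a grouped (key, values) pair being turned back into individual (key, value) pairs with the key unchanged and the value order retained.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; this is not a Beam class.
final class ExplodeGroupedSketch {

    /**
     * Emits one (key, value) pair per grouped value. The key is passed through
     * unchanged, and values are emitted in the order the iterable yields them.
     */
    static <K, V> List<Map.Entry<K, V>> explode(K key, Iterable<V> groupedValues) {
        List<Map.Entry<K, V>> out = new ArrayList<>();
        for (V value : groupedValues) {
            out.add(new SimpleImmutableEntry<>(key, value));
        }
        return out;
    }
}
```

In a real pipeline the downstream stateful function would then see each pair individually, in the order established by the preceding grouping step.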
-
Constructor Summary
Constructors
Constructor: BatchStatefulParDoOverrides()
Description: (none)
-
Method Summary
Modifier and Type: static <K,InputT,OutputT> org.apache.beam.sdk.runners.PTransformOverrideFactory<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<K,InputT>>,org.apache.beam.sdk.values.PCollectionTuple,org.apache.beam.sdk.transforms.ParDo.MultiOutput<org.apache.beam.sdk.values.KV<K,InputT>,OutputT>>
Method: multiOutputOverrideFactory(DataflowPipelineOptions options)
Description: Returns a PTransformOverrideFactory that replaces a multi-output ParDo with a composite transform specialized for the DataflowRunner.
Modifier and Type: static <K,InputT,OutputT> org.apache.beam.sdk.runners.PTransformOverrideFactory<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<K,InputT>>,org.apache.beam.sdk.values.PCollection<OutputT>,org.apache.beam.sdk.transforms.ParDo.SingleOutput<org.apache.beam.sdk.values.KV<K,InputT>,OutputT>>
Method: singleOutputOverrideFactory()
Description: Returns a PTransformOverrideFactory that replaces a single-output ParDo with a composite transform specialized for the DataflowRunner.
-
Method Detail
-
singleOutputOverrideFactory
public static <K,InputT,OutputT> org.apache.beam.sdk.runners.PTransformOverrideFactory<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<K,InputT>>,org.apache.beam.sdk.values.PCollection<OutputT>,org.apache.beam.sdk.transforms.ParDo.SingleOutput<org.apache.beam.sdk.values.KV<K,InputT>,OutputT>> singleOutputOverrideFactory()
Returns a PTransformOverrideFactory that replaces a single-output ParDo with a composite transform specialized for the DataflowRunner.
-
multiOutputOverrideFactory
public static <K,InputT,OutputT> org.apache.beam.sdk.runners.PTransformOverrideFactory<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.values.KV<K,InputT>>,org.apache.beam.sdk.values.PCollectionTuple,org.apache.beam.sdk.transforms.ParDo.MultiOutput<org.apache.beam.sdk.values.KV<K,InputT>,OutputT>> multiOutputOverrideFactory(DataflowPipelineOptions options)
Returns a PTransformOverrideFactory that replaces a multi-output ParDo with a composite transform specialized for the DataflowRunner.
-