Class AvroIO.ParseFiles<T>
- java.lang.Object
  - org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.io.FileIO.ReadableFile>,org.apache.beam.sdk.values.PCollection<T>>
    - org.apache.beam.sdk.extensions.avro.io.AvroIO.ParseFiles<T>
- All Implemented Interfaces:
java.io.Serializable, org.apache.beam.sdk.transforms.display.HasDisplayData
- Enclosing class:
- AvroIO
public abstract static class AvroIO.ParseFiles<T> extends org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.io.FileIO.ReadableFile>,org.apache.beam.sdk.values.PCollection<T>>

Implementation of AvroIO.parseFilesGenericRecords(org.apache.beam.sdk.transforms.SerializableFunction<org.apache.avro.generic.GenericRecord, T>).
- See Also:
- Serialized Form
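The following is a minimal usage sketch, not taken from this page. It assumes the Beam Java SDK and its Avro extension are on the classpath. The class name ParseFilesSketch, the helper methods, and the Avro field "name" are hypothetical and chosen only for illustration. The input is produced with FileIO.match() and FileIO.readMatches(), which yield the PCollection<FileIO.ReadableFile> this transform consumes.

```java
import org.apache.beam.sdk.Pipeline;
import org.apache.beam.sdk.coders.StringUtf8Coder;
import org.apache.beam.sdk.extensions.avro.io.AvroIO;
import org.apache.beam.sdk.io.FileIO;
import org.apache.beam.sdk.values.PCollection;

public class ParseFilesSketch {

  /** Builds a ParseFiles<String> that extracts a hypothetical "name" field from each record. */
  public static AvroIO.ParseFiles<String> nameParser() {
    return AvroIO.<String>parseFilesGenericRecords(
            // parseFn: GenericRecord -> String; "name" is an assumed field of the schema.
            record -> String.valueOf(record.get("name")))
        // Coder for the parseFn result.
        .withCoder(StringUtf8Coder.of())
        // Run a Reshuffle before the file reads occur.
        .withUsesReshuffle(true)
        // 1 MB bundles, the lower value this page suggests for streaming applications.
        .withDesiredBundleSizeBytes(1024L * 1024L);
  }

  /** Matches files by pattern, opens them, and parses every Avro record into a String. */
  public static PCollection<String> readNames(Pipeline pipeline, String filepattern) {
    return pipeline
        .apply(FileIO.match().filepattern(filepattern))
        .apply(FileIO.readMatches())
        .apply(nameParser());
  }
}
```

Each with... method returns an AvroIO.ParseFiles<T>, so the calls can be chained in any order before the transform is applied.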
-
-
Constructor Summary
Constructors
Constructor | Description
ParseFiles() |
-
Method Summary
Modifier and Type | Method | Description
org.apache.beam.sdk.values.PCollection<T> | expand(org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.io.FileIO.ReadableFile> input) |
void | populateDisplayData(org.apache.beam.sdk.transforms.display.DisplayData.Builder builder) |
AvroIO.ParseFiles<T> | withCoder(org.apache.beam.sdk.coders.Coder<T> coder) | Specifies the coder for the result of the parseFn.
AvroIO.ParseFiles<T> | withDesiredBundleSizeBytes(long desiredBundleSizeBytes) | Set a value for the bundle size for parallel reads.
AvroIO.ParseFiles<T> | withFileExceptionHandler(org.apache.beam.sdk.io.ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler) | Specifies if exceptions should be logged only for streaming pipelines.
AvroIO.ParseFiles<T> | withUsesReshuffle(boolean usesReshuffle) | Specifies if a Reshuffle should run before file reads occur.
-
Methods inherited from class org.apache.beam.sdk.transforms.PTransform
addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, setDisplayData, setResourceHints, toString, validate, validate
-
-
-
-
Method Detail
-
withCoder
public AvroIO.ParseFiles<T> withCoder(org.apache.beam.sdk.coders.Coder<T> coder)
Specifies the coder for the result of the parseFn.
-
withUsesReshuffle
public AvroIO.ParseFiles<T> withUsesReshuffle(boolean usesReshuffle)
Specifies if a Reshuffle should run before file reads occur.
-
withFileExceptionHandler
public AvroIO.ParseFiles<T> withFileExceptionHandler(org.apache.beam.sdk.io.ReadAllViaFileBasedSource.ReadFileRangesFnExceptionHandler exceptionHandler)
Specifies if exceptions should be logged only for streaming pipelines.
-
withDesiredBundleSizeBytes
public AvroIO.ParseFiles<T> withDesiredBundleSizeBytes(long desiredBundleSizeBytes)
Sets the desired bundle size, in bytes, for parallel reads. The default is 64 MB. Consider a lower value (e.g. 1 MB) for streaming applications.
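To get a feel for what the bundle size means, the rough arithmetic below estimates how many byte ranges a file would be divided into at a given setting. This is only an illustrative sketch: it assumes ranges of at most the desired size and reads "MB" as 2^20 bytes. The class BundleSizeSketch and the method estimateRanges are hypothetical and not part of Beam. The actual split points depend on the runner and on Avro block boundaries.

```java
public class BundleSizeSketch {

  /** 64 MB, the documented default, interpreted here as 64 * 2^20 bytes. */
  public static final long DEFAULT_BUNDLE_BYTES = 64L * 1024 * 1024;

  /** Ceiling division: number of ranges of at most bundleBytes needed to cover fileBytes. */
  public static long estimateRanges(long fileBytes, long bundleBytes) {
    if (fileBytes < 0 || bundleBytes <= 0) {
      throw new IllegalArgumentException("fileBytes must be >= 0 and bundleBytes > 0");
    }
    return (fileBytes + bundleBytes - 1) / bundleBytes;
  }

  public static void main(String[] args) {
    long fileBytes = 200L * 1024 * 1024; // a hypothetical 200 MB Avro file
    System.out.println(estimateRanges(fileBytes, DEFAULT_BUNDLE_BYTES)); // 4
    System.out.println(estimateRanges(fileBytes, 1024L * 1024));         // 200
  }
}
```

Under these assumptions a smaller bundle size yields many more, smaller units of work, which is why a value such as 1 MB may suit streaming applications better than the default.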
-
expand
public org.apache.beam.sdk.values.PCollection<T> expand(org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.io.FileIO.ReadableFile> input)
- Specified by:
expand in class org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.io.FileIO.ReadableFile>,org.apache.beam.sdk.values.PCollection<T>>
-
populateDisplayData
public void populateDisplayData(org.apache.beam.sdk.transforms.display.DisplayData.Builder builder)
- Specified by:
populateDisplayData in interface org.apache.beam.sdk.transforms.display.HasDisplayData
- Overrides:
populateDisplayData in class org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PCollection<org.apache.beam.sdk.io.FileIO.ReadableFile>,org.apache.beam.sdk.values.PCollection<T>>
-
-