Generate the dataframe that will be used in the OpPipeline calling this method
features to generate from the dataset read in by this reader
op parameters
Spark instance used to read the data and convert it from an RDD to a DataFrame
A dataframe containing columns with all of the raw input features expected by the pipeline
All the reader's sub readers (used in joins)
sub readers
Reader type tag
Full reader input type name
full input type name
Default method for extracting this reader's parameters from readerParams in OpParams
contains map of reader type to ReaderParams instances
ReaderParams instance if it exists
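The lookup described above can be sketched as an optional map lookup. This is a minimal illustration, not the library's implementation: the `ReaderParams` and `OpParams` case classes below are simplified stand-ins, and keying the map by a string type name is an assumption.

```scala
// Hypothetical, simplified stand-ins for illustration only.
final case class ReaderParams(path: Option[String] = None)
final case class OpParams(readerParams: Map[String, ReaderParams] = Map.empty)

// Return the parameters registered under the reader's type name, if any.
def getReaderParams(opParams: OpParams, typeName: String): Option[ReaderParams] =
  opParams.readerParams.get(typeName)

// Hypothetical configuration with a single entry.
val params = OpParams(Map("Passenger" -> ReaderParams(Some("/data/passengers.csv"))))
val found = getReaderParams(params, "Passenger")
val missing = getReaderParams(params, "Ticket")
```

Returning an `Option` lets a reader fall back to its own defaults when no entry exists for its type.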
Inner join
Type of data read by right data reader
reader from right side of join
join keys to use
joined reader
Join readers
Type of data read by right data reader
reader from right side of join
type of join to perform
join keys to use
joined reader
Left Outer join
Type of data read by right data reader
reader from right side of join
join keys to use
joined reader
Outer join
Type of data read by right data reader
reader from right side of join
join keys to use
joined reader
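The difference between the inner, left outer, and outer joins can be illustrated with plain Scala collections. This sketch only shows the general join semantics on keyed records; the `left` and `right` maps are hypothetical data, and nothing here uses the reader API itself.

```scala
// Hypothetical keyed records standing in for the left and right readers' data.
val left: Map[String, Int] = Map("a" -> 1, "b" -> 2)
val right: Map[String, String] = Map("b" -> "x", "c" -> "y")

// Inner join: keep only keys present on both sides.
val inner: Map[String, (Int, String)] =
  left.toSeq.collect { case (k, l) if right.contains(k) => (k, (l, right(k))) }.toMap

// Left outer join: keep every left key; the right value is optional.
val leftOuter: Map[String, (Int, Option[String])] =
  left.toSeq.map { case (k, l) => (k, (l, right.get(k))) }.toMap

// Outer join: keep every key from either side; both values are optional.
val outer: Map[String, (Option[Int], Option[String])] =
  (left.keySet ++ right.keySet).toSeq.map(k => (k, (left.get(k), right.get(k)))).toMap
```

Here `inner` keeps only key `b`, `leftOuter` keeps `a` and `b`, and `outer` keeps `a`, `b`, and `c`.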
Short reader input type name
short reader input type name
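One plausible relationship between the full and short type names is that the short name is the last dot-separated segment of the full name. The sketch below assumes exactly that; the helper `shortTypeName` and the example type name are hypothetical and not taken from the source.

```scala
// Assumption: the short name is the last dot-separated segment of the full name.
def shortTypeName(fullTypeName: String): String =
  fullTypeName.split("\\.").last

val fullName = "com.example.data.Passenger" // hypothetical fully qualified type name
val shortName = shortTypeName(fullName)
```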