The name of the parent feature(s) for the column. Usually a column has one parent feature, but can have many (eg. in the case of multiple Text columns being vectorized using a shared hash space)
The type of the parent feature(s) for the column
The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys). Every other column in the same vector that has this grouping should be mutually exclusive to this one. If there is no grouping then this field is None
A name for an binary indicator value (null indicator or result of a pivot or whatever that value is), otherwise None eg this is none when the column is from a numeric group that is not pivoted
A name for a value that is continuous (not a binary indicator) eg for geolocation (lat, lon, accuracy) or for dates that have been converted to a circular representation the time window and x or y coordinate, otherwise None
Index of the vector this info is associated with (this is updated when OpVectorColumnMetadata is passed into OpVectorMetadata
A name for a value that is continuous (not a binary indicator) eg for geolocation (lat, lon, accuracy) or for dates that have been converted to a circular representation the time window and x or y coordinate, otherwise None
Get the feature grouping qualified by the parent feature name
Get the feature grouping qualified by the parent feature name
Optional string of feature grouping
The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys).
The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys). Every other column in the same vector that has this grouping should be mutually exclusive to this one. If there is no grouping then this field is None
Does column have parent features of which are subtypes of feature type O
Does column have parent features of specified feature type O
Index of the vector this info is associated with (this is updated when OpVectorColumnMetadata is passed into OpVectorMetadata
A name for an binary indicator value (null indicator or result of a pivot or whatever that value is), otherwise None eg this is none when the column is from a numeric group that is not pivoted
Is this column corresponds to a null-encoded value
Is this column corresponds to a null-encoded value
true if this column corresponds to a null-encoded value
Is this column corresponds the other category of a one hot encoded categorical
Is this column corresponds the other category of a one hot encoded categorical
true if this column corresponds to the other category of a one hot encoded categorical
Make unique name for this column
Make unique name for this column
String name for this column
The name of the parent feature(s) for the column.
The name of the parent feature(s) for the column. Usually a column has one parent feature, but can have many (eg. in the case of multiple Text columns being vectorized using a shared hash space)
The type of the parent feature(s) for the column
Return parent features names with the key (grouping) from any map parents included in name
Return parent features names with the key (grouping) from any map parents included in name
Sequence of parent feature names, simple names when features are not maps, names plus keys for columns with map parent features
Write this instance to json string
Write this instance to json string
should pretty print
json string of the instance
Convert this column into Spark metadata.
Convert this column into Spark metadata.
column Spark metadata
Convert this column into Spark metadata.
Convert this column into Spark metadata.
Indexes of vector columns that match this OpVectorColumnMetadata
column Spark metadata
This instance json string
This instance json string
json string of the instance
Represents the metadata a column in a vector.
Because we expect every vector column to have been produced by some vectorization process, we provide the name of the feature that led to this column.
Also note that each column's indicator value should be unique, meaning that they represent mutually exclusive values. The output of a hashing vectorizer, for instance, does not produce mutually exclusive values.
The name of the parent feature(s) for the column. Usually a column has one parent feature, but can have many (eg. in the case of multiple Text columns being vectorized using a shared hash space)
The type of the parent feature(s) for the column
The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys). Every other column in the same vector that has this grouping should be mutually exclusive to this one. If there is no grouping then this field is None
A name for an binary indicator value (null indicator or result of a pivot or whatever that value is), otherwise None eg this is none when the column is from a numeric group that is not pivoted
A name for a value that is continuous (not a binary indicator) eg for geolocation (lat, lon, accuracy) or for dates that have been converted to a circular representation the time window and x or y coordinate, otherwise None
Index of the vector this info is associated with (this is updated when OpVectorColumnMetadata is passed into OpVectorMetadata