public class MLOpsSparkUtils extends Object
| Modifier and Type | Class and Description |
|---|---|
static class |
MLOpsSparkUtils.PredictionsDataParameters |
static class |
MLOpsSparkUtils.PredictionStatisticsParameters |
static class |
MLOpsSparkUtils.RawPredictionDataParameters |
| Constructor and Description |
|---|
MLOpsSparkUtils() |
| Modifier and Type | Method and Description |
|---|---|
static void |
reportActuals(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentId,
String modelId,
String channelConfig)
Report actual values for predictions.
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames,
Integer predictionsServiced)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames,
String assocIdColName)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames,
String assocIdColName,
Integer predictionsServiced)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames,
Integer predictionsServiced)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames,
String assocIdColName)
Report predictions and features given a DataFrame (Dataset of Row).
|
static void |
reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames,
String assocIdColName,
Integer predictionsServiced)
Report predictions and features given a DataFrame (Dataset of Row).
|
public static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames,
String assocIdColName,
Integer predictionsServiced)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a String[] and accepts association ids and allows the caller to override the number of predictions serviced, which by default is the number of predictions included in the dataframe.
dataFrame - The DataFrame to reportdeploymentID - ID of the deployment this data belongs to in MLOps DataRobotmodelId - Model Id used for predictionschannelConfig - A string containing the MLOps channel configurationscoringTime - The time it took to perform scoring in milliseconds.targetColumnNames - An array of column names to use for the target of the prediction. If only one item
is provided, this is assumed to be a regression prediction. If more than one, it is
a classification prediction.assocIdColName - The name of the column containing the association ids for the predictionspredictionsServiced - The number of predictions actually serviced. This may be higher than the number
of predictions reported here in the dataframe if downsampling is applied before
reporting.com.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames,
String assocIdColName)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a String[] and accepts association ids.
dataFrame - The DataFrame to reportdeploymentID - ID of the deployment this data belongs to in MLOps DataRobotmodelId - Model Id used for predictionschannelConfig - A string containing the MLOps channel configurationscoringTime - The time it took to perform scoring in milliseconds.targetColumnNames - An array of column names to use for the target of the prediction. If only one item
is provided, this is assumed to be a regression prediction. If more than one, it is
a classification prediction.assocIdColName - The name of the column containing the association ids for the predictionscom.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames,
String assocIdColName,
Integer predictionsServiced)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a List of String and allows the caller to override the number of predictions serviced, which by default is the number of predictions included in the dataframe.
dataFrame - The DataFrame to reportdeploymentID - Id of the deployment this data belongs to in DataRobot MLOpsmodelId - Model Id used for predictionschannelConfig - A string containing MLOps channel configurationscoringTime - Time it took to perform scoring in millisecondstargetColumnNames - An array of column names to use for the target of the prediction. If only one item
is provided, this is assumed to be a regression prediction. If more than one, it is
a classification prediction.assocIdColName - The name of the column containing the association ids for the predictionspredictionsServiced - The number of predictions actually serviced. This may be higher than the number
of predictions reported here in the dataframe if downsampling is applied before
reporting.com.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames,
String assocIdColName)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a List of String.
dataFrame - The DataFrame to reportdeploymentID - Id of the deployment this data belongs to in DataRobot MLOpsmodelId - Model Id used for predictionschannelConfig - A string containing MLOps channel configurationscoringTime - Time it took to perform scoring in millisecondstargetColumnNames - An array of column names to use for the target of the prediction. If only one item
is provided, this is assumed to be a regression prediction. If more than one, it is
a classification prediction.assocIdColName - The name of the column containing the association ids for the predictionscom.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames,
Integer predictionsServiced)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a String[] and does not accept association ids and allows the caller to override the number of predictions serviced, which by default is the number of predictions included in the dataframe.
dataFrame - The DataFrame to reportdeploymentID - Id of the deployment this data belongs to in DataRobot MLOpsmodelId - Model Id used for predictionschannelConfig - A string containing MLOps channel configurationscoringTime - Time it took to perform scoring in millisecondstargetColumnNames - An array of column names to use for the target of the prediction. If only one item
if provided, this is assumed to be a regression prediction. If more than one, it is
a classification prediction.predictionsServiced - The number of predictions actually serviced. This may be higher than the number
of predictions reported here in the dataframe if downsampling is applied before
reporting.com.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
String[] targetColumnNames)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a String[] and does not accept association ids.
dataFrame - The DataFrame to reportdeploymentID - Id of the deployment this data belongs to in DataRobot MLOpsmodelId - Model Id used for predictionschannelConfig - A string containing MLOps channel configurationscoringTime - Time it took to perform scoring in millisecondstargetColumnNames - An array of column names to use for the target of the prediction. If only one item
if provided, this is assumed to be a regression prediction. If more than one, it is
a classification prediction.com.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames,
Integer predictionsServiced)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a List of String, does not accept association ids, and allows the caller to override the number of predictions serviced, which by default is the number of predictions included in the dataframe.
dataFrame - The DataFrame to reportdeploymentID - Id of the deployment this data belongs to in DataRobot MLOpsmodelId - Model Id used for predictionschannelConfig - A string containing MLOps channel configurationscoringTime - Time it took to perform scoring in millisecondstargetColumnNames - An array of column names to use for the target of the prediction. If only one item
if provided, this is assumed to be a regression prediction. If more then one, it is
a classification prediction.predictionsServiced - The number of predictions actually serviced. This may be higher than the number
of predictions reported here in the dataframe if downsampling is applied before
reporting.com.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportPredictions(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentID,
String modelId,
String channelConfig,
Double scoringTime,
List<String> targetColumnNames)
throws com.datarobot.mlops.common.exceptions.DRCommonException
This variant of the function gets the target columns as a List of String and does not accept association ids.
dataFrame - The DataFrame to reportdeploymentID - Id of the deployment this data belongs to in DataRobot MLOpsmodelId - Model Id used for predictionschannelConfig - A string containing MLOps channel configurationscoringTime - Time it took to perform scoring in millisecondstargetColumnNames - An array of column names to use for the target of the prediction. If only one item
if provided, this is assumed to be a regression prediction. If more then one, it is
a classification prediction.com.datarobot.mlops.common.exceptions.DRCommonExceptionpublic static void reportActuals(org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> dataFrame,
String deploymentId,
String modelId,
String channelConfig)
throws com.datarobot.mlops.common.exceptions.DRCommonException
dataFrame - the DataFrame containing the actuals as specified abovedeploymentId - The deployment id these actuals belong tomodelId - The model id that used for the predictionschannelConfig - MLOps channel configuration stringcom.datarobot.mlops.common.exceptions.DRCommonExceptionCopyright © 2022. All rights reserved.