Update Linear and Logistic Regression Parameters & Improve Documentation
- Add comprehensive documentation for supported:
  - Optimizers (SIMPLE_SGD, LINEAR_DECAY_SGD, etc.)
  - Objective types (ABSOLUTE_LOSS, HUBER, SQUARED_LOSS)
  - Momentum types (STANDARD, NESTEROV)
- Fix parameter name typos

Signed-off-by: rithin-pullela-aws <[email protected]>
rithin-pullela-aws committed Dec 23, 2024
1 parent 1006421 commit aef0607
Showing 1 changed file with 29 additions and 14 deletions.
43 changes: 29 additions & 14 deletions _ml-commons-plugin/algorithms.md
@@ -59,20 +59,31 @@ The training process supports multithreading, but the number of threads must be

## Linear regression

-Linear regression maps the linear relationship between inputs and outputs. In ML Commons, the linear regression algorithm is adopted from the public machine learning library [Tribuo](https://tribuo.org/), which offers multidimensional linear regression models. The model supports the linear optimizer in training, including popular approaches like Linear Decay, SQRT_DECAY, [ADA](https://www.jmlr.org/papers/volume12/duchi11a/duchi11a.pdf), [ADAM](https://tribuo.org/learn/4.1/javadoc/org/tribuo/math/optimisers/Adam.html), and [RMS_DROP](https://tribuo.org/learn/4.1/javadoc/org/tribuo/math/optimisers/RMSProp.html).
+Linear regression maps the linear relationship between inputs and outputs. In ML Commons, the linear regression algorithm is adopted from the public machine learning library [Tribuo](https://tribuo.org/), which offers multidimensional linear regression models. The model supports linear optimizers for training, including popular approaches such as Linear Decay, SQRT_DECAY, [ADA](https://www.jmlr.org/papers/volume12/duchi11a/duchi11a.pdf), [ADAM](https://tribuo.org/learn/4.1/javadoc/org/tribuo/math/optimisers/Adam.html), and [RMS_PROP](https://tribuo.org/learn/4.1/javadoc/org/tribuo/math/optimisers/RMSProp.html).

+**Supported optimizers:** `SIMPLE_SGD`, `LINEAR_DECAY_SGD`, `SQRT_DECAY_SGD`, `ADA_GRAD`, `ADA_DELTA`, `ADAM`, and `RMS_PROP`.
+**Supported objectives:** `ABSOLUTE_LOSS`, `HUBER`, and `SQUARED_LOSS`.
+**Supported momentum types:** `STANDARD` and `NESTEROV` (sketched below).
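
The two momentum types follow the conventional SGD momentum update rules. As a general sketch in standard notation (not Tribuo-specific), with $\mu$ playing the role of `momentum_factor` and $\eta$ the role of `learning_rate`:

```latex
% STANDARD (heavy-ball) momentum: gradient evaluated at the current parameters
v_{t+1} = \mu v_t - \eta \nabla L(\theta_t), \qquad \theta_{t+1} = \theta_t + v_{t+1}

% NESTEROV momentum: gradient evaluated at the look-ahead point \theta_t + \mu v_t
v_{t+1} = \mu v_t - \eta \nabla L(\theta_t + \mu v_t), \qquad \theta_{t+1} = \theta_t + v_{t+1}
```

With `momentum_factor` left at its default of `0`, both variants reduce to plain SGD.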

### Parameters

Parameter | Type | Description | Default value
:--- |:--- | :--- | :---
-`learningRate` | Double | The initial step size used in an iterative optimization algorithm. | `0.01`
-`momentumFactor` | Double | The extra weight factors that accelerate the rate at which the weight is adjusted. This helps move the minimization routine out of local minima. | `0`
+`target` | String | The name of the target variable to predict. Identifies which feature the model will learn to predict during training. | `NA`
+`learning_rate` | Double | The initial step size used in an iterative optimization algorithm. | `0.01`
+`momentum_factor` | Double | The extra weight factors that accelerate the rate at which the weight is adjusted. This helps move the minimization routine out of local minima. | `0`
`epsilon` | Double | A small constant added for numerical stability, preventing division by near-zero gradient terms. | `1.00E-06`
`beta1` | Double | The exponential decay rate for the first-moment (mean) estimates. | `0.9`
`beta2` | Double | The exponential decay rate for the second-moment (variance) estimates. | `0.99`
-`decayRate` | Double | The Root Mean Squared Propagation (RMSProp). | `0.9`
-`momentumType` | String | The defined Stochastic Gradient Descent (SGD) momentum type that helps accelerate gradient vectors in the right directions, leading to a fast convergence.| `STANDARD`
-`optimizerType` | String | The optimizer used in the model. | `SIMPLE_SGD`
+`decay_rate` | Double | The decay rate used by the Root Mean Squared Propagation (RMSProp) optimizer. | `0.9`
+`momentum_type` | String | The Stochastic Gradient Descent (SGD) momentum type, which accelerates gradient updates in the relevant direction for faster convergence. | `STANDARD`
+`optimiser` | String | The optimizer used in the model. | `SIMPLE_SGD`
+`objective` | String | The objective function used. | `SQUARED_LOSS`
+`epochs` | Integer | The number of iterations. | `5`
+`batch_size` | Integer | The size of mini-batches. | `1`
+`logging_interval` | Integer | The frequency of logging during training iterations. Set to `-1` to disable logging. | `-1`
+`seed` | Long | The random seed for reproducible results. Controls the initialization of random number generators. | `12345`
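
Taken together, a minimal Train API request for linear regression might look like the following sketch. The endpoint follows the ML Commons Train API pattern; the `area`/`price` columns and all data values are illustrative assumptions, not values from this change:

```json
POST /_plugins/_ml/_train/linear_regression
{
  "parameters": {
    "target": "price",
    "learning_rate": 0.01,
    "momentum_factor": 0.9,
    "momentum_type": "NESTEROV",
    "optimiser": "SIMPLE_SGD",
    "objective": "SQUARED_LOSS",
    "epochs": 5,
    "batch_size": 1,
    "seed": 12345
  },
  "input_data": {
    "column_metas": [
      { "name": "area", "column_type": "DOUBLE" },
      { "name": "price", "column_type": "DOUBLE" }
    ],
    "rows": [
      { "values": [ { "column_type": "DOUBLE", "value": 1200.0 }, { "column_type": "DOUBLE", "value": 150.0 } ] },
      { "values": [ { "column_type": "DOUBLE", "value": 1800.0 }, { "column_type": "DOUBLE", "value": 230.0 } ] }
    ]
  }
}
```

Pairing `NESTEROV` momentum with `SIMPLE_SGD` is deliberate in this sketch: the momentum settings apply to the SGD-family optimizers, while `ADAM` and `RMS_PROP` scale their updates through `beta1`/`beta2` and `decay_rate`, respectively.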



### Supported APIs
@@ -412,23 +423,27 @@ The Localization algorithm can only be executed directly. Therefore, it cannot be

A classification algorithm, logistic regression models the probability of a discrete outcome given an input variable. In ML Commons, these classifications include both binary and multi-class. The most common is the binary classification, which takes two values, such as "true/false" or "yes/no", and predicts the outcome based on the values specified. Alternatively, a multi-class output can categorize different inputs based on type. This makes logistic regression most useful for situations where you are trying to determine how your inputs fit best into a specified category.

+**Supported optimizers:** `SIMPLE_SGD`, `LINEAR_DECAY_SGD`, `SQRT_DECAY_SGD`, `ADA_GRAD`, `ADA_DELTA`, `ADAM`, and `RMS_PROP`.
+**Supported objectives:** `HINGE` and `LOGMULTICLASS`.
+**Supported momentum types:** `STANDARD` and `NESTEROV`.

### Parameters

| Parameter | Type | Description | Default value |
|---|---|---|---|
-| `learningRate` | Double | The initial step size used in an iterative optimization algorithm. | `1` |
-| `momentumFactor` | Double | The extra weight factors that accelerate the rate at which the weight is adjusted. This helps move the minimization routine out of local minima. | `0` |
+| `learning_rate` | Double | The initial step size used in an iterative optimization algorithm. | `1` |
+| `momentum_factor` | Double | The extra weight factors that accelerate the rate at which the weight is adjusted. This helps move the minimization routine out of local minima. | `0` |
| `epsilon` | Double | A small constant added for numerical stability, preventing division by near-zero gradient terms. | `0.1` |
| `beta1` | Double | The exponential decay rate for the first-moment (mean) estimates. | `0.9` |
| `beta2` | Double | The exponential decay rate for the second-moment (variance) estimates. | `0.99` |
-| `decayRate` | Double | The Root Mean Squared Propagation (RMSProp). | `0.9` |
-| `momentumType` | String | The Stochastic Gradient Descent (SGD) momentum that helps accelerate gradient vectors in the right direction, leading to faster convergence between vectors. | `STANDARD` |
-| `optimizerType` | String | The optimizer used in the model. | `AdaGrad` |
+| `decay_rate` | Double | The decay rate used by the Root Mean Squared Propagation (RMSProp) optimizer. | `0.9` |
+| `momentum_type` | String | The Stochastic Gradient Descent (SGD) momentum type, which accelerates gradient updates in the relevant direction for faster convergence. | `STANDARD` |
+| `optimiser` | String | The optimizer used in the model. | `ADA_GRAD` |
| `target` | String | The target field. | null |
-| `objectiveType` | String | The objective function type. | `LogMulticlass` |
+| `objective` | String | The objective function type. | `LOGMULTICLASS` |
| `epochs` | Integer | The number of iterations. | `5` |
-| `batchSize` | Integer | The size of min batches. | `1` |
-| `loggingInterval` | Integer | The interval of logs lost after many iterations. The interval is `1` if the algorithm contains no logs. | `1000` |
+| `batch_size` | Integer | The size of mini-batches. | `1` |
+| `logging_interval` | Integer | The frequency of logging during training iterations. Set to `-1` to disable logging. | `1000` |
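
A corresponding Train API sketch for logistic regression follows; the `class` target, the feature columns, and the row values are illustrative assumptions:

```json
POST /_plugins/_ml/_train/logistic_regression
{
  "parameters": {
    "target": "class",
    "learning_rate": 1,
    "optimiser": "ADA_GRAD",
    "objective": "LOGMULTICLASS",
    "epochs": 5,
    "batch_size": 1
  },
  "input_data": {
    "column_metas": [
      { "name": "sepal_length", "column_type": "DOUBLE" },
      { "name": "sepal_width", "column_type": "DOUBLE" },
      { "name": "class", "column_type": "STRING" }
    ],
    "rows": [
      { "values": [ { "column_type": "DOUBLE", "value": 5.1 }, { "column_type": "DOUBLE", "value": 3.5 }, { "column_type": "STRING", "value": "setosa" } ] },
      { "values": [ { "column_type": "DOUBLE", "value": 6.4 }, { "column_type": "DOUBLE", "value": 2.9 }, { "column_type": "STRING", "value": "versicolor" } ] }
    ]
  }
}
```

`LOGMULTICLASS` covers both binary and multi-class targets, so the same request shape works for a two-valued `class` column.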

### Supported APIs

