Distributed Processing
Parameters
Parameter

Default Value

Description
 

algorithmFPType 
float 
The floatingpoint type that the algorithm uses for intermediate computations. Can be
float
or
double
.
 
method 
fastCSR 
Performanceoriented computation method for CSR numeric tables, the only method supported by the algorithm.
 
nFactors 
10

The total number of factors.
 
fullNUsers 
0

The total number of users
m
.
 
partition 
Not applicable

Numeric table of size either 1 x 1 that provides the number of input data parts; or (
nblocks
+ 1) x 1, where
nblocks
is the number of input data parts, and the
i
th element contains the offset of the transposed
i
th data part to be computed by the initialization algorithm.
 
engine 
SharePtr< engines:: mt19937:: Batch>() 
Pointer to the random number generator engine that is used internally at the initialization step.

Step 1  on Local Nodes
Input
Input ID

Input
 

dataColumnSlice 
An
n _{
i}
x
m
numeric table with the part of the input data set. Each node holds
n _{
i}
rows of the full transposed input data set
R ^{
T}
. The input should be an object of
CSRNumericTable
class.

Output
Partial Result ID

Result
 

partialModel 
The model with initialized item factors. The result can only be an object of the
PartialModel
class.
 
outputOfInitForComputeStep3 
A keyvalue data collection that maps components of the partial model to the local nodes.
 
offsets 
A keyvalue data collection of size
nblocks
that holds the starting offsets of the factor indices on each node.
 
outputOfStep1ForStep2 
A keyvalue data collection of size
nblocks
that contains the parts of the input numeric table:
j
th element of this collection is a numeric table of size
m _{
j}
x
n _{
i}
, where
m _{
j}
are defined by the
partition
parameter.

Step 2  on Local Nodes
Input
Input ID

Input
 

inputOfStep2FromStep1 
A keyvalue data collection of size
nblocks
that contains the parts of the input data set:
i
th element of this collection is a numeric table of size
m _{
i}
x
n _{
i}
. Each numeric table in the collection should be an object of
CSRNumericTable
class.

Output
Partial Result ID

Result
 

dataRowSlice 
An
m _{
j}
x
n
numeric table with the mining data.
j
th node gets
m _{
j}
rows of the full input data set
R
.
 
outputOfInitForComputeStep3 
A keyvalue data collection that maps components of the partial model to the local nodes.
 
offsets 
A keyvalue data collection of size
nblocks
that holds the starting offsets of the factor indices on each node.
