mv-apply operator

Applies a subquery to each record, and returns the union of the results of all subqueries.

For example, assume a table T has a column Metric of type dynamic whose values are arrays of real numbers. The following query will locate the two biggest values in each Metric value, and return the records corresponding to these values.

T | mv-apply Metric to typeof(real) on 
(
   top 2 by Metric desc
)

The mv-apply operator has the following processing steps:

  1. Uses the mv-expand operator to expand each record in the input into subtables (order is preserved).
  2. Applies the subquery for each of the subtables.
  3. Adds zero or more columns to the resulting subtable. These columns contain the values of the source columns that aren't expanded, and are repeated where needed.
  4. Returns the union of the results.

The mv-apply operator gets the following inputs:

  1. One or more expressions that evaluate into dynamic arrays to expand. The number of records in each expanded subtable is the maximum length of each of those dynamic arrays. Null values are added where multiple expressions are specified and the corresponding arrays have different lengths.

  2. Optionally, the names to assign the values of the expressions after expansion. These names become the columns names in the subtables. If not specified, the original name of the column is used when the expression is a column reference. A random name is used otherwise.

    Note

    It is recommended to use the default column names.

  3. The data types of the elements of those dynamic arrays, after expansion. These become the column types of the columns in the subtables. If not specified, dynamic is used.

  4. Optionally, the name of a column to add to the subtables that specifies the 0-based index of the element in the array that resulted in the subtable record.

  5. Optionally, the maximum number of array elements to expand.

The mv-apply operator can be thought of as a generalization of the mv-expand operator (in fact, the latter can be implemented by the former, if the subquery includes only projections.)

Syntax

T | mv-apply [ItemIndex] ColumnsToExpand [RowLimit] on ( SubQuery )

Where ItemIndex has the syntax:

with_itemindex = IndexColumnName

ColumnsToExpand is a comma-separated list of one or more elements of the form:

[Name =] ArrayExpression [to typeof (Typename)]

RowLimit is simply:

limit RowLimit

and SubQuery has the same syntax of any query statement.

Arguments

  • ItemIndex: If used, indicates the name of a column of type long that is appended to the input as part of the array-expansion phase and indicates the 0-based array index of the expanded value.

  • Name: If used, the name to assign the array-expanded values of each array-expanded expression. If not specified, the name of the column will be used if available. A random name is generated if ArrayExpression is not a simple column name.

  • ArrayExpression: An expression of type dynamic whose values will be array-expanded. If the expression is the name of a column in the input, the input column is removed from the input and a new column of the same name (or ColumnName if specified) appears in the output.

  • Typename: If used, the name of the type that the individual elements of the dynamic array ArrayExpression take. Elements that do not conform to this type will be replaced by a null value. (If unspecified, dynamic is used by default.)

  • RowLimit: If used, a limit on the number of records to generate from each record of the input. (If unspecified, 2147483647 is used.)

  • SubQuery: A tabular query expression with an implicit tabular source that gets applied to each array-expanded subtable.

Notes

  • Unlike the mv-expand operator, the mv-apply operator does not support bagexpand=array expansion. If the expression to be expanded is a property bag and not an array, it is possible to use an inner mv-expand operator (see example below).

Examples

Getting the largest element from the array

let _data =
range x from 1 to 8 step 1
| summarize l=make_list(x) by xMod2 = x % 2;
_data
| mv-apply element=l to typeof(long) on 
(
   top 1 by element
)
xMod2 l element
1 [1, 3, 5, 7] 7
0 [2, 4, 6, 8] 8

Calculating the sum of the largest two elements in an array

let _data =
range x from 1 to 8 step 1
| summarize l=make_list(x) by xMod2 = x % 2;
_data
| mv-apply l to typeof(long) on
(
   top 2 by l
   | summarize SumOfTop2=sum(l)
)
xMod2 l SumOfTop2
1 [1,3,5,7] 12
0 [2,4,6,8] 14

Using with_itemindex for working with a subset of the array

let _data =
range x from 1 to 10 step 1
| summarize l=make_list(x) by xMod2 = x % 2;
_data
| mv-apply with_itemindex=index element=l to typeof(long) on 
(
   // here you have 'index' column
   where index >= 3
)
| project index, element
index element
3 7
4 9
3 8
4 10

Applying mv-apply to a property bag

In the following example, mv-apply is used in combination with an inner mv-expand to remove values that don't start with "555" from a property bag:

datatable(SourceNumber:string,TargetNumber:string,CharsCount:long)
[
    '555-555-1234','555-555-1212',46,
    '555-555-1212','',int(null)
]
| extend values = pack_all()
| mv-apply removeProperties = values on 
(
    mv-expand kind = array values
    | where values[1] !startswith "555"
    | summarize propsToRemove = make_set(values[0])
)
| extend values = bag_remove_keys(values, propsToRemove)
| project-away propsToRemove
SourceNumber TargetNumber CharsCount values
555-555-1234 555-555-1212 46 {
"SourceNumber": "555-555-1234",
"TargetNumber": "555-555-1212"
}
555-555-1212     {
"SourceNumber": "555-555-1212"
}

See also