The LDA transform implements LightLDA, a state-of-the-art implementation of Latent Dirichlet Allocation.

Dirichlet prior on document-topic vectors

Dirichlet prior on vocab-topic vectors
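
As a rough illustration of what a Dirichlet prior controls, the sketch below draws topic proportions from a symmetric Dirichlet distribution at a small and a large concentration value. It uses only the Python standard library. The helper names (sample_dirichlet, mean_largest_share) and the values 0.1 and 10.0 are assumptions made for this example, not the transform's API or its defaults.

```python
import random

def sample_dirichlet(concentration, dim, rng):
    """Draw one sample from a symmetric Dirichlet(concentration) over dim outcomes."""
    # A Dirichlet sample is a vector of independent Gamma draws, normalized to sum to 1.
    draws = [rng.gammavariate(concentration, 1.0) for _ in range(dim)]
    total = sum(draws)
    if total == 0.0:
        # Guard against numerical underflow at very small concentrations.
        return [1.0 / dim] * dim
    return [d / total for d in draws]

def mean_largest_share(concentration, dim, num_samples, rng):
    """Average, over many samples, of the largest component of each sample."""
    total = 0.0
    for _ in range(num_samples):
        total += max(sample_dirichlet(concentration, dim, rng))
    return total / num_samples

rng = random.Random(0)
num_topics = 5  # hypothetical topic count for the illustration

# Small prior: most of the mass lands on one topic (sparse mixtures).
sparse_share = mean_largest_share(0.1, num_topics, 500, rng)
# Large prior: mass is spread almost evenly across topics (smooth mixtures).
smooth_share = mean_largest_share(10.0, num_topics, 500, rng)
print(sparse_share, smooth_share)
```

In this sketch a smaller prior yields documents dominated by a single topic, while a larger prior yields mixtures closer to uniform. The same reasoning applies to the vocab-topic prior, with words in place of topics.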

New column definition(s) (optional form: name:srcs)

Input dataset

Compute log likelihood over local dataset on this iteration interval

Number of Metropolis-Hastings steps
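
For readers unfamiliar with the sampler, the following is a minimal sketch of what a fixed number of Metropolis-Hastings steps does when resampling one token's topic. It assumes a uniform proposal over topics and a list of unnormalized positive topic weights. LightLDA itself uses different, more efficient proposals, so this shows the general technique only. The function name mh_resample and the example weights are hypothetical.

```python
import random

def mh_resample(current, weights, num_steps, rng):
    """Run num_steps Metropolis-Hastings steps over topic indices.

    weights holds unnormalized, strictly positive target weights per topic.
    The proposal is uniform over topics, so it is symmetric and the
    acceptance ratio reduces to weights[proposal] / weights[topic].
    """
    topic = current
    for _ in range(num_steps):
        proposal = rng.randrange(len(weights))
        accept = min(1.0, weights[proposal] / weights[topic])
        if rng.random() < accept:
            topic = proposal
    return topic

rng = random.Random(0)
weights = [1.0, 3.0]  # hypothetical: topic 1 is three times as likely as topic 0

# Start every chain in topic 0 and count how often it ends in topic 1.
num_chains = 4000
hits = sum(mh_resample(0, weights, 10, rng) for _ in range(num_chains))
print(hits / num_chains)
```

More steps per token bring each draw closer to the target distribution at the cost of extra computation per iteration.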

The number of burn-in iterations

Number of iterations

The maximum number of tokens per document

The number of words used to summarize each topic
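
A topic summary of this kind can be sketched as picking the highest-weighted words of a topic. The example below assumes each topic is available as a mapping from word to weight. How the transform scores words internally is not stated here, so the function name summarize_topic, the ranking rule, and the sample weights are illustrative assumptions.

```python
def summarize_topic(word_weights, num_words):
    """Return the num_words highest-weighted (word, weight) pairs of one topic."""
    # Sort by descending weight; break ties alphabetically for a stable result.
    ranked = sorted(word_weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:num_words]

# Hypothetical topic-word weights for a single topic.
topic = {"gene": 0.31, "cell": 0.24, "protein": 0.18, "the": 0.02, "dna": 0.25}

print(summarize_topic(topic, 3))
# prints [('gene', 0.31), ('dna', 0.25), ('cell', 0.24)]
```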

The number of training threads. Default value depends on number of logical processors.

The number of topics in the LDA model

Whether to output the topic-word summary in text format

Reset the random number generator for each document


