Formerly, the site knowceans.org served as a code repository. This is the mirror site and may become the only one in the future. Most of this code is licensed under the GPL or LGPL.

Latent Dirichlet allocation in Java:

lda-j (version 20050325) is a Java 1.5 port of David Blei's lda-c. See the javadoc and the C implementation, lda-c.

LdaGibbsSampler.java is a working "hack" of the MCMC algorithm for LDA in one Java class.
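
To give a rough idea of what an MCMC sampler for LDA packed into one Java class can look like, here is a minimal sketch of collapsed Gibbs sampling. It is not the code of LdaGibbsSampler.java: the class name TinyLdaGibbs, its fields and methods, the symmetric hyperparameters alpha and beta, and the toy corpus are all assumptions made for illustration. Each sweep removes a word's current topic assignment from the counts, draws a new topic with probability proportional to (nw[w][k] + beta) / (nwsum[k] + V * beta) * (nd[m][k] + alpha), and adds the word back.

```java
import java.util.Arrays;
import java.util.Random;

// Illustrative collapsed Gibbs sampler for LDA. All names are hypothetical;
// this is a sketch, not the code of LdaGibbsSampler.java.
class TinyLdaGibbs {
    final int[][] docs;     // docs[m][n]: vocabulary index of word n in document m
    final int vocabSize;    // V, number of distinct words
    final int numTopics;    // K, number of topics
    final double alpha;     // symmetric Dirichlet prior on document-topic mixtures
    final double beta;      // symmetric Dirichlet prior on topic-word distributions
    final int[][] z;        // z[m][n]: topic currently assigned to word n of document m
    final int[][] nw;       // nw[w][k]: how often word w is assigned to topic k
    final int[][] nd;       // nd[m][k]: how many words of document m are assigned to topic k
    final int[] nwsum;      // nwsum[k]: total number of words assigned to topic k
    final Random rng;

    TinyLdaGibbs(int[][] docs, int vocabSize, int numTopics,
                 double alpha, double beta, long seed) {
        this.docs = docs;
        this.vocabSize = vocabSize;
        this.numTopics = numTopics;
        this.alpha = alpha;
        this.beta = beta;
        this.z = new int[docs.length][];
        this.nw = new int[vocabSize][numTopics];
        this.nd = new int[docs.length][numTopics];
        this.nwsum = new int[numTopics];
        this.rng = new Random(seed);
        // Start from a uniformly random topic assignment for every word.
        for (int m = 0; m < docs.length; m++) {
            z[m] = new int[docs[m].length];
            for (int n = 0; n < docs[m].length; n++) {
                int k = rng.nextInt(numTopics);
                z[m][n] = k;
                nw[docs[m][n]][k]++;
                nd[m][k]++;
                nwsum[k]++;
            }
        }
    }

    // One full Gibbs sweep: resample the topic of every word in the corpus.
    void sweep() {
        double[] p = new double[numTopics];
        for (int m = 0; m < docs.length; m++) {
            for (int n = 0; n < docs[m].length; n++) {
                int w = docs[m][n];
                int old = z[m][n];
                // Remove the current assignment from the counts.
                nw[w][old]--; nd[m][old]--; nwsum[old]--;
                // Unnormalized full conditional for each topic.
                double total = 0.0;
                for (int k = 0; k < numTopics; k++) {
                    p[k] = (nw[w][k] + beta) / (nwsum[k] + vocabSize * beta)
                         * (nd[m][k] + alpha);
                    total += p[k];
                }
                // Draw a topic by inverting the cumulative distribution.
                double u = rng.nextDouble() * total;
                int k = 0;
                double acc = p[0];
                while (u >= acc && k < numTopics - 1) { k++; acc += p[k]; }
                // Record the new assignment.
                z[m][n] = k;
                nw[w][k]++; nd[m][k]++; nwsum[k]++;
            }
        }
    }

    // Point estimate of the document-topic mixtures from the current counts.
    double[][] theta() {
        double[][] t = new double[docs.length][numTopics];
        for (int m = 0; m < docs.length; m++)
            for (int k = 0; k < numTopics; k++)
                t[m][k] = (nd[m][k] + alpha) / (docs[m].length + numTopics * alpha);
        return t;
    }

    // Point estimate of the topic-word distributions from the current counts.
    double[][] phi() {
        double[][] f = new double[numTopics][vocabSize];
        for (int k = 0; k < numTopics; k++)
            for (int w = 0; w < vocabSize; w++)
                f[k][w] = (nw[w][k] + beta) / (nwsum[k] + vocabSize * beta);
        return f;
    }

    public static void main(String[] args) {
        // Toy corpus (made up): 3 documents over a 4-word vocabulary.
        int[][] docs = { {0, 1, 0, 1}, {2, 3, 2, 3}, {0, 1, 2, 3} };
        TinyLdaGibbs lda = new TinyLdaGibbs(docs, 4, 2, 0.5, 0.1, 42L);
        for (int i = 0; i < 200; i++) lda.sweep();
        for (double[] row : lda.theta()) System.out.println(Arrays.toString(row));
    }
}
```

The sketch reads the estimates from a single sample after a fixed number of sweeps. A more careful sampler would typically discard a burn-in period and average theta and phi over several spaced samples.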