fastTopics is an R package implementing fast, scalable optimization algorithms for fitting topic models (“grade of membership” models) and non-negative matrix factorizations to count data. The methods exploit the special relationship between the multinomial topic model (also known as “probabilistic latent semantic indexing”) and Poisson non-negative matrix factorization. The package provides tools to compare, annotate and visualize model fits, including functions to efficiently create “structure plots” and identify key features in topics. The fastTopics package is a successor to the CountClust package.
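The relationship between the two models can be illustrated with a few lines of base R. This is a minimal sketch, not the package's implementation: the matrix names (`loadings`, `factors`) and dimensions are made up for illustration, and it assumes the usual parameterization in which the Poisson rates are the entries of `loadings %*% t(factors)`. Rescaling a Poisson NMF so that each factor sums to 1 and each row of loadings sums to 1 gives the parameters of a multinomial topic model, with the row scale factors playing the role of the expected document sizes.

```r
# Hypothetical Poisson NMF parameters: n samples, m features, K topics.
set.seed(1)
n <- 5; m <- 8; K <- 3
loadings <- matrix(runif(n * K), n, K)  # n x K, non-negative
factors  <- matrix(runif(m * K), m, K)  # m x K, non-negative

# Rescale so that each column of the factors matrix sums to 1,
# moving the scale into the loadings.
u       <- colSums(factors)
Fstar   <- sweep(factors, 2, u, "/")    # word probabilities per topic
Lscaled <- sweep(loadings, 2, u, "*")

# Normalize each row of the loadings to get topic proportions;
# s[i] is the expected total count for sample i.
s     <- rowSums(Lscaled)
Lstar <- Lscaled / s                    # divides row i by s[i]

# Poisson rates and multinomial probabilities differ only by s.
rates <- tcrossprod(loadings, factors)  # n x m Poisson rates
probs <- tcrossprod(Lstar, Fstar)       # n x m multinomial probabilities
stopifnot(isTRUE(all.equal(rates, s * probs)))
```

Because the two parameterizations describe the same fitted rates up to the per-sample scale `s`, a topic model can be obtained by fitting the Poisson NMF first and rescaling afterwards.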
If you find a bug, or you have a question or feedback on this software, please post an issue.
If you find the fastTopics package or any of the source code in this repository useful for your work, please cite:
Kushal K. Dey, Chiaowen Joyce Hsiao and Matthew Stephens (2017). Visualizing the structure of RNA-seq expression data using grade of membership models. PLoS Genetics 13, e1006599.
Peter Carbonetto, Kevin Luo, Kushal Dey, Joyce Hsiao and Matthew Stephens (2021). fastTopics: fast algorithms for fitting topic models and non-negative matrix factorizations to count data. R package version 0.4-11. https://github.com/stephenslab/fastTopics
Copyright (c) 2019-2021, Peter Carbonetto and Matthew Stephens.
All source code and software in this repository are made available under the terms of the MIT license.
Install and load the package:
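One way to do this, assuming you install the development version from the GitHub repository cited above using the remotes package (this route is an assumption; use whichever installation method matches your setup):

```r
# Assumption: install from the GitHub repository stephenslab/fastTopics
# with the remotes package.
install.packages("remotes")
remotes::install_github("stephenslab/fastTopics")

# Load the package into the current R session.
library(fastTopics)
```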
Note that installing the package will require a C++ compiler setup that is appropriate for the version of R installed on your computer. For details, refer to the documentation on the CRAN website.
Also, try running the small example that illustrates the fast model fitting algorithms:
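A minimal sketch, assuming the small example referred to here is the one attached to the help page for `fit_poisson_nmf`, the package's function for fitting a Poisson non-negative matrix factorization:

```r
library(fastTopics)

# Assumption: the "small example" is the Examples section of
# ?fit_poisson_nmf; example() runs that code in the current session.
example("fit_poisson_nmf")
```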
See the package documentation for more information.