Goldwater, S., Griffiths, T. L., Johnson, M. (2011). Producing power-law distributions and damping word frequencies with two-stage language models. Journal of Machine Learning Research, 12, 2335-2382. (pdf)
Goldwater, S., Griffiths, T. L., & Johnson, M. (2007). Distributional cues to word segmentation: Context is important. Proceedings of the 31st Boston University Conference on Language Development.(pdf)
Goldwater, S., & Griffiths, T. L. (2007). A fully Bayesian approach to unsupervised part-of-speech tagging. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL'07).(pdf)
Johnson, M., Griffiths, T. L., & Goldwater, S. (2007). Bayesian inference for PCFGs via Markov chain Monte Carlo. Proceedings of the North American Conference on Computational Linguistics (NAACL'07).(pdf)
Goldwater, S., Griffiths, T. L., & Johnson, M. (2006). Interpolating between types and tokens by estimating power law generators. Advances in Neural Information Processing Systems 18. (pdf) (note: this version of the paper is slightly modified from the hardcopy proceedings)
Goldwater, S., Griffiths, T. L., & Johnson, M. (2006). Contextual dependencies in unsupervised word segmentation. Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics.(pdf)