
Implications of capacity-limited, generative models for human vision

Published online by Cambridge University Press: 06 December 2023

Joseph Scott German
Affiliation:
Department of Cognitive Science, University of California, San Diego, La Jolla, CA, USA jgerman@ucsd.edu
Robert A. Jacobs
Affiliation:
Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY, USA rjacobs@ur.rochester.edu https://www2.bcs.rochester.edu/sites/jacobslab/people.html

Abstract

Although discriminative deep neural networks are currently dominant in cognitive modeling, we suggest that capacity-limited, generative models are a promising avenue for future work. Generative models tend to learn both local and global features of stimuli and, when properly constrained, can learn componential representations and response biases found in people's behaviors.

Type: Open Peer Commentary
Copyright: © The Author(s), 2023. Published by Cambridge University Press

The target article offers cogent criticisms of deep neural networks (DNNs) as models of human cognition. Although discriminative DNNs are currently dominant in cognitive modeling, other approaches are needed if we are to achieve a satisfactory understanding of human cognition. We suggest that generative models are a promising avenue for future work, particularly capacity-limited, generative models designed around componential representations (e.g., part-based representations of visual objects and scenes).

A generative model learns a joint distribution over visible (i.e., observed) and hidden (latent) variables. Importantly, many generative models allow us to sample from the learned distribution, producing “synthetic” examples of the concept modeled by the distribution. Making inferences about external stimuli with a generative model is then a matter of identifying the hidden-variable settings most likely to have produced those stimuli. By their very nature, generative models neatly sidestep many of the issues with discriminative models described in the target article.
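
To make the joint-distribution idea concrete, here is a minimal sketch in Python of a toy generative model: a two-component Gaussian mixture with a hidden cause z and an observed stimulus x. Sampling from the joint produces synthetic stimuli, and inference inverts the model via Bayes' rule. All names and parameter values here are illustrative assumptions, not details of any model discussed in this commentary.

```python
# A minimal sketch of a generative model: a two-component Gaussian
# mixture over a one-dimensional stimulus x with a hidden cause z.
import numpy as np

rng = np.random.default_rng(0)

# Prior over the hidden variable z and likelihoods p(x | z).
prior = np.array([0.5, 0.5])     # p(z = 0), p(z = 1)
means = np.array([-2.0, 2.0])    # mean of x under each z
sigma = 1.0                      # shared standard deviation

def sample(n):
    """Sample from the joint p(z, x) = p(z) p(x | z)."""
    z = rng.choice(2, size=n, p=prior)
    x = rng.normal(means[z], sigma)
    return z, x

def infer(x):
    """Posterior p(z | x) by Bayes' rule: identify which hidden
    cause most likely produced the observed stimulus."""
    like = np.exp(-0.5 * ((x - means) / sigma) ** 2)  # unnormalized
    post = prior * like
    return post / post.sum()

_, xs = sample(5)    # "synthetic" examples of the modeled concept
print(infer(1.5))    # e.g., strongly favors the hidden cause z = 1
```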

Most obviously, and perhaps most importantly, generative models are typically judged not on predictive performance but on their ability to synthesize examples of concepts. Synthesis requires a more profound understanding of a concept than mere discrimination does, potentially yielding task-general representations capable of explaining far more of human perceptual and cognitive reasoning. For example, discriminative models trained to categorize images tend to base their decisions on texture patches and local shape, whereas humans rely on global shape. A successful generative model, by contrast, must capture global object shape, as otherwise its samples would not be realistic. Inference in such a model would therefore be sensitive to object shape as a matter of course, along with other properties that a discriminatively trained model might ignore.

Large DNNs also fail to capture another important feature of human cognition: capacity limits. People cannot remember all aspects of a visual environment, and so human vision must be selective and efficient. By contrast, DNNs often contain billions of adaptable parameters, giving them enormous learning, representational, and processing capacities that, as the target article notes, stand in stark contrast to the dramatically limited capacities of biological vision. The need for efficiency underlies people's attentional and memory biases. People are biased toward “filling in” missing features (i.e., features not attended or remembered) with values that are highly frequent in the environment. In addition, people are biased toward attending to and remembering the features most relevant to their current goal, thereby maximizing task performance.

Bates, Lerch, Sims, and Jacobs (2019) experimentally evaluated these biases using an optimal model of capacity-limited visual working memory (VWM) based on “rate-distortion theory” (RDT; see Sims, Jacobs, & Knill, 2012). Both biases were predicted by the RDT model: An optimal VWM should be biased toward allocating its limited memory resources toward high-probability feature values and toward task-relevant features. Bates and Jacobs (2021) studied people's responses in the domain of visual search and attention. The RDT model predicted important aspects of these responses, including “set-size” effects indicative of limited capacity, aspects not accounted for by a model based on Bayesian decision theory.
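
The core RDT prediction, that a capacity-limited memory should devote its resources to high-probability feature values, can be illustrated with the standard Blahut–Arimoto algorithm for rate-distortion optimization. The sketch below is a toy construction, not the Bates et al. model; the source distribution, distortion function, and trade-off parameter are all assumptions made for illustration.

```python
import numpy as np

# Source distribution over feature values: one value is far more
# frequent, mimicking a high-probability environmental feature.
p_x = np.array([0.7, 0.1, 0.1, 0.1])
vals = np.array([0.0, 1.0, 2.0, 3.0])

# Squared-error distortion between each source value and each
# possible reconstruction.
d = (vals[:, None] - vals[None, :]) ** 2

beta = 1.0                  # trade-off: larger beta buys lower distortion
q_xhat = np.full(4, 0.25)   # initial marginal over reconstructions

for _ in range(200):        # Blahut-Arimoto iterations
    # Optimal channel p(xhat | x) given the current marginal.
    q_cond = q_xhat * np.exp(-beta * d)
    q_cond /= q_cond.sum(axis=1, keepdims=True)
    # Marginal over reconstructions induced by that channel.
    q_xhat = p_x @ q_cond

# Rows are p(xhat | x); with limited capacity (small beta),
# reconstructions are biased toward the frequent value 0.0,
# a "filling-in" bias of exactly the kind described above.
print(np.round(q_cond, 3))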

In accord with these ideas, a popular form of generative model, the “variational autoencoder” (VAE), uses a loss function during training that penalizes large growth in capacity. A VAE maps an input through one or more hidden layers, with a capacity penalty at one of the layers, to an output layer that attempts to reconstruct the input. Reconstructions are typically imperfect because of the “lossy” representations at the capacity-restricted “bottleneck” hidden layer. Machine learning researchers have established important mathematical relationships between VAEs and RDT (Alemi et al., 2017, 2018; Ballé, Laparra, & Simoncelli, 2016; Burgess et al., 2018). Bates and Jacobs (2020) used VAEs to model biases and set-size effects in human visual perception and memory. We believe this is an encouraging early step toward developing capacity-limited, generative models of human vision.
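
As a concrete sketch of such a capacity penalty, the following code (our own minimal construction, assuming PyTorch; the layer sizes and beta value are illustrative) implements a tiny VAE whose loss adds a weighted KL term at the bottleneck, in the spirit of the beta-VAE objectives analyzed in the works cited above.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyVAE(nn.Module):
    """A minimal VAE with a capacity-penalized bottleneck."""
    def __init__(self, x_dim=784, z_dim=8):
        super().__init__()
        self.enc = nn.Linear(x_dim, 2 * z_dim)  # outputs mu and log-variance
        self.dec = nn.Linear(z_dim, x_dim)

    def forward(self, x):
        mu, logvar = self.enc(x).chunk(2, dim=-1)
        # Reparameterization trick: sample the bottleneck code.
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        return self.dec(z), mu, logvar

def vae_loss(x, x_hat, mu, logvar, beta=4.0):
    # Reconstruction error: the "distortion" of the lossy code.
    recon = F.mse_loss(x_hat, x, reduction="sum")
    # KL divergence from the prior: an upper bound on the "rate"
    # (information capacity) of the bottleneck.
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + beta * kl  # beta > 1 tightens the capacity limit

x = torch.rand(32, 784)       # a batch of toy "images"
model = TinyVAE()
x_hat, mu, logvar = model(x)
print(vae_loss(x, x_hat, mu, logvar).item())
```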

The desire for efficient representations also leads to componential, or part-based, approaches, and generative models naturally lend themselves to understanding concepts in terms of parts and the relationships between them, as humans do (in contrast to DNNs, as the target article points out, citing German & Jacobs, 2020, and Erdogan & Jacobs, 2017). The same basic parts can be used to create a wide variety of distinct objects simply by changing the relationships between them, the basis of many perceptual and cognitive models such as Biederman's (1987). Learning new object concepts thereby becomes more efficient: once a part has been learned, it can be reused in the representation and construction of any object concept that contains it, including novel ones. This idea can be extended further by supposing that parts are made of subparts, and so on, producing hierarchical, componential generative models (e.g., Lake, Salakhutdinov, & Tenenbaum, 2015; Nash & Williams, 2017).
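
To illustrate part reuse, here is a small Python sketch of our own devising, not any cited model: a toy library of reusable part templates is composed, via assumed relation offsets, into objects, so that two distinct object concepts share the same parts and differ only in the relations between them.

```python
import numpy as np

rng = np.random.default_rng(1)

# A small library of reusable parts; each part is a 2-D polyline
# template. Once learned, the same parts compose into many objects.
PARTS = {
    "bar":  np.array([[0.0, 0.0], [2.0, 0.0]]),
    "hook": np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 1.0]]),
}

def generate(parts_and_offsets, jitter=0.05):
    """Sample an object: place each part at its relative offset,
    then add per-vertex noise. The relations (offsets) carry the
    object's identity; the parts themselves are shared."""
    pieces = []
    for name, offset in parts_and_offsets:
        noise = rng.normal(0.0, jitter, PARTS[name].shape)
        pieces.append(PARTS[name] + np.array(offset) + noise)
    return np.vstack(pieces)

# Two different object concepts built from the same two parts,
# differing only in the relations between them.
obj_a = generate([("bar", (0, 0)), ("hook", (2, 0))])
obj_b = generate([("hook", (0, 0)), ("bar", (0, 2))])
print(obj_a.shape, obj_b.shape)
```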

To be sure, a capacity-limited, generative approach is not going to “solve” cognitive modeling overnight. It still faces major obstacles such as computationally expensive inference and a lack of objective criteria with which to judge the quality of its synthesized instances. However, we are optimistic that these issues can be resolved, and we hope the target article inspires researchers to look beyond the established discriminative DNN paradigm. Perhaps if capacity-limited, generative models receive as much research attention and development as discriminative models have, we can look forward to significant advances in both computational cognitive modeling and machine learning.

Financial support

This work was funded by NSF research grants BCS-1824737 and DRL-1561335.

Competing interest

None.

References

Alemi, A. A., Poole, B., Fischer, I., Dillon, J. V., Saurous, R. A., & Murphy, K. (2017). An information-theoretic analysis of deep latent variable models. arXiv preprint arXiv:1711.00464. Retrieved from https://arxiv.org/pdf/1711.00464v1.pdf
Alemi, A. A., Poole, B., Fischer, I., Dillon, J. V., Saurous, R. A., & Murphy, K. (2018). Fixing a broken ELBO. arXiv preprint arXiv:1711.00464v3. Retrieved from https://arxiv.org/pdf/1711.00464v3.pdf
Ballé, J., Laparra, V., & Simoncelli, E. P. (2016). End-to-end optimized image compression. arXiv preprint arXiv:1611.01704. Retrieved from https://arxiv.org/pdf/1611.01704.pdf
Bates, C. J., & Jacobs, R. A. (2020). Efficient data compression in perception and perceptual memory. Psychological Review, 127, 891–917.
Bates, C. J., & Jacobs, R. A. (2021). Optimal attentional allocation in the presence of capacity constraints in uncued and cued visual search. Journal of Vision, 21(5), 3, 1–23.
Bates, C. J., Lerch, R. A., Sims, C. R., & Jacobs, R. A. (2019). Adaptive allocation of human visual working memory capacity during statistical and categorical learning. Journal of Vision, 19(2), 11, 1–23.
Biederman, I. (1987). Recognition-by-components: A theory of human image understanding. Psychological Review, 94(2), 115–147.
Burgess, C. P., Higgins, I., Pal, A., Matthey, L., Watters, N., Desjardins, G., … Lerchner, A. (2018). Understanding disentangling in β-VAE. arXiv preprint arXiv:1804.03599. Retrieved from https://arxiv.org/pdf/1804.03599.pdf
Erdogan, G., & Jacobs, R. A. (2017). Visual shape perception as Bayesian inference of 3D object-centered shape representations. Psychological Review, 124, 740–761.
German, J. S., & Jacobs, R. A. (2020). Can machine learning account for human visual object shape similarity judgments? Vision Research, 167, 87–99.
Lake, B. M., Salakhutdinov, R., & Tenenbaum, J. B. (2015). Human-level concept learning through probabilistic program induction. Science, 350(6266), 1332–1338.
Nash, C., & Williams, C. K. I. (2017). The shape variational autoencoder: A deep generative model of part-segmented 3D objects. Eurographics Symposium on Geometry Processing, 36(5), 1–11.
Sims, C. R., Jacobs, R. A., & Knill, D. C. (2012). An ideal observer analysis of visual working memory. Psychological Review, 119, 807–830.