We present a new, fully generative model for constructing astronomical catalogs from optical telescope image sets. Each pixel intensity is treated as a random variable with parameters that depend on the latent properties of stars and galaxies. These latent properties are themselves modeled as random. We compare two procedures for posterior inference. One procedure is based on Markov chain Monte Carlo (MCMC) while the other is based on variational inference (VI). The MCMC procedure excels at quantifying uncertainty, while the VI procedure is 1000 times faster. On a supercomputer, the VI procedure efficiently uses 665,000 CPU cores to construct an astronomical catalog from 50 terabytes of images in 14.6 minutes, demonstrating the scaling characteristics necessary to construct catalogs for upcoming astronomical surveys.
@article{regier2019approximate, year = {2019}, title = {APPROXIMATE INFERENCE FOR CONSTRUCTING ASTRONOMICAL CATALOGS FROM IMAGES}, author = {Regier, Jeffrey and Miller, Andrew C and Schlegel, David and Adams, Ryan P and Mcauliffe, Jon D}, journal = {The Annals of Applied Statistics}, volume = {13}, number = {3}, pages = {1884--1926}, publisher = {JSTOR} }